From the course: NLP with Python for Machine Learning Essential Training

Unlock the full course today

Join today to access over 24,400 courses taught by industry experts.

Machine learning pipeline

Machine learning pipeline

- [Instructor] Up to this point, we've learned some basics of NLP and NLTK. We've learned how to read in messy text, and we've learned how to use regular expressions to search for and manipulate that text. In this lesson, we'll take a step back to understand how this all fits together in the broader machine learning pipeline before we dive into each step individually. This lesson is going to introduce some new topics as well. But don't worry, we'll cover each of these topics in later lessons. This is meant only to provide the proper context for how this all fits together. In a typical machine learning text pipeline, you'll start with some document with raw text in it, like the SMS data set that we're working with. It's important to note that at this stage, the computer has no idea what it's looking at. All it sees is a collection of characters. It doesn't know the word ham from the word spam. The characters mean nothing. It doesn't even know a space from a number or a letter. They're…

Contents