From the course: NLP with Python for Machine Learning Essential Training
Unlock the full course today
Join today to access over 24,400 courses taught by industry experts.
Machine learning pipeline - Python Tutorial
From the course: NLP with Python for Machine Learning Essential Training
Machine learning pipeline
- [Instructor] Up to this point, we've learned some basics of NLP and NLTK. We've learned how to read in messy text, and we've learned how to use regular expressions to search for and manipulate that text. In this lesson, we'll take a step back to understand how this all fits together in the broader machine learning pipeline before we dive into each step individually. This lesson is going to introduce some new topics as well. But don't worry, we'll cover each of these topics in later lessons. This is meant only to provide the proper context for how this all fits together. In a typical machine learning text pipeline, you'll start with some document with raw text in it, like the SMS data set that we're working with. It's important to note that at this stage, the computer has no idea what it's looking at. All it sees is a collection of characters. It doesn't know the word ham from the word spam. The characters mean nothing. It doesn't even know a space from a number or a letter. They're…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
(Locked)
What are NLP and NLTK?4m 7s
-
(Locked)
NLTK setup and overview6m 15s
-
(Locked)
Reading in text data11m 41s
-
(Locked)
Exploring the dataset6m 56s
-
(Locked)
What are regular expressions?4m 8s
-
(Locked)
Learning how to use regular expressions8m 44s
-
(Locked)
Regular expression replacements6m 3s
-
(Locked)
Machine learning pipeline4m 45s
-
(Locked)
Implementation: Removing punctuation9m 10s
-
(Locked)
Implementation: Tokenization3m 37s
-
(Locked)
Implementation: Removing stop words4m 2s
-
(Locked)
-
-
-
-
-