From the course: Build Three Real-World Python Applications
Unlock the full course today
Join today to access over 24,400 courses taught by industry experts.
How to clean text data - Python Tutorial
From the course: Build Three Real-World Python Applications
How to clean text data
- Many times when you bring in text data to analyze, it is very messy. It may contain misspelled words, words you don't want to analyze like the and and, or contain texts like chapter numbers that don't provide you value. I will show you how to clean text data in three different ways for our analysis. Remember that you will need to execute the code at the top of each file from here onwards in order to properly execute the code for each video. For our word cloud we will do a simple cleaning of our text to visualize our popular words. We will create a variable called word_cloud_text and equal it to that great_expect variable we had earlier and use the .lower method on it. This will lowercase all texts to make sure a capital soon is the same as a lowercase soon. Now we will type in our word_cloud_text variable again and equal it to this code where it allows us to remove any numbers in alpha numeric words that we don't need…
Contents
-
-
-
-
-
What is text analysis?2m 12s
-
(Locked)
Introducing Project Gutenberg and "Great Expectations"2m 5s
-
(Locked)
How to pull text data3m 33s
-
(Locked)
How to clean text data6m 34s
-
(Locked)
How to create a word cloud3m 39s
-
(Locked)
How to improve a word cloud3m 47s
-
(Locked)
How to analyze word frequency4m 10s
-
(Locked)
How to perform Vader sentiment analysis7m 28s
-
(Locked)
How to build a corpus and dictionary for topic modeling4m 36s
-
(Locked)
How to perform topic modeling6m 46s
-
(Locked)
Challenge: Analyze "Sense and Sensibility"2m 53s
-
(Locked)
Solution: Analyze "Sense and Sensibility"4m 55s
-
-