From the course: Data Science and Analytics Career Paths and Certifications: First Steps
Data mining and analytics
- [Instructor] Data mining and analytics involve a myriad of data manipulation techniques. Text retrieval is one of the best-known data mining techniques. It builds on many foundational concepts and methods developed in natural language processing, or NLP. Classification constructs a model that assigns data objects to specific categories. In a classification model, the classes, each with its own label, are discrete in nature. For instance, a classification model can categorize the users of an online banking system as trustworthy or untrustworthy. Prediction builds a model that produces continuous or ordered values that form a trend. For instance, a prediction model can provide estimated mean time to failure, or MTTF, values for a computer. Clustering is the process of grouping similar data objects into classes. Clustering helps reveal features that distinguish one class of data objects from…
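The difference between classification, prediction, and clustering described above can be sketched in a few lines of Python. This is only a minimal illustration, not the course's own material: the data values, the choice of features (failed logins, operating temperature), and the function names `fit_centroids`, `classify`, `fit_line`, and `kmeans_1d` are all assumptions made up for the example. A nearest-centroid rule stands in for a classifier, a least-squares line for a prediction model, and a one-dimensional k-means loop for clustering.

```python
from statistics import mean

# All data below is hypothetical and exists only to illustrate the three ideas.

def fit_centroids(samples):
    """Classification, step 1: average the feature value for each known label."""
    by_label = {}
    for value, label in samples:
        by_label.setdefault(label, []).append(value)
    return {label: mean(values) for label, values in by_label.items()}

def classify(centroids, value):
    """Classification, step 2: return the discrete label with the nearest centroid."""
    return min(centroids, key=lambda label: abs(centroids[label] - value))

def fit_line(points):
    """Prediction: least-squares fit of y = slope * x + intercept."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in points)
             / sum((x - x_bar) ** 2 for x in xs))
    return slope, y_bar - slope * x_bar

def kmeans_1d(values, centers, iterations=10):
    """Clustering: group unlabeled values around centers that move each round."""
    groups = []
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Keep the old center if a group ends up empty.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return groups

# Classification: failed-login counts with known labels (hypothetical).
labeled_users = [(0, "trustworthy"), (1, "trustworthy"),
                 (6, "untrustworthy"), (8, "untrustworthy")]
centroids = fit_centroids(labeled_users)
label_for_new_user = classify(centroids, 5)

# Prediction: (operating temperature in C, observed hours to failure), hypothetical.
slope, intercept = fit_line([(30, 1000), (40, 900), (50, 800)])
estimated_mttf_hours = slope * 45 + intercept

# Clustering: no labels are given; the groups come from the data alone.
clusters = kmeans_1d([1, 2, 3, 10, 11, 12], centers=[1, 12])

print(label_for_new_user, estimated_mttf_hours, clusters)
```

The classifier returns one of a fixed set of labels, the fitted line returns a number on a continuous scale, and the clustering step receives no labels at all and has to discover the groups itself.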