Data needs to be pre-processed before it can be used for classification. Learn about the steps need to pre-process text data.
- [Instructor] Let's prepare some data for classification. … For this exercise, we use two source text files. … They're under the directory classification … in the Exercise folder. … First, we have the Course Descriptions.txt file … which contains a list of course descriptions … for various technology courses. … This is our feature variable set. … Each line in this file contains the course description … for a specific course. … In order to train the model, … we also need to tag these course descriptions … with specific classes. … For this, we have another document, … Course Classification.txt … that lists the classes … for each of these course descriptions. … The line numbers between the descriptions … and the classes match one to one. … The code for this exercise is available … in the file code_04_XX Classification.R. … In order to prepare data for classification, … we need to build TF-IDF matrix. … So we first load the Course Description.txt file … into a VCorpus. … Let's load it and inspect the data. …
- Creating a word cloud
- Analyzing sentiment
- Extracting emotions from text
- Clustering similar entities based on text
- Using classification for supervised learning
- Recommending items to users based on text data analytics
Skill Level Intermediate
Predictive Customer Analyticswith Kumaran Ponnambalam1h 37m Intermediate
1. Word Cloud
2. Sentiment Analysis
5. Predictive Text
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.