In this video, learn about the stemming process and use Python to perform this action.
- [Instruction] In this video I will discuss stemming, … a key processing step in text mining. … What is stemming? … To understand that we need to define a stem. … A stem is the base part of a word … to which affixes can be attached for derivatives. … For example, the word combine is the stem for … combine, combining, and combined. … The first part of these words are common. … The different words represent different grammatical elements … with the same meaning. … Stemming is a process that converts a word into its stem. … It keeps the base word. … As a result, the total unique words in the corpus goes down … and the words with similar meaning can be grouped together. … Stemming simply cuts off the affix, … so it may not result in a complete word. … In this example, we will do stemming … by using the PorterStemmer function available in NLTK. … Each word in the remaining token list … is passed through the stemmer, … which will give back the stemmed representation of the word. … The results are collected in another list …
- Text mining today
- Reading text files using Python
- Cleansing text data
- Build n-grams databases for text predictions
- Preparing TF-IDF matrices for machine learning
- Scaling text processing for performance
Skill Level Intermediate
Processing Text with R Essential Trainingwith Kumaran Ponnambalam55m 57s Intermediate
1. Text Mining
2. Reading Text
3. Text Cleansing and Extraction
4. Advanced Text Processing
5. Best Practices
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.