In this video, learn about the lemmatization process and use Python to perform this action.
- [Instructor] Lemmatization … is another important step in text mining. … What is lemmatization? … Lemmetization is similar to stemming, … but it produces a proper root word … that belongs to the language. … For example, combine is the lemmatized version … of combine, combined, and combining. … As opposed to stemming, which produced an incomplete word, … combine is a proper English word. … Lemmatization uses a dictionary … to match words to their root word. … It is a more expensive operation than stemming … because of the dictionary and resources vitamins. … For our example with lemmatization, we will … use the WordNet Dictionary and the WordNet Lemmatizer. … Each token in the original token list4 … are passed through the lemmatizer, … which returns the lemmatized string. … The results are printed. … Let us execute the code and review the results. … You will notice that words like engine and devops … are now complete as opposed to stemming, … which produced incomplete word. … To review an example, we will compare the raw, stemmed, …
- Text mining today
- Reading text files using Python
- Cleansing text data
- Build n-grams databases for text predictions
- Preparing TF-IDF matrices for machine learning
- Scaling text processing for performance
Skill Level Intermediate
Processing Text with R Essential Trainingwith Kumaran Ponnambalam55m 57s Intermediate
1. Text Mining
2. Reading Text
3. Text Cleansing and Extraction
4. Advanced Text Processing
5. Best Practices
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.