From the course: Advanced NLP with Python for Machine Learning

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

What makes doc2vec powerful?

What makes doc2vec powerful? - Python Tutorial

From the course: Advanced NLP with Python for Machine Learning

Start my 1-month free trial

What makes doc2vec powerful?

- [Instructor] Doc2Vec converts sentences or paragraphs into a single numeric vector. Now, why is that more useful than what we saw with Word2Vec before? Remember we saw these interesting mathematical properties of the individual word vectors? You can do the same thing with document vectors, but the outcome is not as clean or easily illustrated. This was an interesting property for Word2Vec and very useful in some applications, but not necessarily for ours. Where Word2Vec does fall a little bit short is when you have to average the word vectors across a sentence to prepare it to be used for machine learning. The goal is to capture the information contained within the sentence, but averaging numbers is a very naive way to capture information about a group of numbers. When you average numbers, you inherently lose information. You're taking X numbers and trying to represent them with one number. The real benefit of Doc2Vec…

Contents