From the course: Learning Public Data Sets

Unlock the full course today

Join today to access over 22,700 courses taught by industry experts or purchase this course individually.

Google Ngram Viewer

Google Ngram Viewer - Microsoft Excel Tutorial

From the course: Learning Public Data Sets

Start my 1-month free trial

Google Ngram Viewer

- [Instructor] In this movie, I would like to point you to the Google Ngram Viewer. An Ngram is a string of characters of a given length. So a two-character string is a bigram. A three-character string is a trigram and so on. If you perform linguistic analysis and want to search usage in books published from 1800 to about 2008, the Google Ngram Viewer is a great tool to keep in mind. The URL for this resource is books.google.com/ngrams and you can see the interface here. You can set your search values for years so for example between 1800 and 2000. I'll go ahead and take that up to 2008 which is the last year for which data is available. We'll look at English and we'll leave the smoothing factor of three which is the default. You can play around with it if you think it makes a difference. And you can see in the comma-separated phrases box that we have Albert Einstein, Sherlock Holmes, and Frankenstein and those are separated by commas. I haven't updated my search so this data only…

Contents