From the course: Learning Public Data Sets
Unlock the full course today
Join today to access over 22,700 courses taught by industry experts or purchase this course individually.
Google Ngram Viewer - Microsoft Excel Tutorial
From the course: Learning Public Data Sets
Google Ngram Viewer
- [Instructor] In this movie, I would like to point you to the Google Ngram Viewer. An Ngram is a string of characters of a given length. So a two-character string is a bigram. A three-character string is a trigram and so on. If you perform linguistic analysis and want to search usage in books published from 1800 to about 2008, the Google Ngram Viewer is a great tool to keep in mind. The URL for this resource is books.google.com/ngrams and you can see the interface here. You can set your search values for years so for example between 1800 and 2000. I'll go ahead and take that up to 2008 which is the last year for which data is available. We'll look at English and we'll leave the smoothing factor of three which is the default. You can play around with it if you think it makes a difference. And you can see in the comma-separated phrases box that we have Albert Einstein, Sherlock Holmes, and Frankenstein and those are separated by commas. I haven't updated my search so this data only…
Contents
-
-
-
-
-
-
(Locked)
Google Finance2m 39s
-
(Locked)
Google Trends4m 8s
-
(Locked)
Google Public Data Explorer3m 33s
-
(Locked)
Amazon Web Services Public Data Sets2m 36s
-
(Locked)
Data.gov3m 56s
-
(Locked)
Google Ngram Viewer3m 24s
-
(Locked)
The Corpus of Contemporary American English3m 5s
-
(Locked)
University of California, Irvine, Center for Machine Learning5m 45s
-
(Locked)
Pew Research Center4m 8s
-
(Locked)
-