From the course: Data Science Foundations: Data Mining in Python

Iliad dataset

- [Instructor] Text mining is a different kind of data science, so we're going to be working with a different kind of data. To introduce this section on text mining, I want to show you the dataset we'll be using: the text of "The Iliad," the ancient Greek epic by Homer. We're getting "The Iliad" from Project Gutenberg, which provides over 60,000 free eBooks. There are a lot of other options there, and we'll try some for the challenge, but right now let's use the plain text version of "The Iliad" that I've already downloaded and saved in our data folder. To see it, all we need to do is import pandas and call read_csv, even though we're actually reading a text file. We're going to set the separator, drop the empty lines, remove what's called the Gutenberg ID, and look at just the first 10 lines, and what you…
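The loading steps described above can be sketched roughly as follows. This is a minimal illustration, not the course's actual notebook code: the transcript does not show the file's path or layout, so the sketch assumes a tab-separated file with a header row and two columns named gutenberg_id and text, and it builds a small placeholder file in a temporary folder so that it runs on its own. The helper name load_text_lines, the sample rows, and the ID value are all hypothetical.

```python
import csv
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical stand-in for the saved file. ASSUMPTION: a header row, then
# one line of text per row with an ID column in front, separated by a tab.
# The ID value and the text lines are placeholders, not the real data.
sample = (
    "gutenberg_id\ttext\n"
    "1\tfirst placeholder line\n"
    "1\t\n"
    "1\tsecond \"quoted\" placeholder line\n"
    "1\t\n"
    "1\tthird placeholder line\n"
)


def load_text_lines(path):
    """Read a tab-separated text file and keep only its non-empty text lines."""
    df = pd.read_csv(
        path,
        sep="\t",                # split each row into ID and text at the tab
        quoting=csv.QUOTE_NONE,  # treat quotation marks in the text literally
    )
    df = df.dropna()                      # rows with no text are read as NaN
    df = df.drop(columns="gutenberg_id")  # keep only the text column
    return df.reset_index(drop=True)


# Write the placeholder file, then load it the same way a real file would be.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "sample.txt"
    path.write_text(sample, encoding="utf-8")
    lines = load_text_lines(path)

# Look at just the first 10 lines (the sample only has three).
print(lines.head(10))
```

Passing quoting=csv.QUOTE_NONE is a design choice for literary text: without it, an unmatched quotation mark in a line can cause the parser to merge several lines into one field. If the real file contains tab characters inside the text itself, a different separator or a plain line-by-line read would be needed.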
