Join Keith McCormick for an in-depth discussion in this video Using the exercise files, part of Machine Learning: Advanced Decision Trees.
- [Instructor] I've provided a handful of files for you in the Exercise Files folder. You won't need these for every video, so I'll refer to them when you need them. It's the train.csv file is our practice data involving the passengers of the Titanic. I'm going to show ya how to download that now. You can download it from a website called kaggle.com. So if you simply search for keywords kaggle titanic, you'll find it, and let's go to the page. You will have to sign-up for kaggle, but of course, it's completely free data, and you may actually find some of the supporting information interesting.
If you click on Get the Data, the only file that we need is this initial file, train, the csv file. After you download it, go ahead and put it in the Exercise Files folder, as I've done. If you want to click along, and I encourage you to do so, you'll want to get a copy of the IBM SPSS Modeler Trial. You'll have to get an IBM ID, but it's completely free, and the trial will last about 30 days, which should be more than enough time to work through the course.
- Understanding QUEST functions and applications
- C5.0 concepts and practical applications
- Understanding information gain
- Random forests
- Boosting and bagging
- Costs and priors