In this video, import the actual data from the City of Chicago, which is used for the rest of this course.
- [Instructor] Now it's really important to remember…that if it's been a couple of hours since you used…your notebook on Google Colab, you won't see…your files there anymore.…Google Colab is a virtual machine,…and it just gets recycled if you've…left it idle for a while.…So that's not a problem, we just need to re-run…a couple of the cells and we're done.…The start of each notebook will have a…Download and install Spark section.…You just need to run the ls command,…and if you see the spark files and the…Chicago's Reported Crime Data file there,…you don't need to download them again.…
As you know, comments and pythons…start with a hash character, and extend…to the end of the physical line.…I've put comments at the start of each line…so you don't download the data again by mistake.…If you do need to download the data,…all you need to do is remove the hash character…at the start of each line.…For each notebook, you will need to run…the setup environment section, and the downloading…and pre-processing Chicago's Reported Crime section.…
- Benefits of the Apache Spark ecosystem
- Working with the DataFrame API
- Working with columns and rows
- Leveraging built-in Spark functions
- Creating your own functions in Spark
- Working with Resilient Distributed Datasets (RDDs)
Skill Level Intermediate
1. Introduction to Apache Spark
2. Technical Setup
3. Working with the DataFrame API
5. Resilient Distributed Datasets (RDDs)
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.