Join Ben Sullins for an in-depth discussion in this video What you should know before watching this course, part of Apache Spark Essential Training.
- [Narrator] For this course, you should have a basic understanding of working with data in Python or SQL and have some familiarity with Hive or Hadoop. If you want to brush up on Hive beforehand checkout my other course, Analyzing Big Data with Hive. This is a beginner course, so it's not necessary to have any previous knowledge of Spark or Hadoop as I'll walk you through all of the concepts and terminology as we go through this course.
- Understanding Spark
- Reviewing Spark components
- Where Spark shines
- Understanding data interfaces
- Working with text files
- Loading CSV data into DataFrames
- Using Spark SQL to analyze data
- Running machine learning algorithms using MLib
- Querying streaming data
- Connecting BI tools to Spark
Skill Level Intermediate
1. Introducing Apache Spark
2. Analyzing Data in Spark
3. Using Spark SQL to Analyze Data
4. Running Machine Learning Algorithms Using MLlib
5. Real-Time Data Analysis with Spark Streaming
6. Connecting BI Tools to Spark
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.