From the course: Big Data Analytics with Hadoop and Apache Spark
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Apache Spark overview
From the course: Big Data Analytics with Hadoop and Apache Spark
Apache Spark overview
- [Instructor] In this video, I will review some of the salient features of Apache Spark. Apache Spark is an open source technology that started out as a more efficient alternative to Hadoop MapReduce. It is a large-scale distributed data processing engine. Spark stores its data primarily in memory to speed up computations. It has also grown to add a number of capabilities like batch processing, real-time streaming, machine learning and graphs. Spark can be programmed with Scala, Java, Python and R. It's performance features and programing support makes it the most popular big-data technology today.
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.