From the course: Big Data Analytics with Hadoop and Apache Spark

Apache Spark overview

- [Instructor] In this video, I will review some of the salient features of Apache Spark. Apache Spark is an open source technology that started out as a more efficient alternative to Hadoop MapReduce. It is a large-scale distributed data processing engine. Spark keeps its data primarily in memory to speed up computations. It has also grown to cover a number of capabilities, such as batch processing, real-time streaming, machine learning and graph processing. Spark can be programmed with Scala, Java, Python and R. Its performance features and programming support make it one of the most popular big data technologies today.
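
To make the in-memory point concrete, here is a toy sketch in plain Python. It does not use Spark or Hadoop; the function names and sample lines are hypothetical, chosen only for illustration. It runs the same two-stage word count twice: once handing the intermediate result between stages through a file on disk (roughly the MapReduce pattern), and once passing it directly in memory (roughly the idea Spark builds on).

```python
import json
import os
import tempfile
from collections import Counter

# Hypothetical sample input, used only for this illustration.
LINES = [
    "spark keeps data in memory",
    "mapreduce writes data to disk",
    "spark is fast",
]


def tokenize(lines):
    """Stage 1: split every line into individual words."""
    return [word for line in lines for word in line.split()]


def count(words):
    """Stage 2: count how often each word occurs."""
    return dict(Counter(words))


def run_with_disk_handoff(lines):
    """Write the stage 1 output to a temp file, then reload it for stage 2."""
    fd, path = tempfile.mkstemp(suffix=".json")
    try:
        with os.fdopen(fd, "w") as f:
            json.dump(tokenize(lines), f)
        with open(path) as f:
            words = json.load(f)
    finally:
        os.remove(path)
    return count(words)


def run_in_memory(lines):
    """Pass the stage 1 output straight to stage 2 without touching disk."""
    return count(tokenize(lines))


disk_result = run_with_disk_handoff(LINES)
memory_result = run_in_memory(LINES)
```

Both paths produce the same counts; the difference is only the extra write and read between stages, which is the kind of overhead an in-memory engine aims to avoid on multi-stage jobs.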
