Introduction to Spark SQL and DataFrames Preview

Introduction to Spark SQL and DataFrames

With Dan Sullivan Liked by 1,697 users
Duration: 1h 54m Skill level: Intermediate Released: 5/30/2019

Start my 1-month free trial

Course details

Explore DataFrames, a widely used data structure in Apache Spark. DataFrames allow Spark developers to perform common data operations, such as filtering and aggregation, as well as advanced data analysis on large collections of distributed data. With the addition of Spark SQL, developers have access to an even more popular and powerful query language than the built-in DataFrames API. In this course, instructor Dan Sullivan shows how to perform basic operations—loading, filtering, and aggregating data in DataFrames—with the API and SQL, as well as more advanced techniques that are easily performed in SQL. In this section of the course, Dan explains how to join data, eliminate duplicates, and deal with null or NA values. The lessons conclude with three in-depth examples of using DataFrames for data science: exploratory data analysis, time series analysis, and machine learning.

Skills you’ll gain

Earn a sharable certificate

Share what you’ve learned, and be a standout professional in your desired industry with a certificate showcasing your knowledge gained from the course.

Sample certificate

Certificate of Completion

  • Showcase on your LinkedIn profile under “Licenses and Certificate” section

  • Download or print out as PDF to share with others

  • Share as image online to demonstrate your skill

Meet the instructor

Learner reviews

4.7 out of 5

366 ratings
  • 5 star
    Current value: 276 75%
  • 4 star
    Current value: 72 19%
  • 3 star
    Current value: 9 2%
  • 2 star
    Current value: 3 <1%
  • 1 star
    Current value: 6 1%

Contents

What’s included

  • Practice while you learn 1 exercise file
  • Test your knowledge 4 quizzes
  • Learn on the go Access on tablet and phone

Similar courses

Download courses

Use your iOS or Android LinkedIn Learning app, and watch courses on your mobile device without an internet connection.