Apache PySpark by Example
With Jonathan Fernandes
Liked by 2,119 users
Duration: 1h 58m
Skill level: Intermediate
Released: 1/31/2019
Course details
Want to get up and running with Apache Spark as soon as possible? If you're well versed in Python, the Spark Python API (PySpark) is your ticket to accessing the power of this hugely popular big data platform. This practical, hands-on course helps you get comfortable with PySpark, explaining what it has to offer and how it can enhance your data science work. To begin, instructor Jonathan Fernandes digs into the Spark ecosystem, detailing its advantages over other data science platforms, APIs, and tool sets. Next, he looks at the DataFrame API and how it's the platform's answer to many big data challenges. Finally, he goes over Resilient Distributed Datasets (RDDs), the building blocks of Spark.
Skills you’ll gain
Earn a sharable certificate
Share what you’ve learned, and be a standout professional in your desired industry with a certificate showcasing your knowledge gained from the course.
LinkedIn Learning
Certificate of Completion
-
Showcase on your LinkedIn profile under “Licenses and Certificate” section
-
Download or print out as PDF to share with others
-
Share as image online to demonstrate your skill
Meet the instructor
Learner reviews
-
Harshini Kondeti
Harshini Kondeti
Looking for Full Time | C2C | PowerBI | Tableau | SQL | Apache Hadoop | Scala | AWS
-
Arun Ajithan
Arun Ajithan
Architect Mainframe Modernization Skills - DevOps(DBB, UCD, Jenkins, Git), API (ZOS Connect), Micro Services(ZCX Containers, OpenShift)…
-
Ashen Weligalle
Ashen Weligalle
Machine Learning Enthusiast | M.Sc. student in Machine Learning & Statistics at Linköpings University
Contents
What’s included
- Learn on the go Access on tablet and phone