Author Dan Sullivan
Released 9/7/2017
Dan also focuses on using Scala with Spark, a distributed processing platform. He first describes how to work with Resilient Distributed Datasets (RDDs), a fundamental Spark data structure, and then explains how to use Scala with Spark DataFrames, a newer class of data structure designed specifically for analytic processing. He wraps up the course by summarizing the advantages of using Scala for data science.
- The advantages of Scala for data science
- Scala data types
- Scala arrays, vectors, and ranges
- Parallel processing in Scala
- Mapping functions over parallel collections
- When and when not to use parallel collections
- Using SQL in Scala
- Scala and Spark RDDs
- Scala and Spark DataFrames
- Creating DataFrames
Skill Level Intermediate
- Hi, I'm Dan Sullivan, and in this course, I'll be describing how to use Scala for data science. We'll start by reviewing the basics of Scala data types and expressions. We'll cover how to use collections like arrays, sets, and maps. And we'll discuss the advantages of Scala's functional programming features for working with collections and analyzing data. We'll also use parallel collections, which allow us to easily implement parallel processing and take advantage of multicore processors.
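As a rough illustration of the collections and functional features mentioned above, here is a minimal sketch using only the Scala standard library. The object name `CollectionsSketch` and all sample values are made up for illustration and are not taken from the course materials.

```scala
object CollectionsSketch {
  // Hypothetical sample data: temperature readings in Celsius
  val celsius: Array[Double] = Array(15.0, 20.0, 25.0, 30.0)

  // A Set keeps only distinct elements, so the repeated "a" collapses
  val labels: Set[String] = Set("a", "b", "a")

  // A Map associates keys with values (hypothetical sensor names)
  val readingBySensor: Map[String, Double] =
    Map("sensor-1" -> 15.0, "sensor-2" -> 20.0)

  // map applies a function to every element and returns a new collection
  val fahrenheit: Array[Double] = celsius.map(c => c * 9.0 / 5.0 + 32.0)

  // filter keeps only the elements that satisfy a predicate
  val warm: Array[Double] = celsius.filter(_ > 20.0)

  // sum folds the collection into a single value; divide by length for the mean
  val mean: Double = celsius.sum / celsius.length
}
```

Each operation returns a new collection rather than modifying `celsius` in place, which is the property that makes this style convenient for data analysis.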
Next, we'll review the basics of using JDBC drivers and SQL queries to work with relational databases. We'll also explore how to use Scala with Spark, a popular distributed processing system widely used for big data analytics and data science. So let's start learning how to use Scala for data science.
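To give a sense of what working with a JDBC driver and a SQL query string can look like in Scala, here is a minimal sketch built on the standard `java.sql` API. It assumes a PostgreSQL JDBC driver is on the classpath and a database is reachable; the object name `JdbcSketch`, the `employees` table, and the `last_name` column are hypothetical placeholders, not names from the course.

```scala
import java.sql.{Connection, DriverManager, ResultSet}
import scala.collection.mutable.ListBuffer

object JdbcSketch {
  // Build a PostgreSQL JDBC URL from its parts
  def jdbcUrl(host: String, port: Int, database: String): String =
    s"jdbc:postgresql://$host:$port/$database"

  // Open a connection; requires a PostgreSQL JDBC driver on the classpath
  def connect(url: String, user: String, password: String): Connection =
    DriverManager.getConnection(url, user, password)

  // Read one string column from every remaining row of a ResultSet
  def columnValues(rs: ResultSet, column: String): List[String] = {
    val buffer = ListBuffer.empty[String]
    while (rs.next()) buffer += rs.getString(column)
    buffer.toList
  }

  // Run a SQL string against a hypothetical "employees" table
  def lastNames(conn: Connection): List[String] = {
    val stmt = conn.createStatement()
    try {
      val rs = stmt.executeQuery("SELECT last_name FROM employees")
      columnValues(rs, "last_name")
    } finally stmt.close() // closing the statement also closes its ResultSet
  }
}
```

A caller would typically pass the result of `jdbcUrl("localhost", 5432, "mydb")` to `connect`, call `lastNames`, and then close the connection; the host, port, and database name here are likewise placeholders.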
Introduction
- Welcome (53s)
1. Introduction to Scala
- Installing Scala (2m 56s)
- Scala data types (4m 34s)
- Scala collections (1m 17s)
- Scala maps (4m 28s)
- Scala expressions (3m 40s)
- Scala functions (2m 46s)
- Scala objects (8m 5s)
2. Parallel Processing in Scala
3. Using SQL in Scala
- Installing PostgreSQL (1m 50s)
- Loading data into PostgreSQL (2m 28s)
- Connecting to PostgreSQL (4m 34s)
- Querying with SQL strings (3m 40s)
4. Scala and Spark RDDs
- Introduction to Spark (1m 48s)
- Installing Spark (2m 1s)
- Mapping Functions over RDDs (8m 47s)
- Statistics over RDDs (8m 50s)
5. Scala and Spark DataFrames
- Creating DataFrames (5m 6s)
- Joining DataFrames (2m 18s)
- Working with JSON files (2m 22s)
Conclusion