From the course: Cloud Hadoop: Scaling Apache Spark
Unlock the full course today
Join today to access over 22,600 courses taught by industry experts or purchase this course individually.
Transformations and actions - Apache Spark Tutorial
From the course: Cloud Hadoop: Scaling Apache Spark
Transformations and actions
- (Narrator)As we're working with this section, you will see some of the print statements may or may not have parentheses around the variable. I have tried to correct this for Python three in all of my examples, but I do see in various documentation that this changed from Python two to Python three, sometimes code examples are still lacking the parentheses around the variable in the print statement. So if you're working with Python three, be sure to add them. Another tip I have for you in working in this environment, is to take a look at and use the sample notebooks that Databricks provides. We had earlier imported data to make a table. Do you remember how we did that? clicked over here on table and then we created a table. Now you can do it through the interface, but you can also just import files and not work with the table abstraction. How does that work? We're going to go back here. And in the center here we…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
(Locked)
Tour the Databricks Environment4m 36s
-
(Locked)
Tour the notebook5m 29s
-
(Locked)
Import and export notebooks2m 56s
-
(Locked)
Calculate Pi on Spark8m 30s
-
(Locked)
Run WordCount of Spark with Scala4m 59s
-
(Locked)
Import data2m
-
(Locked)
Transformations and actions3m 21s
-
(Locked)
Caching and the DAG6m 49s
-
(Locked)
Architecture: Streaming for prediction3m 51s
-
(Locked)
-
-
-
-