Understand how to install PostgreSQL, an open-source relational database.
- [Instructor] We will be using PostgreSQL as our relational database. I've opened a browser and navigated to enterprisedb.com. From here I can download a version of PostgreSQL. I'm going to select the latest version, and I'm going to select the operating system. I'm using a Mac OS X, but the installation works similarly on Linux and Windows. And I will download now. And that will start the download process.
Now, I'm opening the image that I downloaded. And that displays a folder with, in this case, "PostGres," a version, dash "OSX." So, I'm just going to double-click on that. And I get a warning message, and I'll enter that. And I'll enter my password. And when the setup wizard starts, I can just select Next. I'm going to select defaults, for the most part.
And I'll enter a password. And of course, it's important to remember that. We'll need that when we connect to the database. And I'll use the default port, and the default locale. And I'll install. Now at this point, we're offered the option of installing some additional packages. I'm going to deselect that. We don't need to use Stackbuilder. So I will click Finish.
And we have installed PostGres. The EnterpriseDB installation package will start Postgres for you. If it doesn't start automatically for you, check the Postgres documentation.
Dan also focuses on using Scala with Spark, a distributed processing platform. He first describes how to work with Resilient Distributed Datasets (RDDs)—a fundamental Spark data structure—and then explains how to use Scala with Spark DataFrames, a new class of data structure specially designed for analytic processing. He wraps up the course by providing a summary of advantages of using Scala for data science.
- The advantages of Scala for data science
- Scala data types
- Scala arrays, vectors, and ranges
- Parallel processing in Scala
- Mapping functions over parallel collections
- When and when not to use parallel collections
- Using SQL in Scala
- Scala and Spark RDDs
- Scala and Spark DataFrames
- Creating DataFrames