From the course: Data Science Foundations: Data Engineering

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Loading and profiling data

Loading and profiling data

From the course: Data Science Foundations: Data Engineering

Start my 1-month free trial

Loading and profiling data

- [Narrator] Now let's take a look at loading and profiling our data. First, we're going to create our staging environment. Then we're going to upload some sales data. And then we'll do a simple row count to create that profile. Here in my virtual machine I have the exercise file 2_1.sql open and what I want to do is run some options on the command line here that will actually create our structure. So, where we're going to land our staging data. I'm just going to copy this from the file and then paste it in to my terminal window here. Do that for the other two. Alright, so we have our data structure, so now we can actually upload some of the data. We need to browse in the terminal window over to where the actual exercise files live. So if I take a look at my current directory, this is the user directory. What I need to do is find out where that actual folder is, and it's under /media/sf_Exercise_Files. Now from here I can take a look and I have data, scripts, and setup. What I want is…

Contents