Learn how to load data into dimension tables, including handling slowly changing dimensions.
- [Narrator] All right, now let's take a look at loading our dimension tables. First, we need to actually create a dimension table, so we're going to create the client dimension table, then we'll go and load that table from our cleansed environment. Here in my virtual environment, I have 5_2.sql up, this is from the exercise files, and it has all the code we need. First we're going to create our client dimension table, I'm going to copy and paste that into my Hive environment. Make sure I'm in the correct database here, I'm looking at the default database.
This is our schema, and it matches what we had in cleansed, so I'm just going to go ahead and execute that. Success and that should appear on the left here in a second. There it goes. Then we need to actually load that data. What I'm going to do is copy the part two of this and paste it into my Hive environment. All we're doing here is doing an insert overwrite, so we're going to overwrite any data if there was any data in there, and select the fields from our cleansed environment, which we've already processed and cleaned up. Go ahead and hit play.
Now that that's there, we'll just take a look and see what data we have. Select star from clients, and it looks like the data was ported over successfully.
- Working with systems and schemas
- Managing of a good data pipeline
- Setting up an environment
- Loading and profiling data
- Testing quality
- Adding data types
- Handling missing values and inferred members
- Performing master data lookups
- Loading schemas and tables
- Creating views