At the end of this video the student will understand the different join types supported by Hive and when to use each one.
- [Narrator] Okay, now we're going to take a look at…how to combine tables using JOIN.…Now, JOIN is a function in which Hive will allow you…to combine these different tables to create…a more comprehensive result set.…JOIN isn't something unique to Hive.…This is an anti standard feature of SQL…and really a fundamental thing to understand…when you're just working with databases,…as it's really common to separate out different data…into separate tables.…Now one thing to note about Hive is that it only supports…EQUI JOINs, that's joins that use the equals operator.…
So, at this stage you may not be thinking,…why else would you use a different operator?…But rest assured, there are other databases and cases…where you come across where you may want…to have a non equal, or greater than, or less than…in your actual join condition.…Here in Hive, that's just not supported currently.…Let's first take a look at just some of the join types,…and the first one of course is the most common,…which is an Inner Join.…Now this is the type of join where you only get results…
This course shows how to use Hive to process data. Instructor Ben Sullins starts by showing you how to structure and optimize your data. Next, he explains how to get Hue, the Hadoop user interface, to leverage HiveQL when analyzing data. Using the newly configured option, he then demonstrates how to load data, create aggregate tables for fast query access, and run advanced analytics. He also takes you through managing tables and putting functions to use. This course is designed to help you find new ways to work with datasets so you can answer the tough data science questions that come your way.
- Defining data structures in Hive
- Selecting data
- Joining tables
- Manipulating data
- Filtering results
- Aggregating data
- Using built-in aggregate functions
- Mastering built-in table-generating functions
- Using CUBE and ROLLUP
- Using clauses: WHERE and HAVING
- Using LIKE, JOIN, and SEMI JOIN
- Using functions: String, math, date, and conditional