From the course: Hadoop for Data Science Tips, Tricks, & Techniques

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Select partitions in Hive

Select partitions in Hive - Hadoop Tutorial

From the course: Hadoop for Data Science Tips, Tricks, & Techniques

Start my 1-month free trial

Select partitions in Hive

- [Narrator] When working with large tables in Hive, it can dramatically improve query performance by only selecting a specific partition of the table. And if you're setting up these tables, it often makes sense most of the times, at least by year, but often by other dimensions or other categories of things, to separate partitions. That way if people, the analysts are currently only working in one section of that data, they're having to analyze a much smaller set. So let's take a look at that now in Partitioning Data in Hive. First what I want to do, and I have the exercise file for this video open here. I want to open up my web browser and I want to go into the Hue UI. I'll go over to Hive. And here's where we're going to do our work. So first, let me just delete any previous queries that were there and I'm going to create a new table called sales all years partitioned. And if I scroll down to the bottom, it's partitioned by company name and this is a string and this table's going to…

Contents