From the course: Hadoop for Data Science Tips, Tricks, & Techniques


Create aggregates in Hive


- [Instructor] When you're working with data in Hive, one of the things you'll most likely end up doing is creating aggregate tables. These are rollup tables that summarize the data for you, and the reason you create them is that they're a lot faster to query and analyze. So if you have a very low level of detail, say, the exact minute or second that an event occurred, like a sales order, but you often find yourself reporting on a weekly or monthly basis, it makes sense to create these tables in advance. That way, when you run your dashboards, or however you provide those analytics, you have a really small and easy table to work from. So that's what we're going to take a look at here: creating some aggregates in our Hive environment. First, what I'm going to do is move the sales files that we downloaded earlier over to a new location in HDFS. If I'm already in the directory where I downloaded the exercise files, I'm just going to run hadoop fs -put and the…
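
To make the rollup idea concrete, here is a minimal HiveQL sketch of a monthly aggregate built from second-level sales data. It is an illustration under stated assumptions, not the course's actual exercise: the table names `sales_orders` and `sales_monthly` and the columns `order_id`, `order_ts`, and `amount` are all hypothetical, and the real exercise files may be laid out differently.

```sql
-- Hypothetical detail table: one row per sales order, timestamped to the second.
-- (Assumed schema for illustration; not taken from the course files.)
CREATE TABLE IF NOT EXISTS sales_orders (
  order_id BIGINT,
  order_ts TIMESTAMP,
  amount   DECIMAL(12,2)
);

-- Roll the detail up to one row per month with a CREATE TABLE AS SELECT.
CREATE TABLE sales_monthly AS
SELECT
  date_format(order_ts, 'yyyy-MM') AS order_month,  -- e.g. '2017-03'
  COUNT(*)                         AS order_count,  -- number of orders in the month
  SUM(amount)                      AS total_amount  -- total sales for the month
FROM sales_orders
GROUP BY date_format(order_ts, 'yyyy-MM');
```

A monthly dashboard could then read from `sales_monthly` directly instead of scanning every order row, which is where the speed-up described above would come from.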
