At the end of this video the student will know how to parse and manipulate data using string functions.
- [Instructor] Let's dive in here and look at actually manipulating strings in Hive. Now, what we're going to do is use the string manipulation functions that come in HiveQL to change data inside a table or query. The first thing we're going to do is create a new customer key. Then, we're going to parse a URL, which has a special function that we can use. And lastly, we'll use the substring function to add to our customer key that we create in step one. Here in my Cloudera environment I'm going to open up the scripts folder, which has the exercise files for this course.
And I'm going to open 7-2.sql. Once I have that I'm just going to copy this and go back to my Hive editor and paste it in. And the first example we're going to do here is just to define a new customer key. We're going to use lower to make the name lowercase. Then, we're going to use a different function, regexp_replace (regexp stands for regular expression), which is a way of identifying certain characters and in this case replacing them with something else.
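The transcript does not show the contents of 7-2.sql, so the query below is only a minimal sketch of the three steps described above, not the course's actual script. The sample values, the column names (name, website, state), and the choice of which characters to strip are all assumptions made for illustration. It relies on the built-in Hive functions lower, regexp_replace, parse_url, substr, and concat, and selects from an inline row so it can run without any table (Hive 0.13 or later).

```sql
-- Hypothetical sample row; the real exercise presumably reads from a customer table.
SELECT
  -- Step 1: lowercase the name, then strip everything that is not a-z or 0-9
  -- (the pattern is an assumption; the transcript does not say what is replaced).
  regexp_replace(lower(c.name), '[^a-z0-9]', '')   AS customer_key,

  -- Step 2: parse_url extracts one part of a URL; 'HOST' returns the host name.
  parse_url(c.website, 'HOST')                     AS website_host,

  -- Step 3: substr(string, start, length) is 1-based; here the first three
  -- characters of a second column are appended to the key from step 1.
  concat(
    regexp_replace(lower(c.name), '[^a-z0-9]', ''),
    '-',
    substr(c.state, 1, 3)
  )                                                AS customer_key_extended
FROM (
  SELECT 'Acme Corp, Inc.'                  AS name,
         'http://www.example.com/shop?id=7' AS website,
         'California'                       AS state
) c;
```

To try this against real data, the inline subquery would be replaced with the actual table and the column names adjusted to match. Other parts accepted by parse_url include 'PATH', 'QUERY', and 'PROTOCOL'.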
This course shows how to use Hive to process data. Instructor Ben Sullins starts by showing you how to structure and optimize your data. Next, he explains how to get Hue, the Hadoop user interface, to leverage HiveQL when analyzing data. Using the newly configured option, he then demonstrates how to load data, create aggregate tables for fast query access, and run advanced analytics. He also takes you through managing tables and putting functions to use. This course is designed to help you find new ways to work with datasets so you can answer the tough data science questions that come your way.
- Defining data structures in Hive
- Selecting data
- Joining tables
- Manipulating data
- Filtering results
- Aggregating data
- Using built-in aggregate functions
- Mastering built-in table-generating functions
- Using CUBE and ROLLUP
- Using clauses: WHERE and HAVING
- Using LIKE, JOIN, and SEMI JOIN
- Using functions: String, math, date, and conditional