Learn how to set up the demo environment to use for this course.
- Now let's get set up with our Sandbox. And we start by going to teradata.com and finding our download. So, from here on teradata.com I'm going to products and services and I'll just look at all products A through Z. On this page they have everything that they offer. I'll just do a quick search for Presto. And, click on the link for the free download. On the download page, we're going to use the cloudera Sandbox, but there's also a Hortonworks Sandbox, if you're more familiar and want to experiment a bit later. For this course, click on the download Presto Sandbox link.
Then you'll need to fill out the form here with your information in order to get access. You'll agree to the license letting you know that this is trial, this is a Sandbox environment, don't run this in production. And then you'll get the link to actually download it on this page here. When you go there, you'll want to choose the VirtualBox virtual machine. That's the platform we're going to use to actually run this. So here in VirtualBox, I have just installed that it's pretty straightforward you can download it and just click next through the installer. The way to set up Presto is then to import the appliance.
So click file then import. And I'll browse to my download. And from here I'll just import it with the basic settings. Now before we start the actual VM what we need to do is set up our networking environment. So I'm going to go to VirtualBox and preferences then network and host only networks. Here I need to create a host only network so I can access the virtual machine from my local terminal window. I'll click on the plus sign to create one, then I need to adjust some of the settings.
I need to go into the DHCP Server and enable this. This is where I set up the IPs that I want my machine to use. So here I'll type in one-nine-two dot one-six-eight dot five-six dot one hundred I'll copy that for the server mask. I'll enter two-five-five dot two-five-five dot two-five-five dot zero. For the lower address bound, I'm going to paste in the IP that I copied from above and set it up as one-zero-one.
And for the upper address bound I will paste it in again and will give it an upper bound of two-five-four. Hit okay. Hit okay. And now we have the network we need to assign that network to our Presto virtual machine. I'll select that from the list here and choose settings then go into network. And where it says attached to NAT, I'll change that to a host only adapter and it will automatically select the one we just created. Hit okay and then I can start my virtual machine.
Once it's started up, I'll go ahead and dismiss these warnings here and it'll ask me about capturing the mouse. I'm going to say do not show this message again and cancel. The virtual machine doesn't use the mouse it's only the command line. So we don't want that to happen. Then when I click in here I'll log in with presto as the username and presto again as the password. From there, what I can do is find my IP by typing, "ifconfig". And you can see that inet address here is one-nine-two-one-six-eight-five-six one-zero-one.
This is what we are going to use to log in from our own terminal window. So click over here onto my terminal window and type that in ssh presto for my username @ one-nine-two one-six-eight five-six one-zero-one Give it the password of presto and you can see I'm logged in. Let me just resize the window a little bit here. I can do a simple command and I can see that I am now in the presto machine and I have access to all that. We can start running our commands and really diving into presto.
Data science expert Ben Sullins helps you get up to speed with Presto, and leverage it to accomplish a wide-range of data science and analytics tasks. He uses different interfaces with Presto—such as R and Tableau—and digs into the expressive SQL language that Presto offers for your analysis. At the end of this course, you'll know the key concepts of Presto and how to use them to take full advantage of your modern big data system.
- What does Presto do?
- Running Presto
- Connecting from Tableau and R
- Connecting to Hive, MySQL, and the local system
- Retrieving data
- Combining data sources
- Basic SQL functions
- Advanced SQL functions
- Migrating from Hive