From the course: Data Science Tools of the Trade: First Steps

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Hadoop hands-on

Hadoop hands-on

From the course: Data Science Tools of the Trade: First Steps

Start my 1-month free trial

Hadoop hands-on

- When you install Hadoop on your computer, by default, it's configured to run in a single-node mode. Here, node refers to a standalone computer, whether it is a physical or virtual machine. Of course, there's no point of operating Hadoop this way, but this is still a necessary step to get to the multi-node mode. You need to take additional steps to set up your default Hadoop installations into a multi-node distributed system. One of these extra tasks is to designate a node as the master, while configuring others as slaves. Because of this, a minimum Hadoop installation is a single-node setup. And this is what we are going to try to accomplish here. Another prerequisite is your access to computers available for Hadoop installation. Hadoop is designed to run on bare-metal hardware and it is therefore ideal to get ahold of physical machines with access to the internet for our experiment. However, it's perfectly fine for us to use a virtual machine here, especially for testing purposes…

Contents