From the course: Data Science Tools of the Trade: First Steps

Enabling technologies

- Cloud computing, virtualization, distributed computing, and machine learning are all enabling technologies that allow data scientists to do their jobs effectively and efficiently. These tools are indispensable and interconnected. Distributed computing builds on cloud computing and virtualization. Machine learning, in turn, relies on distributed computing. I'm excited to report that I've found excellent examples you can experiment with for all of these enabling technologies for data science. Proxmox is my choice for cloud computing and virtualization. It is relatively easy to install, and you can get a good sense of what it takes to build your own cloud by installing and configuring the software. Hadoop and Spark are open source platforms for distributed computing. Hadoop is more comprehensive in its features, and Spark can be plugged into Hadoop to enhance its distributed processing feature. Weka is a powerful machine learning tool that allows its users to run various machine learning…
