From the course: Twelve Myths About Data Science

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Big data is one thing

Big data is one thing

From the course: Twelve Myths About Data Science

Start my 1-month free trial

Big data is one thing

- [Instructor] Often when people refer to Big Data, they talk of it as if it were a singular thing. And that's just not true. So, if we think about Big Data and we look just at one of the most popular frameworks, Hadoop, we can really understand how Big Data is an entire framework of things. In fact if you look at the definition of what Hadoop is and how it's described, we find that Hadoop is a framework for distributed processing of large data sets across clusters of computers using simple programming models. So none of this here talks about it as a singular thing, unless you consider a framework a singular thing. But by the nature, a framework is a combination of things. If we just take a look at the Apache Hadoop Ecosystem here, what we'll find is that there are a number of systems that all work together to achieve this goal, this distributed computing platform goal. If you want to implement Hadoop, you'll need to understand at least the basic level of most of these platforms. For…

Contents