Advance Your Skills in the Hadoop/NoSQL Data Science Stack
The ecosystem of data science tools based on the Hadoop and NoSQL stack is a crucial and growing part of the future of the industry. Data engineers need a broad knowledge of the components of this platform, and this learning path enables learners to dig in and skill up.
Extend your general Hadoop knowledge.
Branch into related tools such as Kafka, HBase, Hive, and Cassandra.
Learn how to use Hive to analyze large datasets and derive information from Hadoop. Learn how to work with tables, structures, aggregations, clauses, functions, and more.
1h 53m • COURSE
Advanced NoSQL for Data Science with Dan Sullivan
Explore the fundamentals of NoSQL. Learn the differences between NoSQL and traditional relational databases, discover how to perform common data science tasks with NoSQL, and more.
1h 56m • COURSE
Extending Hadoop for Data Science: Streaming, Spark, Storm, and Kafka with Lynn Langit
Extend your Hadoop data science knowledge to other Apache data science tools and attendant technologies including Apache Spark, Storm, Kafka, and more.
2h 53m • COURSE
Kafka Essential Training with Ben Sullins
Get up to speed with Apache Kafka, a distributed streaming platform that provides scalable, high-throughput messaging systems in place of traditional messaging systems like JMS.
1h 20m • COURSE
Hadoop for Data Science Tips, Tricks, & Techniques with Ben Sullins
Get up to speed with Hadoop. Learn tips and tricks for doing data science work in this popular big data platform.
1h 12m • COURSE
HBase Essential Training with Ben Sullins
Learn the basics of HBase, the Hadoop database for big data analytics. Get an understanding of the HBase architecture and basic read/write commands.
1h 20m • COURSE
NoSQL Data Modeling Essential Training with Robert Van Cleave
Get started with data modeling for NoSQL databases and learn how to work with common design patterns.
1h 20m • COURSE
Cassandra Data Modeling Essential Training with Dan Sullivan
Learn about the architecture of Cassandra—a popular NoSQL database capable of handling large amounts of fast-changing data—and discover how to design Cassandra data models.
1h 38m • COURSE
You'll learn Hadoop and NoSQL skills with these experts.
As a lifelong data geek, Ben Sullins dedicates his time to helping others use data wisely.
Ben makes information meaningful and has fun doing it. His background affords him a unique set of knowledge that sets him apart in the data community. Since the late 1990s, he has consulted many high-tech companies, including Facebook, Microsoft, LinkedIn, Cisco, Mozilla, Pluralsight, and Genentech, on democratizing data in their organizations. Moreover, Ben spent three months leading the charge at Facebook to grow its data culture by demonstrating proper tool implementation and data visualization techniques using Tableau. And with this expertise, Ben aims to provide exceptional service to his customers by enriching their lives with impactful smart data.
Dan Sullivan, PhD, is an enterprise architect and big data expert.
Dan specializes in data architecture, analytics, data mining, statistics, data modeling, big data, and cloud computing. In addition, he holds a PhD in genetics, bioinformatics, and computational biology. Dan works regularly with Spark, Oracle, NoSQL, MongoDB, Redis, R, and Python. He has extensive writing experience in topics including cloud computing, big data, Hadoop, and security.
Lynn Langit is a cloud architect who works with Amazon Web Services and Google Cloud Platform.
Lynn specializes in big data projects. She has worked with AWS Athena, Aurora, Redshift, Kinesis, and the IoT. She has also done production work with Databricks for Apache Spark and Google Cloud Dataproc, Bigtable, BigQuery, and Cloud Spanner.
Lynn is also the cofounder of Teaching Kids Programming. She has spoken on data and cloud technologies in North and South America, Europe, Africa, Asia, and Australia.
Robert Van Cleave is a deeply experienced data architect and consultant.
During his time at Cognizant, a leading professional services company, Robert served as director and chief architect and was recognized for his excellent client-facing consulting skills. He led both onshore and offshore teams, providing Fortune 500 clients with customized digital transformation solutions to boost revenue, efficiency, and cost savings. Earlier in his career, Robert worked as a senior consultant at IBM Global Business Services and as an enterprise data architect at USAA. He currently works as a consultant. Robert's professional skills include enterprise architecture, data architecture, SOA, SDLC, requirements analysis, and IT strategy.
Learning Paths are big commitments. Keep your goal in focus by taking one at a time. Starting Advance Your Skills in the Hadoop/NoSQL Data Science Stack will pause your previous path and save your progress.