From the course: Spark for Machine Learning & AI

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Summary of clustering techniques

Summary of clustering techniques - Apache Spark Tutorial

From the course: Spark for Machine Learning & AI

Start my 1-month free trial

Summary of clustering techniques

- [Instructor] Clustering algorithms are useful for exploring data. K-means is especially useful and commonly used for exploring small and modest sized data sets. For large data sets, hierarchical clustering algorithms, like bisecting K-means is a better option. One thing to keep in mind is that different algorithms may find different cluster centers, but in general, they should be fairly close.

Contents