From the course: Scala Essential Training for Data Science

The advantages of Scala for data science - Scala Tutorial

From the course: Scala Essential Training for Data Science

Start my 1-month free trial

The advantages of Scala for data science

- [Instructor] Data scientists can use virtually any language for analytics, but Python and R are the most popular. Scala is a valuable language for data science because it is a language designed for scalability. This is especially important when working with large datasets. Scala runs on the Java Virtual Machine and therefore it can run anywhere Java runs. It uses both functional and object-oriented programming paradigms. Functional programming is a style of computation that uses functions to compute values and reduce the amount of state information that has to be maintained. Scala also employs object-oriented techniques such as structuring programs around data and methods. Scala programs can work with relational databases using SQL from Scala. JDBC drivers that are used with Java can also be used with Scala programs for querying data and issuing database commands. Scala is designed to take advantage of multiple cores. Abstractions like parallel collections make it easy to parallelize computations over large datasets. Apache Spark is a widely used big data analytics platform that's written in Scala. Although Spark supports Java, Python and R programs, Scala is a popular language for Spark applications that want to take full advantage of fast execution times. Now this concludes our brief look at the advantages of Scala for data science.

Contents