Learn about key feature and performance requirements for databases in data science.
- [Instructor] What kind of features and capabilities…do we expect databases to support in order…for them to be useful inside a data science environment?…Well, first databases need to support data integrity.…Data stored in the database should be persistent…and delivered without loss of integrity.…This is one of the primary…requirements for any database system.…Second, consider the database capabilities.…
Databases can support a variety of operation like insert,…updates, deletes, queries, joints, aggregations, etc.…What will you need for your project?…Third, databases need to provide a way to model…relationships between entities.…What about supported data types?…Storing date in specific data types provides…builtin integrity and operation support.…Data types like blobs are needed these days…to store large quantities of text and media.…
Support for transactions are important,…especially for OLDB use cases.…Next data science applications invariably need…large quantities of data to build reliable models.…How does one do scalability to tera and…
Released
6/19/2018Kumaran Ponnambalam begins by discussing the roles of databases in data science, as well as the key feature and performance requirements for databases in this field. Next, Kumaran goes over different database types, sharing the strengths and weaknesses of each one. To wrap up, he walks through specific use cases and shows how to select the best database technology for each situation.
Share this video
Embed this video
Video: Key requirements for databases in data science