In data science, as in life, it is important to focus on what matters most. In this video, learn how feature selection, or the choice of meaningful variables, is one way to do this in data science models.
- [Instructor] I teach statistics to undergraduate students … who don't always see how it connects to their lives. … I can give specific examples about each of the fields, … but I found that even the most recalcitrant student … can get excited about data … when we talk about sports like baseball. … Baseball's a data-friendly sport. … It's been going on for over 100 years, … there are 162 games in the regular season, … and they count everything. … If you're trying to figure out how good, for example, … a particular batter is, … you can start with these basic bits of data … and you'll have an enormous amount of information … to work with. … These are the features in the dataset that you start with. … But if you're a coach or a manager, … you can do a lot more … than just use those raw data points to make a strategy. … You can start combining them … to create new features in your dataset … and finding value and possibilities in your team. … Now you can start with really simple ones. … This is the batting average, …
Author
Released
8/8/2019- Assess the skills required for a career in data science.
- Evaluate different sources of data, including metrics and APIs.
- Explore data through graphs and statistics.
- Discover how data scientists use programming languages such as R, Python, and SQL.
- Assess the role of mathematics, such as algebra, in data science.
- Assess the role of applied statistics, such as confidence intervals, in data science.
- Assess the role of machine learning, such as artificial neural networks, in data science.
- Define the components of effective data visualization.
Skill Level Beginner
Duration
Views
Related Courses
-
Introduction
-
1. What Is Data Science?
-
The data science pathway4m 51s
-
2. The Place of Data Science in the Data Universe
-
Artificial intelligence8m 22s
-
Machine learning8m 6s
-
Big data5m 36s
-
Predictive analytics4m 57s
-
Prescriptive analytics7m 42s
-
Business intelligence4m 40s
-
-
3. Ethics and Agency
-
4. Sources of Data
-
Data preparation5m 26s
-
In-house data2m 6s
-
Open data4m 49s
-
APIs2m 40s
-
Scraping data4m 44s
-
Creating data5m 37s
-
Self-generated data3m 30s
-
-
5. Sources of Rules
-
6. Tools for Data Science
-
Languages for data science3m 55s
-
7. Mathematics for Data Science
-
Algebra7m 25s
-
Calculus5m 3s
-
Bayes' theorem4m 25s
-
-
8. Analyses for Data Science
-
Descriptive analyses6m 38s
-
Predictive models7m 32s
-
Trend analysis6m 22s
-
Clustering5m 45s
-
Classifying5m 34s
-
Dimensionality reduction5m 42s
-
Validating models4m 55s
-
Aggregating models4m 8s
-
-
9. Acting on Data Science
-
Interpretability3m 17s
-
Actionable insights2m 53s
-
-
Conclusion
-
Next steps2m 47s
-
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.
CancelTake notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.
Share this video
Embed this video
Video: Feature selection and creation