Explore different snapshots of data at different stages of the process.
- [Narrator] So let's follow the data…through the whole process.…When the modeler begins, their almost always starting…by looking at lots of transactions.…So for instance, we have a bunch of items…that people have bought,…and we can see that several rows…are associated with one customer.…What they're looking for at this stage,…is things as simple as,…how long is the maximum length of time…a customers been around?…Did they buy just a few things,…or did they buy a lot of things?…Because, what we want to zero in on…is the individual customer level.…
So here we're looking at just one customer.…Mr. Streeter, who lives on Rockwell Lane,…and what we have to now do, is pull in…all the transactions that belong to just him.…So our data set is about to get shorter,…but much wider,…let's see why.…It's not just all those transactions…that we're interested in, but all kinds of other things…that we can get at that individual level.…So we might have had hundred of thousands of transactions,…but associated with ten's of thousands of customers.…
Author
Released
12/11/2018Note: This course is software agnostic. The emphasis is on strategy and planning. Examples, calculations, and software results shown are for training purposes only.
- Evaluating the proper amount of data
- Assessing data quality and quantity
- Seasonality and time alignment
- Data preparation challenges
- Data modeling challenges
- Scoring machine-learning models
- Deploying models and adjusting data prep and scoring
- Monitoring and maintenance
Skill Level Beginner
Duration
Views
Related Courses
-
Machine Learning and AI Foundations: Recommendations
with Adam Geitgey58m 7s Intermediate -
Deploying Scalable Machine Learning for Data Science
with Dan Sullivan1h 43m Intermediate
-
Introduction
-
Defining terms1m 48s
-
1. The Phases of a Machine Learning Project
-
2. Designing a Machine Learning Dataset
-
How much data do I need?2m 12s
-
Balancing1m 56s
-
Who truly has big data?3m 43s
-
Assessing data3m 32s
-
-
3. Data Prep Challenges
-
Data and the data scientist2m 55s
-
Dummy coding2m 1s
-
Feature engineering2m 51s
-
-
4. Modeling Challenges
-
Slow algorithms: Brute force1m 59s
-
Slow algorithms: More models2m 24s
-
How to sample properly2m 36s
-
Modeling with missing data3m 37s
-
5. Scoring
-
Scoring a black box model2m 50s
-
Scoring an ensemble1m 49s
-
6. Deployment
-
Batch vs. real-time scoring4m 39s
-
Data prep and scoring2m 59s
-
-
7. Monitoring and Maintenance
-
Conclusion
-
Next steps1m 1s
-
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.
CancelTake notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.
Share this video
Embed this video
Video: The stages of predictive analytics data