- [Voiceover] The last thing we wanna discuss…before we wrap up our course…on an introduction to data science…is reproducible research.…I like to think of this…as leaving a digital trail of your work.…There's a few reasons…you would wanna be able to do this.…First is, it allows you to check your work…and verify your conclusions.…Second, for your client,…for future researchers, and yourself,…and anyone who's gonna come back to the project,…it allows them to see how it happened.…Third, it may be required by certain policies.…
And fourth, by documenting your process,…it ensures a form of intellectual honesty.…Now there's a few things…that you actually need to include…in reproducible research.…Number one is your sources.…You want to include the raw data…before it's processed at all.…You want to include a list of the goals…and the rationale for the project,…and the resources,…and that can include the software, the hardware,…even the people who worked on it,…the funding sources, whatever is in the background.…Next, you want to talk about process.…
Author
Released
7/27/2016- Assess the skills required for a career in data science.
- Evaluate different sources of data, including metrics and APIs.
- Explore data through graphs and statistics.
- Discover how data scientists use programming languages such as R, Python, and SQL.
- Assess the role of mathematics, such as algebra, in data science.
- Assess the role of applied statistics, such as confidence intervals, in data science.
- Assess the role of machine learning, such as artificial neural networks, in data science.
- Define the components of effective data visualization.
Skill Level Beginner
Duration
Views
Related Courses
-
Code Clinic: R (2015)
with Mark Niemann-Ross3h 24m Intermediate
-
Introduction
-
Welcome58s
-
Exercise files34s
-
-
1. What Is Data Science?
-
Demand3m 54s
-
Venn diagram4m 2s
-
Pipeline4m 43s
-
Roles3m 14s
-
Team2m 14s
-
-
2. Fields of Study
-
Big data3m 20s
-
Programming2m 26s
-
Statistics1m 57s
-
-
3. Ethics
-
Ethical issues2m 39s
-
-
4. Data Sources
-
Metrics3m 43s
-
Existing data4m 36s
-
APIs4m 38s
-
Scraping2m 16s
-
Creating data3m 3s
-
-
5. Data Exploration
-
Exploratory graphs4m 32s
-
Exploratory statistics4m 26s
-
-
6. Programming
-
Spreadsheets3m 49s
-
R5m 18s
-
Python4m 51s
-
SQL3m 44s
-
Web formats3m 53s
-
-
7. Mathematics
-
Algebra6m 22s
-
Systems of equations5m 11s
-
Calculus9m 50s
-
Big O5m 8s
-
Bayes probability8m 15s
-
-
8. Applied Statistics
-
Hypothesis6m 23s
-
Confidence5m 42s
-
Problems5m 30s
-
Validating3m 35s
-
-
9. Machine Learning
-
Decision trees5m 22s
-
Ensembles5m 15s
-
k-nearest neighbors (kNN)5m 26s
-
Naive Bayes classifiers5m 16s
-
Artificial neural networks5m 43s
-
-
10. Communicating
-
Interpretability5m 50s
-
Actionable insights4m 40s
-
Reproducible research3m 27s
-
-
Conclusion
-
Next steps2m 17s
-
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.
CancelTake notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.
Share this video
Embed this video
Video: Reproducible research