Join Mike Chapple for an in-depth discussion in this video Inconsistent spellings, part of Cleaning Bad Data in R.
- [Narrator] Spelling errors are another source of chaos…when it comes to data analysis,…particularly when you're trying to group results.…There are many possible sources of spelling errors,…but one of the most common causes of those errors…is when people are hand entering data into a system…and they don't know how to spell a word,…they type it in incorrectly,…or they use inconsistent punctuation.…One of the best ways that you can reduce spelling errors…is to limit the use of freeform text input fields…in your applications.…
Instead of asking a user to type in an entry,…you can present them with a list of options…and ask them to select the appropriate value.…Of course, you can only do this…when the range of choices is limited.…Otherwise you'll have to undertake the painstaking process…of identifying and correcting spelling errors…in your data before you begin analysis.…Let's take a look at an example…of how we can do this is in R.…We'll be working with a data file…that contains the results…of over 145,000 restaurant inspections…
Where possible, instructor Mike Chapple shows how to correct the issues using R, but the same principles can be applied to any statistical programing language.
- Missing data
- Duplicate rows and values
- Converting data
- Formatting data
- Working with tidy data
- Tidying data sets
- Dealing with suspicious data
Skill Level Beginner
1. Missing Data
2. Duplicated Data
3. Formatting Data
5. Tidy Data
6. Red Flags
What's next?1m 5s
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.Cancel
Take notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.