From the course: Learning the R Tidyverse
The benefits of long (or tidy) data - R Tutorial
The benefits of long (or tidy) data
- [Instructor] The Tidyverse ecosystem of packages is extremely explicit about the format of the data that you need to provide it. That's because tidy, or long-formatted, data yields a number of benefits to you as data scientists and to the developers of other packages that want to work well with the Tidyverse ecosystem. The primary benefit of tidy data is communicated beautifully in this quote from Hadley Wickham: "Tidy datasets are all alike, but every messy dataset is messy in its own way." Tidy datasets mean packages can get on with their job and not have to deal with your mess. Let's look at a definition of a tidy or long dataset so you understand what we're talking about here. Tidy data meets the following definitions. Individual columns pertain to individual variables. In our example here, we have four variables: Country, Year, Horses per 100 people, and Carriages per horse. Each observation is a unique row in the dataset. And new columns are added for every new variable. Importantly, any…
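As a rough illustration of the definition above, here is a minimal sketch in R of reshaping a wide table into the tidy (long) layout. It assumes the tibble and tidyr packages are installed. The object names (horses_wide, horses_long), the column names, the country labels and every number are hypothetical and invented for this example; they are not the data shown in the video.

```r
library(tibble)
library(tidyr)

# Hypothetical wide layout: one column per year.
# All values are invented for illustration only.
horses_wide <- tribble(
  ~country,    ~`1900`, ~`1910`,
  "Country A", 10,      12,
  "Country B", 20,      18
)

# Reshape so that each row is one country-year observation
# and each column holds exactly one variable.
horses_long <- pivot_longer(
  horses_wide,
  cols = c(`1900`, `1910`),
  names_to = "year",
  values_to = "horses_per_100_people",
  names_transform = list(year = as.integer)
)

print(horses_long)
```

In the long result, country, year and horses_per_100_people are each a single column, and every row is one observation. A further variable, such as carriages per horse, would be added as another column rather than as another set of year columns.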