From the course: Learning the R Tidyverse

The benefits of long (or tidy) data

- [Instructor] The Tidyverse ecosystem of packages is extremely explicit about the format of data that you need to provide it. That's because tidy, or long-formatted, data yields a number of benefits to you as data scientists and to the developers of other packages that want to work well with the Tidyverse ecosystem. The primary benefit of tidy data is communicated beautifully in this quote from Hadley Wickham: "Tidy datasets are all alike, but every messy dataset is messy in its own way." Tidy datasets mean packages can get on with their job and not have to deal with your mess.

Let's look at a definition of a tidy or long dataset so you understand what we're talking about here. Tidy data meets the following definitions. Individual columns pertain to individual variables. In our example here, we have four variables: Country, Year, Horses per 100 people, and Carriages per horses. Each observation is a unique row in the dataset. And new columns are added for every new variable. Importantly, any…
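To make the column-per-variable, row-per-observation idea concrete, here is a minimal sketch in R. It assumes the tibble and tidyr packages are installed. The table, the country labels, and every number in it are made up for illustration and are not the course's dataset; only the variable names echo the example described above.

```r
library(tibble)
library(tidyr)

# Hypothetical "wide" table: the Year variable is spread across column names,
# so one row holds several observations. All values are invented.
wide <- tribble(
  ~country,    ~`2019`, ~`2020`,
  "Country A", 1.2,     1.3,
  "Country B", 0.8,     0.9
)

# Reshape to long (tidy) form: one column per variable,
# one row per country-year observation.
long <- pivot_longer(
  wide,
  cols = c(`2019`, `2020`),
  names_to = "year",
  values_to = "horses_per_100_people",
  names_transform = list(year = as.integer)
)

print(long)
```

In the long version, a further variable such as carriages per horse would be added as one more column, while a further year would simply add rows.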
