From the course: SQL for Exploratory Data Analysis Essential Training

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Imputing missing values

Imputing missing values - PostgreSQL Tutorial

From the course: SQL for Exploratory Data Analysis Essential Training

Start my 1-month free trial

Imputing missing values

- Now we are going to look at inputting missing values. This is the process of filling in missing values with a reasonable value. So now that we've identified missing values in our dataset, we have to decide what to do about that missing data. There are several options. First, we could just continue without replacing the missing values. We could define a default value and use that for all of the missing values in a column. We could compute a value based on neighboring rows. Or, we could use more advanced techniques, like regression. We could continue our analysis and not treat records with missing values any different than any other record. Now there are some advantages to this. For example, it makes no extra work for us. But, there are disadvantages. For example, rows with missing values will not contribute to anything to the total so that could throw averages off. Also, it may not be possible to perform some calculations, such as, using division. So for example, if we wanted to…

Contents