In this video, Mark Niemann-Ross demonstrates how to import data from a PDF file. Learn how to use the pdftools package to import data into R.
- [Instructor] You can import data from a PDF, but what you get will require a lot of cleanup. Let's take a look at how this actually works. First, let's look at the PDF that we're going to import data from. This PDF has 11 pages, and it has tables, lots of numbers, and graphs, so the data is scattered throughout the document. To bring that data into R, we're going to use pdftools.
And pdftools, in turn, relies on something called the Poppler C++ library. It has to be greater than 0.63. You may encounter odd error messages declaring that it needs to be greater than 65. You may wind up reinstalling Poppler on your computer in order for this to work. With that in mind, let's dive in. The first thing you'll want to do is run install.packages for pdftools, and then use the library command to bring pdftools into play.
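The setup described here can be sketched roughly as follows. This is a minimal sketch, not the instructor's exact script: the helper `poppler_newer_than` is hypothetical, and the 0.63 threshold is simply the number mentioned in the video, so the version your copy of pdftools actually needs may differ.

```r
# One-time setup: install pdftools from CRAN if it is missing.
# Guarded with interactive() because installation needs network access.
if (!requireNamespace("pdftools", quietly = TRUE) && interactive()) {
  install.packages("pdftools")
}

# Hypothetical helper: is a Poppler version string newer than a minimum?
# The default of "0.63" is the figure quoted in the video (an assumption).
poppler_newer_than <- function(version, minimum = "0.63") {
  package_version(version) > package_version(minimum)
}

if (requireNamespace("pdftools", quietly = TRUE)) {
  library(pdftools)

  # poppler_config() reports the Poppler build that pdftools is linked against.
  poppler_version <- poppler_config()$version
  message("Poppler version: ", poppler_version)

  if (!poppler_newer_than(poppler_version)) {
    warning("Poppler may be too old for pdftools; consider reinstalling it.")
  }
}
```

If the warning appears, or pdftools fails to load with a version complaint, updating Poppler through your system's package manager and then reinstalling pdftools is usually the next thing to try.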
Once we've successfully done that, we can use the pdf_text command, and I'm going to put the result into a vector called text from PDF. The command looks like pdf_text.
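As a rough illustration of this step, the sketch below calls `pdf_text()` and then breaks one page into lines. The file name `report.pdf`, the variable spelling `text_from_pdf`, and the helper `split_page_into_lines` are assumptions made for the example; the video does not show them in this transcript.

```r
# Hypothetical helper: split the text of one page into trimmed lines.
# pdf_text() returns each page as a single string with embedded newlines.
split_page_into_lines <- function(page_text) {
  trimws(strsplit(page_text, "\n", fixed = TRUE)[[1]])
}

pdf_path <- "report.pdf"  # hypothetical file name; use your own PDF here

if (requireNamespace("pdftools", quietly = TRUE) && file.exists(pdf_path)) {
  # pdf_text() returns a character vector with one element per page.
  text_from_pdf <- pdftools::pdf_text(pdf_path)

  # The number of elements should match the number of pages in the PDF.
  message("Pages read: ", length(text_from_pdf))

  # Look at the first few lines of page 1 to see how much cleanup is needed.
  first_page_lines <- split_page_into_lines(text_from_pdf[1])
  print(head(first_page_lines))
}
```

Tables usually arrive as lines of text with runs of spaces between the columns, which is where most of the cleanup mentioned at the start of the video comes from.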
- Name the three types of big data.
- List three considerations used to determine the appropriate R package for Excel.
- Determine the best package used to import entire Excel workbooks.
- Explain how to import standard text files using base R and tidyverse.
- Define the purpose of the foreign package for R.
- Recognize restrictions when working on SAS files in the foreign package.
- Identify the problems involved with extracting data from a PDF in R.
Skill Level Intermediate
R Programming in Data Science: High Volume Data, with Mark Niemann-Ross (1h 25m, Intermediate)
1. Use R with Excel
2. Importing Text Files
3. Understanding the Foreign Package
4. Use R with Popular Data Formats