Easy-to-follow video tutorials help you learn software, creative, and business skills.Become a member
Now you can have Acrobat automatically run OCR on scans as you scan them in or on scans as you convert them to a PDF, but you can also start with just a regular picture PDF and then run the OCR directly from within Acrobat while you're looking at it. So let's see how that's done. If we go to Create, we are going to create a PDF from a file again, as we did in the previous video. And I'll use the magazinescan.tif again, but this time in Settings, I'm going to turn off OCR and Scan Optimization.
And I'll click OK and then click Open. So it has converted the document to a PDF-- it's magazinescan.pdf--but it's a picture. Let me zoom out a bit. If I have the text selection tool, I can't select any text; it just recognizes it as one big picture. So to convert this to a recognizable PDF that has text that I can select, you would run the command from the Tools menu under Recognize text.
So we want to run it from in this file, and I am going to click In This File, and it says, "Which page is the current page," and again, here are the Settings that we've looked at before, so we can choose Edit. What is the primary OCR language? What language should it use? What language is this document in? It's in English (US), so we will leave it there. What is the PDF output style? And I had two examples to show you, but we have Searchable Image. Searchable Image (Exact), and Clear Scan. You will see what that difference is in a bit. I'm going to leave it at Searchable Image, which is the one you would normally want to use.
And then any kind of down sampling that you want to do, you can choose that option as well. I am going to leave it high-res at 600. Click OK and then click OK here. So it deskews the image, it rotates it, it does its image processing, it does its OCR, and then we ended up with actual text that you can swipe over and select, and you can search on it. So that's how you do it from within Acrobat. You don't have to do it at the same time that you're converting, or while you're scanning.
You can take any file that is currently in image and convert it into searchable text. Now let's look at those different kinds of searchable text. I am going to close this for a second. And we don't need to save any changes. This time I am going to open a couple of examples that I saved, and these are in your samples file, if you want to take a look. I will leave them at this size, but I am going to show them to you side by side by going to the Window menu and tiling them vertically. And we can close Tools there.
So on the left, we made this into a PDF using the Searchable Image setting, and this one is Clear Scan. And if we zoom in--I am pressing Command+Plus or Ctrl+Plus a few times-- I think that you can see that the text in the Clear Scan one is cleaner than the text in the Searchable Image one. Can you see that? They both are actual text, but this one seems cleaner. So why would you ever want to choose Searchable Image, rather than Clear Scan as the kind of OCR output? Well, because Searchable Image is actually two things in one. It's the image; it's the actual scan that is sitting on top of the type.
So when you do this, you are selecting type that is behind these letters, and so this Searchable Image is the closest to the original that you can get, but still be searchable as far as text is concerned; whereas this one doesn't have any image data in it at all for the type. It's been replaced by actual characters, so it might not be exactly as true as what was in the scan. It's just clearer to read. For example, if there was a typeface that Adobe didn't recognize that was used here, it could not show you that typeface if it doesn't have that typeface loaded, so it's not going to be that close to the original if you use Clear Scan.
The third type of PDF output that you can make is called Searchable Image Exact, which gets even closer to the original. It would keep it tilted. It wouldn't deskew it. It would look exactly like the original scan; however, it would still be text behind there. So those are your choices when you do OCR. Do you want Clear Scan? Do you want a Searchable Image? I would say in my experience, most people use Searchable Image, because they want to keep the look of the scan, but they also wanted to do double duty as an actual PDF that's searchable and index-able.
Get unlimited access to all courses for just $25/month.Become a member
82 Video lessons · 87373 Viewers
80 Video lessons · 136388 Viewers
59 Video lessons · 55031 Viewers
52 Video lessons · 68901 Viewers
Access exercise files from a button right under the course name.
Search within course videos and transcripts, and jump right to the results.
Remove icons showing you already watched videos if you want to start over.
Make the video wide, narrow, full-screen, or pop the player out of the page into its own window.
Click on text in the transcript to jump to that spot in the video. As the video plays, the relevant spot in the transcript will be highlighted.