Question: How can I OCR PDF documents? Answer: PDF Studio can OCR both existing documents and newly scanned documents. See the steps below. OCR an Existing Document: In PDF Studio, open the existing document you want to OCR. Navigate to the Document Tab > […]
Category: OCR
OCR: Optical Character Recognition / Text recognition
Video: How to OCR a PDF in PDF Studio
This video shows how to OCR a PDF in PDF Studio. Video Transcript: Hi, today I’m going to show you how to enhance a scanned document and interact with the text in that document using the OCR feature in PDF Studio. As you can see, the document is a little tilted and you’re unable to […]
Fix Scan Tool
The Fix Scan tool (introduced in PDF Studio 2019) allows you to perform various repair functions on scanned documents. This is useful if you do not have access to the original document or scanner to rescan and fix alignment or size issues. The options include: OCR – Add searchable text to the pages; Optimize – […]
How to manually install OCR languages
Q: How can I manually install the OCR languages in PDF Studio? A: First, it’s recommended that you download the OCR packages directly through PDF Studio, as they will be the most up to date and this prevents possible issues. See OCR language download troubleshooting. If the above still does not work you can try to […]
“Discard invisible text” option in PDF Studio
Q: I want to perform a fresh OCR of all pages. Some pages already have invisible text. How can I remove this text and OCR again? A: This option is available in PDF Studio 12 and above. It removes any previous OCR text that has been added to the page. To use this option, follow the steps below: […]
OCR two different languages at once
Q: Can I OCR two different languages at once in PDF Studio? A: Starting in PDF Studio 11, you can OCR two different languages at once by following the instructions below: 1. Download the languages that you need. Go to Edit -> Preferences -> OCR. Select Download OCR Languages. Check the languages that […]
Could not initialize tesseract / OCRBridge when OCRing a document
Q: When I OCR a document, I get errors such as “Could not initialize tesseract”, “OCR library is not loaded: null”, “unable to initiate OCRBridge”. What are these errors and how can I fix them? A: There are 4 possible reasons why you’re seeing one of the errors above: If you […]
OCR for Non-Latin Languages & Multi-Language OCR
PDF Studio 11 comes with a new OCR engine with support for non-Latin and CJK languages. New Latin languages will also be added to the list of available languages. The complete list of new OCR languages can be found below. In addition to the new languages, PDF Studio 11 also has the ability […]
How to proofread and correct OCRed text in a PDF
Q: After running a PDF through OCR, I need to be able to inspect the result and, if necessary, correct it. Is it possible to show the text added by the OCR in PDF Studio? A: We don’t have a specific tool or view to allow users to inspect the OCR text yet, but we […]
OCRing images on a PDF page that already contains text
Q: If I have a mixed content document, containing some text and some images, can PDF Studio OCR the images only? A: PDF Studio can handle mixed content pages, i.e., pages that contain both images and text content. PDF Studio will simply ignore any existing text content and perform OCR on the rest of the page, so […]