OCR Pages

<< Click to Display Table of Contents >>

Navigation:  Tabs Guide > Document >

OCR Pages


 

Editor-Icon OCR Pages

 


 

The optical character recognition (OCR) in PDF-XChange Editor analyzes image-based documents, recognizes text and then makes it selectable and searchable. Click OCR Pages to initiate this operation:

 

11.OCR.pages.location

Figure 1. Document Tab Submenu. OCR Pages

 

The OCR Pages dialog box will open:

 

OCR.Pages.Options

Figure 2. OCR Pages Dialog Box

 

The Pages Range options are as follows:

Select All to OCR all the pages of the document.

Select Current Page to OCR only the current page.

Use the Pages box to determine specific pages of the document on which to perform the OCR process. See here for further information.

Use the Subset option to select All Pages, Odd Pages Only or Even Pages Only.

The Recognition options determine the language and accuracy of the OCR process. Increasing the accuracy increases the time that the process takes and vice versa. Additionally, it should be noted that setting the accuracy to high may result in unusual output if the document on which the operation is carried out features imperfections. This is because the software will search to a greater depth and may attempt to recognise imperfections as text.

The Output options determine the format of the output information from the OCR process:

Select either Create New Searchable PDF or Preserve Original Content and Add Text Layer.

The Quality setting determines the resolution of the new PDF document in dpi (dots per inch).

Select the Auto Deskew option to deskew documents automatically. (Deskewing is a useful feature that straightens images that have been photographed or scanned crookedly).

Use the Scanner Presets page to determine preferences for subsequent use.

Click OK to OCR documents.