PXODocument

<< Click to Display Table of Contents >>

Navigation:  OCR Module > OCR Library Types >

PXODocument


 

PRO SDK Icon PXODocument

 


 

PXODocument is the main document structure that the OCR Module utilizes. PXODocument receives input documents, enables other functions in the OCR Module to manipulate them and contains the output when the process is complete. The following functions in the OCR Module utilize the PXODocument library type. Please note that functions are case-sensitive:

 

OCR_Init sets up a new PXODocument in order to load input files and perform OCR.

OCR_LoadA and OCR_LoadW load input files into the PXODocument object’s input layer.

OCR_GetText processes a PXODocument and then formats and returns the plain text.

OCR_MakeSearchable processes a PXODocument and then generates a new output layer that contains searchable PDF results.

OCR_GetField performs OCR on a PXODocument and then formats and returns the plain text.

OCR_GetFields performs multiple OCRs on a PXODocument and then formats and returns the plain text.

OCR_SetCallBack sets the callback function for the PDF rasterization process of PXODocument structures.

OCRp_Page performs OCR on a specified page of a PXODocument, then returns the results in a structure that can be queried for text layout details.

OCRp_Field performs OCR on a specified area of a PXODocument, then returns the results in a structure that can be queried for text layout details.

OCR_GetNumInputPages returns the number of pages in the input layer of the PXODocument.

OCR_Delete deletes the PXODocument, which is a necessary step once all functions are complete.

 

See Workflow Overview for further information.