OCR_SymbolBox

<< Click to Display Table of Contents >>

Navigation:  OCR Module > OCR Library Types >

OCR_SymbolBox


 

PRO SDK Icon OCR_SymbolBox

 


 

OCR_SymbolBox is a structure that contains a single character and, when available, descriptive information from the OCR process.

 

Use OCRp_GetSymbolFromRegion to access OCR_SymbolBox members.

Asterisks indicate that members are experimental and therefore should be taken with due consideration.

 

Please note that members are case-sensitive:

 

 

MEMBER

 

 

TYPE

 

 

MEANING

 

 

rcBound

 

 

RECT

 

 

The rectangular bounding region of the symbol in rasterized page image coordinates.

 

 

LineBaseline

 

 

OCR_Baseline

 

 

The baseline for the line that contains the symbol. This member is useful for the precise alignment of symbols along a text line.

 

 

rcLineBound

 

 

RECT

 

 

The rectangular bounding region for the symbol’s enclosing line. It is equivalent to the symbol's bounding rectangle in cases of single symbols recognized outside the context of a surrounding line.

 

 

wcSymbol

 

 

wchar_t

 

 

The unicode symbol that OCR recognizes.

 

 

nConfidence*

 

 

float

 

 

This member is an estimation of confidence. Higher values signify higher confidence.

 

 

isbold*

 

 

bool

 

 

A boolean value that defines whether or not text is bold.

 

 

isitalic*

 

 

bool

 

 

A boolean value that defines whether or not text is in italics.

 

 

isunderlined*

 

 

bool

 

 

A boolean value that defines whether or not text is underlined.

 

 

isserif*

 

 

bool

 

 

A boolean value that defines whether or not the text is serif.

 

 

ismonospace*

 

 

bool

 

 

A boolean value that defines whether or not font is monospace.

 

 

issmallcaps*

 

 

bool

 

 

A boolean value that defines whether or not font is a small caps font.

 

 

pointsize*

 

 

SHORT

 

 

This member estimates the font size (in points). Bounding boxes can also be used for this purpose.

 

 

fontid*

 

 

SHORT

 

 

This member is currently unused and reserved for future use.