Initialize an OCR engine.
This will start a Worker in which the OCR operations will actually be performed.
Clear the current image and text recognition results.
This will clear the loaded image data internally, but keep the text recognition model loaded.
At present there is no way to shrink WebAssembly memory, so this will not return the memory used by the image to the OS/browser. To release that memory, the Web Worker must be shut down via destroy.
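The constraint above can be demonstrated directly with the standard WebAssembly JavaScript API: a memory can grow at runtime but exposes no way to shrink, which is why terminating the worker (destroy) is the only way to hand memory back to the host.

```typescript
// Demonstrates the limitation described above: a WebAssembly.Memory can
// grow but never shrink, so memory reserved for a large image stays
// allocated until the instance (for the OCR engine, its worker) is
// torn down entirely.
const memory = new WebAssembly.Memory({ initial: 1 }); // 1 page = 64 KiB
const before = memory.buffer.byteLength; // 65536 bytes

memory.grow(15); // e.g. the engine growing its heap to hold an image
const after = memory.buffer.byteLength; // now 16 pages = 1 MiB

// There is no memory.shrink(): `after` stays at 1 MiB even once the
// image data is no longer needed, matching the note about destroy().
```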
Perform layout analysis on the current image, if not already done, and return bounding boxes for a given unit of text.
This operation is relatively cheap compared to text recognition, so it can return results much faster when only the locations of lines, words, etc. on the page are needed, rather than their text content.
Perform layout analysis and text recognition on the current image, if not already done, and return the image's text in hOCR format (see https://en.wikipedia.org/wiki/HOCR).
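hOCR embeds the recognized text and its layout in ordinary HTML, using class names such as ocr_page, ocr_line and ocrx_word, with bounding boxes carried in the title attribute. A minimal illustrative fragment (coordinates and text are made up, not real engine output):

```html
<div class="ocr_page" title="bbox 0 0 640 480">
  <span class="ocr_line" title="bbox 36 92 580 122">
    <span class="ocrx_word" title="bbox 36 92 120 122">Hello</span>
    <span class="ocrx_word" title="bbox 130 92 240 122">world</span>
  </span>
</div>
```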
Attempt to determine the orientation of the image.
This currently uses a simplistic algorithm [1] which is designed for non-uppercase Latin text. It will likely perform badly for other scripts or if the text is all uppercase.
[1] See http://www.leptonica.org/papers/skew-measurement.pdf
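The general idea behind such skew measurement can be sketched as a projection-profile search: shear the image by candidate angles and keep the angle that concentrates "ink" into the fewest rows. This is an illustrative reimplementation of the approach described in the cited paper, not the library's actual code; the function names and the angle range are assumptions.

```typescript
// Illustrative projection-profile skew estimation (not the library's
// actual implementation). `pixels` is a binarized w*h image, 1 = ink.
function profileScore(pixels: Uint8Array, w: number, h: number, angleDeg: number): number {
  const tan = Math.tan((angleDeg * Math.PI) / 180);
  const rowCounts = new Map<number, number>();
  for (let y = 0; y < h; y++) {
    for (let x = 0; x < w; x++) {
      if (pixels[y * w + x]) {
        // Shear each column vertically by x * tan(angle).
        const row = Math.round(y + x * tan);
        rowCounts.set(row, (rowCounts.get(row) ?? 0) + 1);
      }
    }
  }
  // Sum of squared row counts peaks when ink concentrates in few rows,
  // i.e. when the shear has made the text lines horizontal.
  let score = 0;
  for (const c of rowCounts.values()) score += c * c;
  return score;
}

// Returns the shear angle (degrees) that best deskews the image; this
// is the negative of the text's skew angle.
function estimateDeskewAngle(pixels: Uint8Array, w: number, h: number): number {
  let best = 0;
  let bestScore = -Infinity;
  for (let a = -5; a <= 5; a += 0.5) {
    const s = profileScore(pixels, w, h, a);
    if (s > bestScore) {
      bestScore = s;
      best = a;
    }
  }
  return best;
}
```

The cited paper describes a more refined variant of this idea (coarse-to-fine angle sweeps and differential scoring), but the scoring principle is the same, and it is easy to see why it breaks down for scripts without strong horizontal baselines or for all-uppercase text.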
Perform layout analysis and text recognition on the current image, if not already done, and return the image's text as a string.
Perform layout analysis and text recognition on the current image, if not already done, and return bounding boxes and text content for a given unit of text.
Load an image into the OCR engine for processing.
Load a trained model for a specific language. This can be specified either as a URL to fetch or a buffer containing an already-loaded model.
High-level async API for performing document image layout analysis and OCR.
In the browser, this class can be constructed directly. In Node, use the createOCRClient helper from node-worker.js.
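The typical lifecycle implied by the methods above is: load a model once, load an image, recognize, then clear the image (or destroy the client to release WASM memory). A hedged sketch of that flow follows; the method names come from this documentation, but the exact signatures, and the FakeOCRClient used in place of the real client, are assumptions for illustration only.

```typescript
// Assumed shape of the high-level client (signatures are illustrative,
// not verified against the library's type definitions).
interface OCRClientLike {
  loadModel(model: string | ArrayBuffer): Promise<void>;
  loadImage(image: unknown): Promise<void>;
  getText(): Promise<string>;
  clearImage(): Promise<void>;
  destroy(): void;
}

// Stand-in implementation so the flow below is runnable without the
// real OCR engine; it records calls and returns canned text.
class FakeOCRClient implements OCRClientLike {
  calls: string[] = [];
  async loadModel(_model: string | ArrayBuffer) { this.calls.push("loadModel"); }
  async loadImage(_image: unknown) { this.calls.push("loadImage"); }
  async getText() { this.calls.push("getText"); return "Hello OCR"; }
  async clearImage() { this.calls.push("clearImage"); }
  destroy() { this.calls.push("destroy"); }
}

// Typical lifecycle: load a model once, process an image, then clear it;
// call destroy() when completely finished to shut down the worker and
// release its WebAssembly memory.
async function recognize(client: OCRClientLike, model: string, image: unknown): Promise<string> {
  await client.loadModel(model);
  await client.loadImage(image);
  try {
    return await client.getText();
  } finally {
    await client.clearImage();
  }
}
```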