Class TesseractOCRParser

  • All Implemented Interfaces:
    Serializable, Initializable, Parser

    public class TesseractOCRParser
    extends AbstractParser
    implements Initializable
    TesseractOCRParser powered by tesseract-ocr engine. To enable this parser, create a TesseractOCRConfig object and pass it through a ParseContext. Tesseract-ocr must be installed and on system path or the path to its root folder must be provided:

    TesseractOCRConfig config = new TesseractOCRConfig();
    //Needed if tesseract is not on system path
    parseContext.set(TesseractOCRConfig.class, config);

    See Also:
    Serialized Form