public static class PDFParserConfig.OCRStrategyAuto extends Object implements Serializable
OCR is performed on a page if the total number of characters on the page is less than this.totalCharsPerPage, or if the number of unmapped Unicode characters on the page is greater than this.unmappedUnicodeCharsPerPage.
If unmappedUnicodeCharsPerPage is an integer > 0, it is compared against the absolute number of unmapped Unicode characters on the page. If it is a float < 1, it is treated as a percentage and compared against the ratio of unmapped Unicode characters to total characters on the page.
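The rule described above can be sketched as a small standalone predicate. This is only an illustration of the documented behavior, not Tika's actual implementation: the class name `OcrAutoSketch`, the method `shouldOcr`, and its parameter names are hypothetical, and the handling of an empty page in the ratio case is an assumption.

```java
// Hypothetical illustration of the documented decision rule; not part of Tika.
class OcrAutoSketch {

    /**
     * Returns true if a page with the given character counts would be sent to OCR
     * under the rule described in the class comment.
     *
     * @param totalChars         total characters found on the page
     * @param unmappedChars      unmapped Unicode characters found on the page
     * @param unmappedThreshold  stands in for unmappedUnicodeCharsPerPage
     * @param totalThreshold     stands in for totalCharsPerPage
     */
    static boolean shouldOcr(int totalChars, int unmappedChars,
                             float unmappedThreshold, int totalThreshold) {
        // Too little extractable text on the page: run OCR.
        if (totalChars < totalThreshold) {
            return true;
        }
        if (unmappedThreshold < 1.0f) {
            // Threshold below 1 is read as a ratio of unmapped to total characters.
            if (totalChars == 0) {
                // Assumption: an empty page has no meaningful ratio, so no OCR here.
                return false;
            }
            float ratio = (float) unmappedChars / totalChars;
            return ratio > unmappedThreshold;
        }
        // Otherwise the threshold is an absolute count of unmapped characters.
        return unmappedChars > unmappedThreshold;
    }

    public static void main(String[] args) {
        // Absolute threshold: more than 10 unmapped characters, or fewer than 100 total.
        System.out.println(shouldOcr(50, 0, 10f, 100));     // sparse page
        System.out.println(shouldOcr(1000, 20, 10f, 100));  // many unmapped characters
        // Ratio threshold: more than 10% of the characters unmapped.
        System.out.println(shouldOcr(1000, 50, 0.1f, 100)); // 5% unmapped
    }
}
```

With an absolute threshold, the page length does not affect the unmapped-character check; with a ratio threshold, longer pages tolerate proportionally more unmapped characters before OCR is triggered.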
Constructor and Description |
---|
OCRStrategyAuto(float unmappedUnicodeCharsPerPage, int totalCharsPerPage) |
Modifier and Type | Method and Description |
---|---|
int | getTotalCharsPerPage() |
float | getUnmappedUnicodeCharsPerPage() |
Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.