Package org.apache.tika.parser.pdf
Class    Description
Configuration for OCR processing in PDF parsing.
Configuration for AUTO strategy behavior.
This counts the number of pages that OCR would have been run or was run depending on the settings.
This was added in Tika 1.24 as an alpha version of a text extractor that builds the text from the marked text tree and includes/normalizes some of the structural tags.
PDF parser.
Config for PDFParser.
Mode for checking document access permissions.