Package org.apache.tika.parser.pdf
Class descriptions:

Checks whether a document allows extraction generally or for accessibility only.
Counts the number of pages on which OCR was run, or would have been run, depending on the settings.
Added in Tika 1.24 as an alpha version of a text extractor that builds the text from the marked text tree and includes/normalizes some of the structural tags.
PDF parser.
Config for PDFParser.
Encapsulates the numbers used to control the OCR strategy when it is set to auto.