Uses of Class
org.apache.tika.config.Field
-
Uses of Field in org.apache.tika.parser.html
Methods in org.apache.tika.parser.html with annotations of type Field
Modifier and Type | Method | Description
void | HtmlParser.setExtractScripts(boolean extractScripts) | Whether or not to extract contents in script entities.
void | HtmlEncodingDetector.setMarkLimit(int markLimit) | How far into the stream to read for charset detection.
-
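The setMarkLimit setters listed in this document bound how many bytes a detector may read before the stream is rewound. A minimal sketch of that mark/read/reset pattern follows, using only java.io; the class MarkLimitSketch and the method peek are hypothetical names for illustration and are not part of Tika.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper, not a Tika class: illustrates what a mark limit bounds.
class MarkLimitSketch {

    // Reads at most markLimit bytes for inspection, then rewinds the stream
    // so that a later consumer still sees it from the marked position.
    static byte[] peek(InputStream in, int markLimit) {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        try {
            in.mark(markLimit);
            byte[] buf = new byte[markLimit];
            int total = 0;
            while (total < markLimit) {
                int n = in.read(buf, total, markLimit - total);
                if (n < 0) {
                    break; // end of stream reached before the limit
                }
                total += n;
            }
            in.reset(); // rewind to the marked position
            return Arrays.copyOf(buf, total);
        } catch (IOException e) {
            // Wrapped only to keep the sketch short.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] data = "<html><head><meta charset=\"utf-8\"></head></html>"
                .getBytes(StandardCharsets.US_ASCII);
        ByteArrayInputStream in = new ByteArrayInputStream(data);
        byte[] head = peek(in, 16);
        System.out.println(new String(head, StandardCharsets.US_ASCII));
        // The stream has been rewound, so the next byte read is the first one again.
        System.out.println((char) in.read());
    }
}
```

A larger limit lets a detector see more of the document, for example a charset declaration that appears late in the head, at the cost of buffering more bytes.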
Uses of Field in org.apache.tika.parser.html.charsetdetector
Methods in org.apache.tika.parser.html.charsetdetector with annotations of type Field
Modifier and Type | Method | Description
void | StandardHtmlEncodingDetector.setMarkLimit(int markLimit) | How far into the stream to read for charset detection.
-
Uses of Field in org.apache.tika.parser.microsoft
Methods in org.apache.tika.parser.microsoft with annotations of type Field
Modifier and Type | Method | Description
void | AbstractOfficeParser.setByteArrayMaxOverride(int maxOverride) | WARNING: this sets a static variable in POI.
void | AbstractOfficeParser.setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns) |
void | AbstractOfficeParser.setExtractAllAlternativesFromMSG(boolean extractAllAlternativesFromMSG) | Some .msg files can contain body content in html, rtf and/or text.
void | AbstractOfficeParser.setExtractMacros(boolean extractMacros) |
void | AbstractOfficeParser.setIncludeDeletedContent(boolean includeDeletedConent) |
void | AbstractOfficeParser.setIncludeMoveFromContent(boolean includeMoveFromContent) |
void | AbstractOfficeParser.setIncludeShapeBasedContent(boolean includeShapeBasedContent) |
void | AbstractOfficeParser.setUseSAXDocxExtractor(boolean useSAXDocxExtractor) |
void | AbstractOfficeParser.setUseSAXPptxExtractor(boolean useSAXPptxExtractor) |
-
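The warning on setByteArrayMaxOverride says the setter writes a static variable in POI. The sketch below illustrates why that matters in general: a value set through one instance becomes visible to every other instance in the same JVM. SharedLimit and OfficeParserSketch are hypothetical stand-ins, not the POI or Tika classes, and the default of -1 is an assumption of the sketch.

```java
// Hypothetical stand-in for a library-wide static setting (not the POI class).
class SharedLimit {
    private static int maxOverride = -1; // assumed default meaning "no override"

    static void setMaxOverride(int value) {
        maxOverride = value;
    }

    static int getMaxOverride() {
        return maxOverride;
    }
}

// Hypothetical parser whose instance setter writes to the shared static.
class OfficeParserSketch {

    public void setByteArrayMaxOverride(int maxOverride) {
        // Looks like per-instance configuration, but changes process-wide state.
        SharedLimit.setMaxOverride(maxOverride);
    }

    public int effectiveLimit() {
        return SharedLimit.getMaxOverride();
    }

    public static void main(String[] args) {
        OfficeParserSketch first = new OfficeParserSketch();
        OfficeParserSketch second = new OfficeParserSketch();
        first.setByteArrayMaxOverride(100_000_000);
        // The second instance was never configured, yet it observes the new value.
        System.out.println(second.effectiveLimit());
    }
}
```

In practice this means the last configuration applied wins for all parsers in the process, so the setting is best treated as global rather than per-parser.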
Uses of Field in org.apache.tika.parser.ocr
Methods in org.apache.tika.parser.ocr with annotations of type Field
Modifier and Type | Method | Description
void | TesseractOCRParser.setApplyRotation(boolean applyRotation) |
void | TesseractOCRParser.setColorspace(String colorspace) |
void | TesseractOCRParser.setDensity(int density) |
void | TesseractOCRParser.setDepth(int depth) |
void | TesseractOCRParser.setEnableImageProcessing(int enableImageProcessing) |
void | TesseractOCRParser.setFilter(String filter) |
void | TesseractOCRParser.setImageMagickPath(String imageMagickPath) |
void | TesseractOCRParser.setLanguage(String language) |
void | TesseractOCRParser.setMaxFileSizeToOcr(long maxFileSizeToOcr) |
void | TesseractOCRParser.setMinFileSizeToOcr(long minFileSizeToOcr) |
void | TesseractOCRParser.setOutputType(String outputType) |
void | TesseractOCRParser.setPageSegMode(String pageSegMode) |
void | TesseractOCRParser.setPreserveInterwordSpacing(boolean preserveInterwordSpacing) |
void | TesseractOCRParser.setResize(int resize) |
void | TesseractOCRParser.setTessdataPath(String tessdataPath) |
void | TesseractOCRParser.setTesseractPath(String tesseractPath) |
void | TesseractOCRParser.setTimeout(int timeout) |
-
Uses of Field in org.apache.tika.parser.pdf
Methods in org.apache.tika.parser.pdf with annotations of type Field
Modifier and Type | Method | Description
void | PDFParser.setMaxMainMemoryBytes(long maxMainMemoryBytes) |
void | PDFParser.setOcrImageType(String imageType) |
void | PDFParser.setOcrStrategy(String ocrStrategyString) |
void | PDFParser.setSortByPosition(boolean v) | Deprecated.
-
Uses of Field in org.apache.tika.parser.pkg
Methods in org.apache.tika.parser.pkg with annotations of type Field
Modifier and Type | Method | Description
void | CompressorParser.setMemoryLimitInKb(int memoryLimitInKb) |
-
Uses of Field in org.apache.tika.parser.recognition
Methods in org.apache.tika.parser.recognition with annotations of type Field
Modifier and Type | Method | Description
void | ObjectRecognitionParser.setRecogniser(String recogniserClass) |
-
Uses of Field in org.apache.tika.parser.recognition.tf
Fields in org.apache.tika.parser.recognition.tf with annotations of type Field
Modifier and Type | Field | Description
protected URI | TensorflowRESTRecogniser.apiBaseUri |
protected double | TensorflowRESTRecogniser.minConfidence |
protected int | TensorflowRESTRecogniser.topN |
-
Uses of Field in org.apache.tika.parser.rtf
Methods in org.apache.tika.parser.rtf with annotations of type Field
Modifier and Type | Method | Description
void | RTFParser.setMemoryLimitInKb(int memoryLimitInKb) |
-
Uses of Field in org.apache.tika.parser.txt
Methods in org.apache.tika.parser.txt with annotations of type Field
Modifier and Type | Method | Description
void | Icu4jEncodingDetector.setMarkLimit(int markLimit) | How far into the stream to read for charset detection.
void | UniversalEncodingDetector.setMarkLimit(int markLimit) | How far into the stream to read for charset detection.
void | Icu4jEncodingDetector.setStripMarkup(boolean stripMarkup) | Whether or not to attempt to strip html-ish markup from the stream before sending it to the underlying detector.
-
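The setStripMarkup description refers to removing html-ish markup before the bytes reach the underlying detector, so that tag names do not skew the statistics. The sketch below is one naive way to do that at the byte level; StripMarkupSketch and stripTags are hypothetical names, this is not the filter the detector actually uses, and it assumes an ASCII-compatible encoding in which '<' and '>' are single bytes.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not a Tika or ICU class: a naive byte-level tag stripper.
class StripMarkupSketch {

    // Drops every byte from a '<' up to and including the next '>'.
    // A '>' that does not close a tag is kept as ordinary content.
    static byte[] stripTags(byte[] input) {
        ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
        boolean inTag = false;
        for (byte b : input) {
            if (b == '<') {
                inTag = true;
            } else if (b == '>' && inTag) {
                inTag = false;
            } else if (!inTag) {
                out.write(b);
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        byte[] raw = "<p>Some <b>text</b> to classify</p>"
                .getBytes(StandardCharsets.US_ASCII);
        byte[] stripped = stripTags(raw);
        System.out.println(new String(stripped, StandardCharsets.US_ASCII));
    }
}
```

The stripped bytes, rather than the raw ones, would then be handed to whatever charset detector is in use.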
Uses of Field in org.apache.tika.parser.wordperfect
Methods in org.apache.tika.parser.wordperfect with annotations of type Field
Modifier and Type | Method | Description
void | WordPerfectParser.setIncludeDeletedContent(boolean includeDeletedContent) | Whether or not to include deleted content.
-
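Every entry in this listing is a setter or field carrying the Field annotation. As a rough illustration of how annotation-driven configuration of this kind can work, the sketch below applies named parameters to annotated setters via reflection. ConfigField, DemoParser and ConfigFieldSketch are hypothetical stand-ins, not Tika classes, and deriving the parameter name from the setter name is an assumption of the sketch rather than a statement about Tika's behaviour.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Method;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a configuration annotation (not org.apache.tika.config.Field).
@Retention(RetentionPolicy.RUNTIME)
@Target({ElementType.METHOD, ElementType.FIELD})
@interface ConfigField {
    String name() default "";
}

// Hypothetical parser exposing two configurable setters.
class DemoParser {
    private boolean extractScripts = false;
    private int markLimit = 8192;

    @ConfigField
    public void setExtractScripts(boolean extractScripts) {
        this.extractScripts = extractScripts;
    }

    @ConfigField
    public void setMarkLimit(int markLimit) {
        this.markLimit = markLimit;
    }

    public boolean isExtractScripts() {
        return extractScripts;
    }

    public int getMarkLimit() {
        return markLimit;
    }
}

class ConfigFieldSketch {

    // Invokes each annotated one-argument setter whose parameter name is present in params.
    static void apply(Object target, Map<String, Object> params) {
        for (Method m : target.getClass().getMethods()) {
            ConfigField ann = m.getAnnotation(ConfigField.class);
            if (ann == null || m.getParameterCount() != 1) {
                continue;
            }
            String name = ann.name().isEmpty() ? nameFromSetter(m.getName()) : ann.name();
            if (!params.containsKey(name)) {
                continue;
            }
            try {
                m.invoke(target, params.get(name));
            } catch (ReflectiveOperationException e) {
                throw new IllegalStateException("could not set " + name, e);
            }
        }
    }

    // Assumed convention for this sketch: setMarkLimit -> markLimit.
    static String nameFromSetter(String setter) {
        String base = setter.startsWith("set") && setter.length() > 3
                ? setter.substring(3) : setter;
        return Character.toLowerCase(base.charAt(0)) + base.substring(1);
    }

    public static void main(String[] args) {
        Map<String, Object> params = new HashMap<>();
        params.put("markLimit", 65536);
        params.put("extractScripts", true);
        DemoParser parser = new DemoParser();
        apply(parser, params);
        System.out.println(parser.getMarkLimit() + " " + parser.isExtractScripts());
    }
}
```

Parameters with no matching annotated setter are ignored in this sketch; a stricter loader might reject them instead.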