Modifier and Type | Method and Description |
---|---|
void |
HtmlParser.setExtractScripts(boolean extractScripts)
Whether or not to extract contents in script entities.
|
void |
HtmlEncodingDetector.setMarkLimit(int markLimit)
How far into the stream to read for charset detection.
|
Modifier and Type | Method and Description |
---|---|
void |
AbstractOfficeParser.setConcatenatePhoneticRuns(boolean concatenatePhoneticRuns) |
void |
AbstractOfficeParser.setExtractAllAlternativesFromMSG(boolean extractAllAlternativesFromMSG)
Some .msg files can contain body content in html, rtf and/or text.
|
void |
AbstractOfficeParser.setExtractMacros(boolean extractMacros) |
void |
AbstractOfficeParser.setIncludeDeletedContent(boolean includeDeletedConent) |
void |
AbstractOfficeParser.setIncludeMoveFromContent(boolean includeMoveFromContent) |
void |
AbstractOfficeParser.setIncludeShapeBasedContent(boolean includeShapeBasedContent) |
void |
AbstractOfficeParser.setUseSAXDocxExtractor(boolean useSAXDocxExtractor) |
void |
AbstractOfficeParser.setUseSAXPptxExtractor(boolean useSAXPptxExtractor) |
Modifier and Type | Method and Description |
---|---|
void |
PDFParser.setOcrImageType(String imageType) |
void |
PDFParser.setOcrStrategy(String ocrStrategyString) |
void |
PDFParser.setSortByPosition(boolean v)
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
void |
CompressorParser.setMemoryLimitInKb(int memoryLimitInKb) |
Modifier and Type | Method and Description |
---|---|
void |
ObjectRecognitionParser.setRecogniser(String recogniserClass) |
Modifier and Type | Field and Description |
---|---|
protected URI |
TensorflowRESTRecogniser.apiBaseUri |
protected double |
TensorflowRESTRecogniser.minConfidence |
protected int |
TensorflowRESTRecogniser.topN |
Modifier and Type | Method and Description |
---|---|
void |
RTFParser.setMemoryLimitInKb(int memoryLimitInKb) |
Modifier and Type | Method and Description |
---|---|
void |
UniversalEncodingDetector.setMarkLimit(int markLimit)
How far into the stream to read for charset detection.
|
void |
Icu4jEncodingDetector.setMarkLimit(int markLimit)
How far into the stream to read for charset detection.
|
void |
Icu4jEncodingDetector.setStripMarkup(boolean stripMarkup)
Whether or not to attempt to strip html-ish markup
from the stream before sending it to the underlying
detector.
|
Copyright © 2007–2017 The Apache Software Foundation. All rights reserved.