Modifier and Type | Method and Description |
---|---|
protected void |
FileResourceConsumer.parse(String resourceId,
Parser parser,
InputStream is,
ContentHandler handler,
Metadata m,
ParseContext parseContext)
Utility method to handle logging equivalently among all
implementing classes.
|
Modifier and Type | Method and Description |
---|---|
List<RecognisedObject> |
DL4JInceptionV3Net.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<RecognisedObject> |
DL4JVGG16Net.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ExternalEmbedder.embed(Metadata metadata,
InputStream inputStream,
OutputStream outputStream,
ParseContext context)
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
|
void |
Embedder.embed(Metadata metadata,
InputStream originalStream,
OutputStream outputStream,
ParseContext context)
Embeds related document metadata from the given metadata object into the
given output stream.
|
Set<MediaType> |
ExternalEmbedder.getSupportedEmbedTypes(ParseContext context) |
Set<MediaType> |
Embedder.getSupportedEmbedTypes(ParseContext context)
Returns the set of media types supported by this embedder when used with
the given parse context.
|
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
PrescriptionParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
EncryptedPrescriptionParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
DirListParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
PrescriptionParser.getSupportedTypes(ParseContext context) |
void |
EncryptedPrescriptionParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
LanguageDetectingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
DirListParser.parse(InputStream is,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
static EmbeddedDocumentExtractor |
EmbeddedDocumentUtil.getEmbeddedDocumentExtractor(ParseContext context)
This offers a uniform way to get an EmbeddedDocumentExtractor from a ParseContext.
|
static Parser |
EmbeddedDocumentUtil.tryToFindExistingLeafParser(Class clazz,
ParseContext context)
Tries to find an existing parser within the ParseContext.
|
Constructor and Description |
---|
EmbeddedDocumentUtil(ParseContext context) |
ParsingEmbeddedDocumentExtractor(ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ForkParser.getSupportedTypes(ParseContext context) |
void |
ForkParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
DigestingParser.Digester.digest(InputStream is,
Metadata m,
ParseContext parseContext)
Digests an InputStream and sets the appropriate value(s) in the metadata.
|
Map<MediaType,List<Parser>> |
CompositeParser.findDuplicateParsers(ParseContext context)
Utility method that goes through all the component parsers and finds
all media types for which more than one parser declares support.
|
protected Parser |
DelegatingParser.getDelegateParser(ParseContext context)
Returns the parser instance to which parsing tasks should be delegated.
|
protected EncodingDetector |
AbstractEncodingDetectorParser.getEncodingDetector(ParseContext parseContext)
Look for an EncodingDetetor in the ParseContext.
|
protected Parser |
CompositeParser.getParser(Metadata metadata,
ParseContext context) |
Map<MediaType,Parser> |
DefaultParser.getParsers(ParseContext context) |
Map<MediaType,Parser> |
CompositeParser.getParsers(ParseContext context) |
Set<MediaType> |
RecursiveParserWrapper.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ParserDecorator.getSupportedTypes(ParseContext context)
Delegates the method call to the decorated parser.
|
Set<MediaType> |
Parser.getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used
with the given parse context.
|
Set<MediaType> |
NetworkParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ErrorParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
EmptyParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
DelegatingParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CryptoParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CompositeParser.getSupportedTypes(ParseContext context) |
void |
RecursiveParserWrapper.parse(InputStream stream,
ContentHandler ignore,
Metadata metadata,
ParseContext context)
Acts like a regular parser except it ignores the ContentHandler
and it automatically sets/overwrites the embedded Parser in the
ParseContext object.
|
void |
ParserPostProcessor.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Forwards the call to the delegated parser and post-processes the
results as described above.
|
void |
ParserDecorator.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the method call to the decorated parser.
|
void |
Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events.
|
void |
NetworkParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
ErrorParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
EmptyParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
DigestingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
DelegatingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Looks up the delegate parser from the parsing context and
delegates the parse operation to it.
|
void |
CryptoParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
CompositeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the call to the matching component parser.
|
void |
AutoDetectParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Constructor and Description |
---|
ParsingReader(Parser parser,
InputStream stream,
Metadata metadata,
ParseContext context)
Creates a reader for the text content of the given binary stream
with the given document metadata.
|
ParsingReader(Parser parser,
InputStream stream,
Metadata metadata,
ParseContext context,
Executor executor)
Creates a reader for the text content of the given binary stream
with the given document metadata.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AppleSingleFileParser.getSupportedTypes(ParseContext context) |
void |
AppleSingleFileParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ClassParser.getSupportedTypes(ParseContext context) |
void |
ClassParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
MidiParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
AudioParser.getSupportedTypes(ParseContext context) |
void |
MidiParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
AudioParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ChmParser.getSupportedTypes(ParseContext context) |
void |
ChmParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SourceCodeParser.getSupportedTypes(ParseContext context) |
void |
SourceCodeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TSDParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
Pkcs7Parser.getSupportedTypes(ParseContext context) |
void |
TSDParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
Pkcs7Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
CTAKESParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
DBFParser.getSupportedTypes(ParseContext context) |
void |
DBFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
DIFParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
DIFParser.getSupportedTypes(ParseContext context) |
void |
DIFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
InputStreamDigester.digest(InputStream is,
Metadata metadata,
ParseContext parseContext) |
void |
CompositeDigester.digest(InputStream is,
Metadata m,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
DWGParser.getSupportedTypes(ParseContext context) |
void |
DWGParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
EnviHeaderParser.getSupportedTypes(ParseContext context) |
void |
EnviHeaderParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
EpubParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
EpubContentParser.getSupportedTypes(ParseContext context) |
void |
EpubParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
EpubContentParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExecutableParser.getSupportedTypes(ParseContext context) |
void |
ExecutableParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExternalParser.getSupportedTypes(ParseContext context) |
void |
ExternalParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
FeedParser.getSupportedTypes(ParseContext context) |
void |
FeedParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TrueTypeParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
AdobeFontMetricParser.getSupportedTypes(ParseContext context) |
void |
TrueTypeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
AdobeFontMetricParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GDALParser.getSupportedTypes(ParseContext context) |
void |
GDALParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GeoParser.getSupportedTypes(ParseContext parseContext) |
void |
GeoParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GeographicInformationParser.getSupportedTypes(ParseContext parseContext) |
void |
GeographicInformationParser.parse(InputStream inputStream,
ContentHandler contentHandler,
Metadata metadata,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GribParser.getSupportedTypes(ParseContext context) |
void |
GribParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
HDFParser.getSupportedTypes(ParseContext context) |
void |
HDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
HtmlParser.getSupportedTypes(ParseContext context) |
void |
HtmlParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IptcAnpaParser.getSupportedTypes(ParseContext context) |
void |
IptcAnpaParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ISArchiveParser.getSupportedTypes(ParseContext context) |
void |
ISArchiveParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseAssay(InputStream stream,
XHTMLContentHandler xhtml,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseInvestigation(InputStream stream,
XHTMLContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseInvestigation(InputStream stream,
XHTMLContentHandler handler,
Metadata metadata,
ParseContext context,
String studyFileName) |
static void |
ISATabUtils.parseStudy(InputStream stream,
XHTMLContentHandler xhtml,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IWorkPackageParser.getSupportedTypes(ParseContext context) |
void |
IWorkPackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IWork13PackageParser.getSupportedTypes(ParseContext context) |
void |
IWork13PackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SQLite3Parser.getSupportedTypes(ParseContext context) |
void |
SQLite3Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
JournalParser.getSupportedTypes(ParseContext context) |
void |
JournalParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
GrobidRESTParser.parse(String filePath,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Metadata |
TEIDOMParser.parse(String source,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
JpegParser.getSupportedTypes(ParseContext context) |
void |
JpegParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
RFC822Parser.getSupportedTypes(ParseContext context) |
void |
RFC822Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
MatParser.getSupportedTypes(ParseContext context) |
void |
MatParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
OutlookPSTParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
MboxParser.getSupportedTypes(ParseContext context) |
void |
OutlookPSTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
MboxParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
AbstractOfficeParser.configure(ParseContext parseContext)
Checks to see if the user has specified an
OfficeParserConfig . |
Set<MediaType> |
WMFParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
TNEFParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OldExcelParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OfficeParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
MSOwnerFileParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
JackcessParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
EMFParser.getSupportedTypes(ParseContext context) |
protected void |
OfficeParser.parse(org.apache.poi.poifs.filesystem.DirectoryNode root,
ParseContext context,
Metadata metadata,
XHTMLContentHandler xhtml) |
void |
WMFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
TNEFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream
|
void |
OldExcelParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream
|
void |
OfficeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream
|
void |
MSOwnerFileParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts owner from MS temp file
|
void |
JackcessParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
EMFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Constructor and Description |
---|
ExcelExtractor(ParseContext context,
Metadata metadata) |
HSLFExtractor(ParseContext context,
Metadata metadata) |
OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root,
ParseContext context) |
OutlookExtractor(org.apache.poi.poifs.filesystem.NPOIFSFileSystem filesystem,
ParseContext context) |
WordExtractor(ParseContext context,
Metadata metadata) |
Modifier and Type | Field and Description |
---|---|
protected ParseContext |
XSSFExcelExtractorDecorator.parseContext |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
OOXMLParser.getSupportedTypes(ParseContext context) |
void |
XSSFExcelExtractorDecorator.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
XSSFBExcelExtractorDecorator.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OOXMLExtractor.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the document into a sequence of XHTML SAX events sent to the
given content handler.
|
void |
AbstractOOXMLExtractor.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OOXMLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
OOXMLExtractorFactory.parse(InputStream stream,
ContentHandler baseHandler,
Metadata metadata,
ParseContext context) |
Constructor and Description |
---|
AbstractOOXMLExtractor(ParseContext context,
org.apache.poi.POIXMLTextExtractor extractor) |
POIXMLTextExtractorDecorator(ParseContext context,
org.apache.poi.POIXMLTextExtractor extractor) |
SXSLFPowerPointExtractorDecorator(Metadata metadata,
ParseContext context,
XSLFEventBasedPowerPointExtractor extractor) |
SXWPFWordExtractorDecorator(Metadata metadata,
ParseContext context,
XWPFEventBasedWordExtractor extractor) |
XSLFPowerPointExtractorDecorator(Metadata metadata,
ParseContext context,
org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor) |
XSLFPowerPointExtractorDecorator(ParseContext context,
org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor)
Deprecated.
|
XSSFBExcelExtractorDecorator(ParseContext context,
org.apache.poi.POIXMLTextExtractor extractor,
Locale locale) |
XSSFExcelExtractorDecorator(ParseContext context,
org.apache.poi.POIXMLTextExtractor extractor,
Locale locale) |
XWPFWordExtractorDecorator(Metadata metadata,
ParseContext context,
org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor) |
XWPFWordExtractorDecorator(ParseContext context,
org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
|
Constructor and Description |
---|
XWPFStylesShim(org.apache.poi.openxml4j.opc.PackagePart part,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
Word2006MLParser.getSupportedTypes(ParseContext context) |
void |
Word2006MLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
WordMLParser.getContentHandler(ContentHandler ch,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
SpreadsheetMLParser.getContentHandler(ContentHandler ch,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
AbstractXML2003Parser.getContentHandler(ContentHandler ch,
Metadata md,
ParseContext context) |
Set<MediaType> |
WordMLParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
SpreadsheetMLParser.getSupportedTypes(ParseContext context) |
void |
AbstractXML2003Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
Mp3Parser.getSupportedTypes(ParseContext context) |
void |
Mp3Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
MP4Parser.getSupportedTypes(ParseContext context) |
void |
MP4Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
NamedEntityParser.getSupportedTypes(ParseContext parseContext) |
void |
NamedEntityParser.parse(InputStream inputStream,
ContentHandler contentHandler,
Metadata metadata,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
NetCDFParser.getSupportedTypes(ParseContext context) |
void |
NetCDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TesseractOCRParser.getSupportedTypes(ParseContext context) |
void |
TesseractOCRParser.parse(Image image,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
TesseractOCRParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext parseContext) |
void |
TesseractOCRParser.parseInline(InputStream stream,
XHTMLContentHandler xhtml,
ParseContext parseContext,
TesseractOCRConfig config)
Use this to parse content without starting a new document.
|
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
OpenDocumentMetaParser.getContentHandler(ContentHandler ch,
Metadata md,
ParseContext context) |
Set<MediaType> |
OpenDocumentParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OpenDocumentContentParser.getSupportedTypes(ParseContext context) |
void |
OpenDocumentParser.parse(InputStream stream,
ContentHandler baseHandler,
Metadata metadata,
ParseContext context) |
void |
OpenDocumentMetaParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OpenDocumentContentParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PDFParser.getSupportedTypes(ParseContext context) |
void |
PDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
RarParser.getSupportedTypes(ParseContext arg0) |
Set<MediaType> |
PackageParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CompressorParser.getSupportedTypes(ParseContext context) |
void |
RarParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
PackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
CompressorParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PooledTimeSeriesParser.getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used with the
given parse context.
|
void |
PooledTimeSeriesParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PRTParser.getSupportedTypes(ParseContext context) |
void |
PRTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AgeRecogniser.getSupportedTypes(ParseContext parseContext) |
Set<MediaType> |
ObjectRecognitionParser.getSupportedTypes(ParseContext context) |
void |
AgeRecogniser.parse(InputStream inputStream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
ObjectRecognitionParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<RecognisedObject> |
ObjectRecogniser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Recognise the objects in the stream
|
Modifier and Type | Method and Description |
---|---|
List<RecognisedObject> |
TensorflowRESTRecogniser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<RecognisedObject> |
TensorflowImageRecParser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
RTFParser.getSupportedTypes(ParseContext context) |
void |
RTFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SentimentAnalysisParser.getSupportedTypes(ParseContext context)
Returns the types supported
|
void |
SentimentAnalysisParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Performs the parse
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
StringsParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
Latin1StringsParser.getSupportedTypes(ParseContext arg0) |
void |
StringsParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
Latin1StringsParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TXTParser.getSupportedTypes(ParseContext context) |
void |
TXTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
FLVParser.getSupportedTypes(ParseContext context) |
void |
FLVParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
WordPerfectParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
QuattroProParser.getSupportedTypes(ParseContext context) |
void |
WordPerfectParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
QuattroProParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
XMLParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
FictionBookParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
DcXMLParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
XMLParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
FictionBookParser.getSupportedTypes(ParseContext context) |
void |
XMLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
static void |
TikaResource.fillMetadata(Parser parser,
Metadata metadata,
ParseContext context,
javax.ws.rs.core.MultivaluedMap<String,String> httpHeaders) |
static void |
TikaResource.fillParseContext(ParseContext parseContext,
javax.ws.rs.core.MultivaluedMap<String,String> httpHeaders,
Parser embeddedParser) |
static void |
TikaResource.parse(Parser parser,
org.slf4j.Logger logger,
String path,
InputStream inputStream,
ContentHandler handler,
Metadata metadata,
ParseContext parseContext)
Use this to call a parser and unify exception handling.
|
Modifier and Type | Method and Description |
---|---|
static Future |
ConcurrentUtils.execute(ParseContext context,
Runnable runnable)
Execute a runnable using an ExecutorService from the ParseContext if possible.
|
Copyright © 2007–2017 The Apache Software Foundation. All rights reserved.