Modifier and Type | Method and Description |
---|---|
protected void |
FileResourceConsumer.parse(String resourceId,
Parser parser,
InputStream is,
ContentHandler handler,
Metadata m,
ParseContext parseContext)
Utility method to handle logging equivalently among all
implementing classes.
|
Modifier and Type | Method and Description |
---|---|
List<RecognisedObject> |
DL4JVGG16Net.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<RecognisedObject> |
DL4JInceptionV3Net.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ExternalEmbedder.embed(Metadata metadata,
InputStream inputStream,
OutputStream outputStream,
ParseContext context)
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
|
void |
Embedder.embed(Metadata metadata,
InputStream originalStream,
OutputStream outputStream,
ParseContext context)
Embeds related document metadata from the given metadata object into the
given output stream.
|
Set<MediaType> |
ExternalEmbedder.getSupportedEmbedTypes(ParseContext context) |
Set<MediaType> |
Embedder.getSupportedEmbedTypes(ParseContext context)
Returns the set of media types supported by this embedder when used with
the given parse context.
|
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
PrescriptionParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
EncryptedPrescriptionParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
PrescriptionParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
DirListParser.getSupportedTypes(ParseContext context) |
void |
EncryptedPrescriptionParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
LanguageDetectingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
DirListParser.parse(InputStream is,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
static EmbeddedDocumentExtractor |
EmbeddedDocumentUtil.getEmbeddedDocumentExtractor(ParseContext context)
This offers a uniform way to get an EmbeddedDocumentExtractor from a ParseContext.
|
static Parser |
EmbeddedDocumentUtil.tryToFindExistingLeafParser(Class clazz,
ParseContext context)
Tries to find an existing parser within the ParseContext.
|
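
The look-up-with-fallback idea behind `getEmbeddedDocumentExtractor` can be sketched without Tika on the classpath. What follows is a minimal, hypothetical model: `MiniContext`, `Extractor`, and `ExtractorLookup` are illustrative names, not Tika classes, and the fallback shown is an assumption about the general pattern rather than the library's actual logic.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a type-keyed context; not the Tika class.
class MiniContext {
    private final Map<Class<?>, Object> entries = new HashMap<>();

    <T> void set(Class<T> key, T value) {
        entries.put(key, value);
    }

    <T> T get(Class<T> key) {
        // Class.cast keeps the lookup type-safe without unchecked casts;
        // it returns null when nothing was registered for the key.
        return key.cast(entries.get(key));
    }
}

// Hypothetical extractor interface, used only for this illustration.
interface Extractor {
    String name();
}

class ExtractorLookup {
    // Returns the extractor registered in the context, or a default one
    // when the caller did not supply any.
    static Extractor getExtractor(MiniContext context) {
        Extractor configured = context.get(Extractor.class);
        if (configured != null) {
            return configured;
        }
        return () -> "default";
    }
}
```

Keying the map by `Class` means callers never pass string names, and every consumer resolves the same object in the same way.
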
Constructor and Description |
---|
EmbeddedDocumentUtil(ParseContext context) |
ParsingEmbeddedDocumentExtractor(ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ForkParser.getSupportedTypes(ParseContext context) |
void |
ForkParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
This sends the objects to the server for parsing, and the server via
the proxies acts on the handler as if it were updating it directly.
|
Modifier and Type | Method and Description |
---|---|
void |
DigestingParser.Digester.digest(InputStream is,
Metadata m,
ParseContext parseContext)
Digests an InputStream and sets the appropriate value(s) in the metadata.
|
Map<MediaType,List<Parser>> |
CompositeParser.findDuplicateParsers(ParseContext context)
Utility method that goes through all the component parsers and finds
all media types for which more than one parser declares support.
|
protected Parser |
DelegatingParser.getDelegateParser(ParseContext context)
Returns the parser instance to which parsing tasks should be delegated.
|
protected EncodingDetector |
AbstractEncodingDetectorParser.getEncodingDetector(ParseContext parseContext)
Looks for an EncodingDetector in the ParseContext.
|
protected Parser |
CompositeParser.getParser(Metadata metadata,
ParseContext context) |
Map<MediaType,Parser> |
CompositeParser.getParsers(ParseContext context) |
Map<MediaType,Parser> |
DefaultParser.getParsers(ParseContext context) |
Set<MediaType> |
EmptyParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
Parser.getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used
with the given parse context.
|
Set<MediaType> |
DelegatingParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ErrorParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ParserDecorator.getSupportedTypes(ParseContext context)
Delegates the method call to the decorated parser.
|
Set<MediaType> |
RecursiveParserWrapper.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CompositeParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CryptoParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
NetworkParser.getSupportedTypes(ParseContext context) |
void |
DigestingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
EmptyParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events.
|
void |
DelegatingParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Looks up the delegate parser from the parsing context and
delegates the parse operation to it.
|
void |
ParserPostProcessor.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Forwards the call to the delegated parser and post-processes the
results as described in the ParserPostProcessor class documentation.
|
void |
ErrorParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
ParserDecorator.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the method call to the decorated parser.
|
void |
RecursiveParserWrapper.parse(InputStream stream,
ContentHandler recursiveParserWrapperHandler,
Metadata metadata,
ParseContext context)
Acts like a regular parser except it ignores the ContentHandler
and it automatically sets/overwrites the embedded Parser in the
ParseContext object.
|
void |
AutoDetectParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
CompositeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Delegates the call to the matching component parser.
|
void |
CryptoParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
NetworkParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
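
The duplicate check described for `CompositeParser.findDuplicateParsers` amounts to grouping parsers by declared media type and keeping the groups with more than one member. Here is a minimal sketch under simplifying assumptions: media types are plain strings, and `SimpleParser` and `DuplicateFinder` are hypothetical names rather than Tika types.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical parser model: a name plus the media types it claims.
class SimpleParser {
    final String name;
    final Set<String> supportedTypes;

    SimpleParser(String name, Set<String> supportedTypes) {
        this.name = name;
        this.supportedTypes = supportedTypes;
    }
}

class DuplicateFinder {
    // Maps each media type claimed by two or more parsers to those parsers.
    static Map<String, List<SimpleParser>> findDuplicates(List<SimpleParser> parsers) {
        Map<String, List<SimpleParser>> byType = new LinkedHashMap<>();
        for (SimpleParser parser : parsers) {
            for (String type : parser.supportedTypes) {
                byType.computeIfAbsent(type, t -> new ArrayList<>()).add(parser);
            }
        }
        // Drop every type that only a single parser declared.
        byType.values().removeIf(claimants -> claimants.size() < 2);
        return byType;
    }
}
```

An empty result means no two parsers compete for the same type, which makes the method a convenient sanity check after assembling a custom parser list.
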
Constructor and Description |
---|
ParsingReader(Parser parser,
InputStream stream,
Metadata metadata,
ParseContext context)
Creates a reader for the text content of the given binary stream
with the given document metadata.
|
ParsingReader(Parser parser,
InputStream stream,
Metadata metadata,
ParseContext context,
Executor executor)
Creates a reader for the text content of the given binary stream
with the given document metadata.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AppleSingleFileParser.getSupportedTypes(ParseContext context) |
void |
AppleSingleFileParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ClassParser.getSupportedTypes(ParseContext context) |
void |
ClassParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AudioParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
MidiParser.getSupportedTypes(ParseContext context) |
void |
AudioParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
MidiParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
List<CaptionObject> |
TensorflowRESTCaptioner.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ChmParser.getSupportedTypes(ParseContext context) |
void |
ChmParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SourceCodeParser.getSupportedTypes(ParseContext context) |
void |
SourceCodeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TSDParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
Pkcs7Parser.getSupportedTypes(ParseContext context) |
void |
TSDParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
Pkcs7Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TextAndCSVParser.getSupportedTypes(ParseContext context) |
void |
TextAndCSVParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
CTAKESParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
DBFParser.getSupportedTypes(ParseContext context) |
void |
DBFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
DIFParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
DIFParser.getSupportedTypes(ParseContext context) |
void |
DIFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
CompositeDigester.digest(InputStream is,
Metadata m,
ParseContext parseContext) |
void |
InputStreamDigester.digest(InputStream is,
Metadata metadata,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
DWGParser.getSupportedTypes(ParseContext context) |
void |
DWGParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
EnviHeaderParser.getSupportedTypes(ParseContext context) |
void |
EnviHeaderParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
EpubParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
EpubContentParser.getSupportedTypes(ParseContext context) |
void |
EpubParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
EpubContentParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExecutableParser.getSupportedTypes(ParseContext context) |
void |
ExecutableParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExternalParser.getSupportedTypes(ParseContext context) |
void |
ExternalParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
FeedParser.getSupportedTypes(ParseContext context) |
void |
FeedParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AdobeFontMetricParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
TrueTypeParser.getSupportedTypes(ParseContext context) |
void |
AdobeFontMetricParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
TrueTypeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GDALParser.getSupportedTypes(ParseContext context) |
void |
GDALParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GeoParser.getSupportedTypes(ParseContext parseContext) |
void |
GeoParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GeographicInformationParser.getSupportedTypes(ParseContext parseContext) |
void |
GeographicInformationParser.parse(InputStream inputStream,
ContentHandler contentHandler,
Metadata metadata,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
GribParser.getSupportedTypes(ParseContext context) |
void |
GribParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
HDFParser.getSupportedTypes(ParseContext context) |
void |
HDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
HtmlParser.getSupportedTypes(ParseContext context) |
void |
HtmlParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
HwpV5Parser.getSupportedTypes(ParseContext context) |
void |
HwpV5Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IptcAnpaParser.getSupportedTypes(ParseContext context) |
void |
IptcAnpaParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ISArchiveParser.getSupportedTypes(ParseContext context) |
void |
ISArchiveParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseAssay(InputStream stream,
XHTMLContentHandler xhtml,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseInvestigation(InputStream stream,
XHTMLContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
ISATabUtils.parseInvestigation(InputStream stream,
XHTMLContentHandler handler,
Metadata metadata,
ParseContext context,
String studyFileName) |
static void |
ISATabUtils.parseStudy(InputStream stream,
XHTMLContentHandler xhtml,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IWorkPackageParser.getSupportedTypes(ParseContext context) |
void |
IWorkPackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
IWork18PackageParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
IWork13PackageParser.getSupportedTypes(ParseContext context) |
void |
IWork18PackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
IWork13PackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SQLite3Parser.getSupportedTypes(ParseContext context) |
void |
SQLite3Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
JournalParser.getSupportedTypes(ParseContext context) |
void |
JournalParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
GrobidRESTParser.parse(String filePath,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Metadata |
TEIDOMParser.parse(String source,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
JpegParser.getSupportedTypes(ParseContext context) |
void |
JpegParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
RFC822Parser.getSupportedTypes(ParseContext context) |
void |
RFC822Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
MatParser.getSupportedTypes(ParseContext context) |
void |
MatParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
OutlookPSTParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
MboxParser.getSupportedTypes(ParseContext context) |
void |
OutlookPSTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
MboxParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
AbstractOfficeParser.configure(ParseContext parseContext)
Checks to see if the user has specified an
OfficeParserConfig. |
Set<MediaType> |
JackcessParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OfficeParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
EMFParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
TNEFParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
MSOwnerFileParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OldExcelParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
WMFParser.getSupportedTypes(ParseContext context) |
protected void |
OfficeParser.parse(org.apache.poi.poifs.filesystem.DirectoryNode root,
ParseContext context,
Metadata metadata,
XHTMLContentHandler xhtml) |
void |
JackcessParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OfficeParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream.
|
void |
EMFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
TNEFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream.
|
void |
MSOwnerFileParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts the owner from an MS temp file.
|
void |
OldExcelParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Extracts properties and text from an MS Document input stream.
|
void |
WMFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Constructor and Description |
---|
ExcelExtractor(ParseContext context,
Metadata metadata) |
HSLFExtractor(ParseContext context,
Metadata metadata) |
OutlookExtractor(org.apache.poi.poifs.filesystem.DirectoryNode root,
ParseContext context) |
OutlookExtractor(org.apache.poi.poifs.filesystem.POIFSFileSystem filesystem,
ParseContext context) |
WordExtractor(ParseContext context,
Metadata metadata) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
OneNoteParser.getSupportedTypes(ParseContext context) |
void |
OneNoteParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Field and Description |
---|---|
protected ParseContext |
XSSFExcelExtractorDecorator.parseContext |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
OOXMLParser.getSupportedTypes(ParseContext context) |
void |
XSSFExcelExtractorDecorator.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OOXMLExtractor.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the document into a sequence of XHTML SAX events sent to the
given content handler.
|
void |
XSSFBExcelExtractorDecorator.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
AbstractOOXMLExtractor.getXHTML(ContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
OOXMLExtractorFactory.parse(InputStream stream,
ContentHandler baseHandler,
Metadata metadata,
ParseContext context) |
void |
OOXMLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Constructor and Description |
---|
AbstractOOXMLExtractor(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |
POIXMLTextExtractorDecorator(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |
SXSLFPowerPointExtractorDecorator(Metadata metadata,
ParseContext context,
XSLFEventBasedPowerPointExtractor extractor) |
SXWPFWordExtractorDecorator(Metadata metadata,
ParseContext context,
XWPFEventBasedWordExtractor extractor) |
XSLFPowerPointExtractorDecorator(Metadata metadata,
ParseContext context,
org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor) |
XSLFPowerPointExtractorDecorator(ParseContext context,
org.apache.poi.xslf.extractor.XSLFPowerPointExtractor extractor)
Deprecated.
|
XSSFBExcelExtractorDecorator(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor,
Locale locale) |
XSSFExcelExtractorDecorator(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor,
Locale locale) |
XWPFWordExtractorDecorator(Metadata metadata,
ParseContext context,
org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor) |
XWPFWordExtractorDecorator(ParseContext context,
org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
|
Constructor and Description |
---|
XPSExtractorDecorator(ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |
Constructor and Description |
---|
XWPFStylesShim(org.apache.poi.openxml4j.opc.PackagePart part,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
Word2006MLParser.getSupportedTypes(ParseContext context) |
void |
Word2006MLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
SpreadsheetMLParser.getContentHandler(ContentHandler ch,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
WordMLParser.getContentHandler(ContentHandler ch,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
AbstractXML2003Parser.getContentHandler(ContentHandler ch,
Metadata md,
ParseContext context) |
Set<MediaType> |
SpreadsheetMLParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
WordMLParser.getSupportedTypes(ParseContext context) |
void |
AbstractXML2003Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
Mp3Parser.getSupportedTypes(ParseContext context) |
void |
Mp3Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
MP4Parser.getSupportedTypes(ParseContext context) |
void |
MP4Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
NamedEntityParser.getSupportedTypes(ParseContext parseContext) |
void |
NamedEntityParser.parse(InputStream inputStream,
ContentHandler contentHandler,
Metadata metadata,
ParseContext parseContext) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
NetCDFParser.getSupportedTypes(ParseContext context) |
void |
NetCDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TesseractOCRParser.getSupportedTypes(ParseContext context) |
void |
TesseractOCRParser.parse(Image image,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
TesseractOCRParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext parseContext) |
void |
TesseractOCRParser.parseInline(InputStream stream,
XHTMLContentHandler xhtml,
ParseContext parseContext,
TesseractOCRConfig config)
Use this to parse content without starting a new document.
|
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
OpenDocumentMetaParser.getContentHandler(ContentHandler ch,
Metadata md,
ParseContext context) |
Set<MediaType> |
OpenDocumentContentParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
OpenDocumentParser.getSupportedTypes(ParseContext context) |
void |
OpenDocumentMetaParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OpenDocumentContentParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
OpenDocumentParser.parse(InputStream stream,
ContentHandler baseHandler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected org.apache.pdfbox.pdmodel.PDDocument |
PDFParser.getPDDocument(InputStream inputStream,
String password,
org.apache.pdfbox.io.MemoryUsageSetting memoryUsageSetting,
Metadata metadata,
ParseContext parseContext) |
protected org.apache.pdfbox.pdmodel.PDDocument |
PDFPreflightParser.getPDDocument(InputStream inputStream,
String password,
org.apache.pdfbox.io.MemoryUsageSetting memoryUsageSetting,
Metadata metadata,
ParseContext parseContext) |
protected org.apache.pdfbox.pdmodel.PDDocument |
PDFParser.getPDDocument(Path path,
String password,
org.apache.pdfbox.io.MemoryUsageSetting memoryUsageSetting,
Metadata metadata,
ParseContext parseContext) |
protected org.apache.pdfbox.pdmodel.PDDocument |
PDFPreflightParser.getPDDocument(Path path,
String password,
org.apache.pdfbox.io.MemoryUsageSetting memoryUsageSetting,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
PDFParser.getSupportedTypes(ParseContext context) |
void |
PDFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
static void |
PDFMarkedContent2XHTML.process(org.apache.pdfbox.pdmodel.PDDocument pdDocument,
ContentHandler handler,
ParseContext context,
Metadata metadata,
PDFParserConfig config)
Converts the given PDF document (and related metadata) to a stream
of XHTML SAX events sent to the given content handler.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PackageParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
RarParser.getSupportedTypes(ParseContext arg0) |
Set<MediaType> |
CompressorParser.getSupportedTypes(ParseContext context) |
void |
PackageParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
RarParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
CompressorParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PooledTimeSeriesParser.getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used with the
given parse context.
|
void |
PooledTimeSeriesParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
PRTParser.getSupportedTypes(ParseContext context) |
void |
PRTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AgeRecogniser.getSupportedTypes(ParseContext parseContext) |
Set<MediaType> |
ObjectRecognitionParser.getSupportedTypes(ParseContext context) |
void |
AgeRecogniser.parse(InputStream inputStream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
ObjectRecognitionParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<? extends RecognisedObject> |
ObjectRecogniser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Recognises the objects in the stream.
|
Modifier and Type | Method and Description |
---|---|
List<RecognisedObject> |
TensorflowImageRecParser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
List<RecognisedObject> |
TensorflowRESTRecogniser.recognise(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
RTFParser.getSupportedTypes(ParseContext context) |
void |
RTFParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SAS7BDATParser.getSupportedTypes(ParseContext context) |
void |
SAS7BDATParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
SentimentAnalysisParser.getSupportedTypes(ParseContext context)
Returns the supported media types.
|
void |
SentimentAnalysisParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Performs the parse.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
Latin1StringsParser.getSupportedTypes(ParseContext arg0) |
Set<MediaType> |
StringsParser.getSupportedTypes(ParseContext context) |
void |
Latin1StringsParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
StringsParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
TXTParser.getSupportedTypes(ParseContext context) |
void |
TXTParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
FLVParser.getSupportedTypes(ParseContext context) |
void |
FLVParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
WordPerfectParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
QuattroProParser.getSupportedTypes(ParseContext context) |
void |
WordPerfectParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
QuattroProParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
XLIFF12Parser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
XLZParser.getSupportedTypes(ParseContext context) |
void |
XLIFF12Parser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
XLZParser.parse(InputStream stream,
ContentHandler baseHandler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
protected ContentHandler |
XMLParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
FictionBookParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
protected ContentHandler |
DcXMLParser.getContentHandler(ContentHandler handler,
Metadata metadata,
ParseContext context) |
Set<MediaType> |
XMLParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
FictionBookParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
XMLProfiler.getSupportedTypes(ParseContext context) |
void |
XMLParser.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
void |
XMLProfiler.parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context) |
Modifier and Type | Method and Description |
---|---|
static void |
TikaResource.fillMetadata(Parser parser,
Metadata metadata,
ParseContext context,
javax.ws.rs.core.MultivaluedMap<String,String> httpHeaders) |
static void |
TikaResource.fillParseContext(ParseContext parseContext,
javax.ws.rs.core.MultivaluedMap<String,String> httpHeaders,
Parser embeddedParser) |
static void |
TikaResource.parse(Parser parser,
org.slf4j.Logger logger,
String path,
InputStream inputStream,
ContentHandler handler,
Metadata metadata,
ParseContext parseContext)
Use this to call a parser and unify exception handling.
|
Modifier and Type | Method and Description |
---|---|
static Document |
XMLReaderUtils.buildDOM(InputStream is,
ParseContext context)
This checks context for a user specified
DocumentBuilder. |
static Future |
ConcurrentUtils.execute(ParseContext context,
Runnable runnable)
Execute a runnable using an ExecutorService from the ParseContext if possible.
|
static void |
XMLReaderUtils.parseSAX(InputStream is,
DefaultHandler contentHandler,
ParseContext context)
This checks context for a user specified
SAXParser. |
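
The "if possible" in `ConcurrentUtils.execute` implies a fallback when the context carries no `ExecutorService`. Below is a minimal sketch of that pattern, assuming the fallback is a dedicated thread. The executor is passed in directly (possibly `null`) instead of being read from a `ParseContext`, and `ExecuteSketch` is a hypothetical name.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Future;
import java.util.concurrent.FutureTask;

class ExecuteSketch {
    // Runs the task on the supplied executor, or on a fresh thread when
    // no executor is available (executor == null).
    static Future<?> execute(ExecutorService executor, Runnable runnable) {
        if (executor != null) {
            return executor.submit(runnable);
        }
        FutureTask<Void> task = new FutureTask<>(runnable, null);
        new Thread(task, "fallback-thread").start();
        return task;
    }
}
```

Either way the caller receives a `Future`, so waiting for completion or cancelling looks the same regardless of which path ran the task.
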
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.