|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||
TemporaryResources.close() method is called.
.../@* XPath expression.available() method.
DefaultExtractor extraction rules and "delegate" as the content handler.
XHTMLContentHandler object received as parameter.
null if the stream is not a TikaInputStream.
CharsetDetector provides a facility for detecting the
charset or encoding of character data in an unknown format..../*... XPath expression.://translated.by/you/microsoft-s-html-help-chm-format-incomplete/original
/?show-translation-form=1ClosedInputStream
sentinel.
close() method.
Reader.
Channel.
Writer.
InputStream.
OutputStream.
ID3Tags in preference order, and when asked for
a given tag, will return it from the first ID3Tags that has it.ContentHandler interface.InputStream to an
OutputStream.
InputStream to chars on a
Writer using the default character encoding of the platform.
InputStream to chars on a
Writer using the specified character encoding.
Reader to a Writer.
Reader to bytes on an
OutputStream using the default character encoding of the
platform, and calling flush.
Reader to bytes on an
OutputStream using the specified character encoding, and
calling flush.
InputStream to an
OutputStream.
Reader to a Writer.
TemporaryResources.close() method is called.
Detector implementations
available through the service provider mechanism.Parser implementations
available through the service provider mechanism.POIFSContainerDetector.detect(Set, DirectoryEntry) and pass the root
entry of the filesystem whose type is to be detected, as a
second argument.
TemporaryResources.close() method and wraps the potential
IOException into a TikaException for convenience
when used within Tika.
ContainerExtractor instance.QNames using
a Map.EmbeddedContentHandler.startDocument()
and EmbeddedContentHandler.endDocument() events from reaching the decorated handler.EmbeddedContentHandler.startDocument() and EmbeddedContentHandler.endDocument()
events.
ContentHandler which will ignore normal
SAX calls to EndDocumentShieldingContentHandler.endDocument(), and only fire them later.XHTMLContentHandler.NL character.
*.html files.TikaException without even
attempting to parse the given document stream.ExternalParsersConfigReader.POIXMLTextExtractor.getMetadataTextExtractor() not yet supported
for OOXML by POI.
Tika.detect(String) instead
Tika.detect(File) instead
TikaConfig.getParser() method instead
InputStream emulates.
length bytes from the
given stream.
SAXExceptions thrown, wrapping and re-throwing.
NetCDFParser depends on the NetCDF-Java API,
we are able to use it to parse HDF files as well.Throwable constructors missing before Java 6.HtmlMapper mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
true if this parser is configured to listen
for all records instead of just the specified few.
HtmlMapper mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
HtmlMapper mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
mark(int) method.
markSupported() method.
true if the XPath expression matches the named
attribute of the element associated with this evaluation state.
true if the XPath expression matches
the element associated with this evaluation state.
true if the XPath expression matches all text
nodes whose parent is the element associated with this evaluation
state.
Metadata fields.AttributeMetadataHandler and
ElementMetadataHandler classes insteadMimeTypesReader.Mp3Parser is used to parse ID3 Version 1 Tag information
from an MP3 file, if available..../@name XPath expression..../name... XPath
expression.Parser for NetCDF
files using the UCAR, MIT-licensed NetCDF for Java
API..../node() XPath expression.InputStream that emulates
a stream of a specified size.InputStream that emulates a specified size
which supports marking and does not throw EOFException.
InputStream that emulates a specified
size with option settings.
root type, application/octet-stream.
OfflineContentHandler.resolveEntity(String, String) method to prevent potential
network or other external resources from being accessed by an XML parser.OOXMLExtractor for the supplied document and
returns it.content.xml files.meta.xml files.OpenDocumentParser class instead.
This class will be removed in Apache Tika 1.0.Parser.parse(InputStream, ContentHandler, Metadata, ParseContext) method instead
Appendable.
ContainerExtractor powered by the regular
Parser API.Parser interface.PasswordProvider on the ParseContext instead
text type, text/plain.
read() method.
read(byte[], offset, length)
method.
read() method.
read(byte[]) method.
read(byte[], int, int) method.
InputStream as a list of Strings,
one entry per line, using the default character encoding of the platform.
InputStream as a list of Strings,
one entry per line, using the specified character encoding.
Reader as a list of Strings,
one entry per line.
reset() method.
SafeContentHandler.characters(char[], int, int) or
SafeContentHandler.ignorableWhitespace(char[], int, int)) passed to the decorated
content handler contain only valid XML characters.TemporaryResources.createTemporaryFile() method.
skip(long) method.
...//... XPath expression.IOException wrapper that tags the wrapped exception with
a given object reference.SAXException wrapper that tags the wrapped exception with
a given object reference.TextContentHandler.characters(char[], int, int) and
(@link TextContentHandler.ignorableWhitespace(char[], int, int)
(plus TextContentHandler.startDocument() and TextContentHandler.endDocument() events to
the decorated content handler.TextDetector which will look at the default number
of bytes from the beginning of the document.
TextDetector which will look at a given number of
bytes from the beginning of the document.
.../text() XPath expression.RuntimeException with the given
throwable as the root cause, thus interrupting the entire service
loading operation.
SAXException to a corresponding
TikaException if it's caused by this instance detecting
a zip bomb.
ServiceLoader class to work correctly in an OSGi environment.Parser implementations available through the
service provider mechanism in the given
class loader.
InputStream as a byte[].
Reader as a byte[]
using the default character encoding of the platform.
Reader as a byte[]
using the specified character encoding.
String.getBytes()
InputStream as a character array
using the default character encoding of the platform.
InputStream as a character array
using the specified character encoding.
Reader as a character array.
InputStream as a String
using the default character encoding of the platform.
InputStream as a String
using the specified character encoding.
Reader as a String.
String.String(byte[])
String.String(byte[],String)
Logger
created using the given class name.
byte[] to an OutputStream.
byte[] to chars on a Writer
using the default character encoding of the platform.
byte[] to chars on a Writer
using the specified character encoding.
char[] to a Writer
using the default character encoding of the platform.
char[] to bytes on an
OutputStream.
char[] to bytes on an
OutputStream using the specified character encoding.
CharSequence to a Writer.
CharSequence to bytes on an
OutputStream using the default character encoding of the
platform.
CharSequence to bytes on an
OutputStream using the specified character encoding.
String to a Writer.
String to bytes on an
OutputStream using the default character encoding of the
platform.
String to bytes on an
OutputStream using the specified character encoding.
/dev/null.
/dev/null.
/dev/null.
xml type, application/xml.
SAXParser to determine
the namespace URI and local name of the root element of an XML file.
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||