.../@*
XPath expression.
available()
method.
DefaultExtractor
extraction rules and "delegate" as the content handler.
XHTMLContentHandler
object received as parameter.
CharsetDetector
provides a facility for detecting the
charset or encoding of character data in an unknown format.
.../*...
XPath expression.
ClosedInputStream
sentinel.
close()
method.
Reader
.
Channel
.
Writer
.
InputStream
.
OutputStream
.
ID3Tags
in preference order, and when asked for
a given tag, will return it from the first ID3Tags
that has it.
ContentHandler
interface.
InputStream
to an
OutputStream
.
InputStream
to chars on a
Writer
using the default character encoding of the platform.
InputStream
to chars on a
Writer
using the specified character encoding.
Reader
to a Writer
.
Reader
to bytes on an
OutputStream
using the default character encoding of the
platform, and calling flush.
Reader
to bytes on an
OutputStream
using the specified character encoding, and
calling flush.
InputStream
to an
OutputStream
.
Reader
to a Writer
.
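The copy entries above describe moving bytes from an InputStream to an OutputStream, chars from a Reader to a Writer, and bytes to chars under a named encoding. A minimal JDK-only sketch of that buffered-copy idea follows. It is not the library's implementation; the class name `StreamCopySketch`, the buffer size, and the method signatures are assumptions made for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.io.Reader;
import java.io.Writer;
import java.nio.charset.Charset;

// Hypothetical helper class; names and buffer size are illustrative only.
final class StreamCopySketch {
    private static final int BUFFER_SIZE = 4096;

    private StreamCopySketch() {
    }

    // Copies all bytes from in to out and returns the number of bytes copied.
    static long copy(InputStream in, OutputStream out) throws IOException {
        byte[] buffer = new byte[BUFFER_SIZE];
        long total = 0;
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
            total += n;
        }
        return total;
    }

    // Copies all chars from in to out and returns the number of chars copied.
    static long copy(Reader in, Writer out) throws IOException {
        char[] buffer = new char[BUFFER_SIZE];
        long total = 0;
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
            total += n;
        }
        return total;
    }

    // Decodes bytes from in with the named encoding and writes the chars to out.
    static long copy(InputStream in, Writer out, String encoding) throws IOException {
        return copy(new InputStreamReader(in, Charset.forName(encoding)), out);
    }
}
```

Neither stream is closed by these methods, so the caller keeps ownership of both ends.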
Parser
implementations
available through the service provider mechanism
.
ContainerExtractor
instance.
QName
s using
a Map
.
EmbeddedContentHandler.startDocument()
and EmbeddedContentHandler.endDocument()
events from reaching the decorated handler.
EmbeddedContentHandler.startDocument()
and EmbeddedContentHandler.endDocument()
events.
XHTMLContentHandler.NL
character.
*.html
files.
TikaException
without even
attempting to parse the given document stream.
POIXMLTextExtractor.getMetadataTextExtractor()
not yet supported
for OOXML by POI.
TikaConfig.getParser()
method instead
TikaConfig.getParser()
method instead
InputStream
emulates.
length
bytes from the
given stream.
SAXException
s thrown, wrapping and re-throwing.
NetCDFParser
depends on the NetCDF-Java API,
we are able to use it to parse HDF files as well.
Throwable
constructors missing before Java 6.
HtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
true
if this parser is configured to listen
for all records instead of just the specified few.
HtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
HtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
mark(int)
method.
markSupported()
method.
true
if the XPath expression matches the named
attribute of the element associated with this evaluation state.
true
if the XPath expression matches
the element associated with this evaluation state.
true
if the XPath expression matches all text
nodes whose parent is the element associated with this evaluation
state.
TikaInputStream
instead
MimeTypesReader
.
Mp3Parser
is used to parse ID3 Version 1 Tag information
from an MP3 file, if available.
.../@name
XPath expression.
.../name...
XPath
expression.
Parser
for NetCDF
files using the UCAR, MIT-licensed NetCDF for Java
API.
.../node()
XPath expression.
InputStream
that emulates
a stream of a specified size.
InputStream
that emulates a specified size
which supports marking and does not throw EOFException.
InputStream
that emulates a specified
size with option settings.
root
type, application/octet-stream.
OfflineContentHandler.resolveEntity(String, String)
method to prevent potential
network or other external resources from being accessed by an XML parser.
OOXMLExtractor
for the supplied document and
returns it.
content.xml
files.
meta.xml
files.
OpenDocumentParser
class instead.
This class will be removed in Apache Tika 1.0.
Appendable
.
ContainerExtractor
powered by the
regular Parser
classes.
Parser
interface.
text
type, text/plain.
read()
method.
read(byte[], offset, length)
method.
read()
method.
read(byte[])
method.
read(byte[], int, int)
method.
InputStream
as a list of Strings,
one entry per line, using the default character encoding of the platform.
InputStream
as a list of Strings,
one entry per line, using the specified character encoding.
Reader
as a list of Strings,
one entry per line.
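The three entries above return the contents of a stream or reader as a list of Strings, one per line. A minimal JDK-only sketch of that behaviour follows. It is an illustration under stated assumptions rather than the library's code: the class name `ReadLinesSketch` and its signatures are hypothetical, and line splitting is delegated to `BufferedReader.readLine()`.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class; names are illustrative only.
final class ReadLinesSketch {
    private ReadLinesSketch() {
    }

    // Reads every line from the reader; line terminators are not included.
    static List<String> readLines(Reader input) throws IOException {
        BufferedReader reader = new BufferedReader(input);
        List<String> lines = new ArrayList<String>();
        String line;
        while ((line = reader.readLine()) != null) {
            lines.add(line);
        }
        return lines;
    }

    // Decodes the stream with the named encoding, then reads it line by line.
    static List<String> readLines(InputStream input, String encoding) throws IOException {
        return readLines(new InputStreamReader(input, Charset.forName(encoding)));
    }
}
```

Passing the encoding explicitly avoids depending on the platform default, which is the distinction the entries above draw between the two InputStream variants.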
reset()
method.
SafeContentHandler.characters(char[], int, int)
or
SafeContentHandler.ignorableWhitespace(char[], int, int)
) passed to the decorated
content handler contain only valid XML characters.
skip(long)
method.
...//...
XPath expression.
IOException
wrapper that tags the wrapped exception with
a given object reference.
SAXException
wrapper that tags the wrapped exception with
a given object reference.
TextContentHandler.characters(char[], int, int)
and
TextContentHandler.ignorableWhitespace(char[], int, int)
(plus TextContentHandler.startDocument()
and TextContentHandler.endDocument())
events to
the decorated content handler.
.../text()
XPath expression.
SAXException
to a corresponding
TikaException
if it's caused by this instance detecting
a zip bomb.
Parser
implementations available through the
service provider mechanism
in the given
class loader.
Parser
implementations available through the
service provider mechanism
in the context
class loader of the current thread.
InputStream
and represent
same data as result InputStream.
InputStream
and represent
same data as result InputStream.
InputStream
as a byte[]
.
Reader
as a byte[]
using the default character encoding of the platform.
Reader
as a byte[]
using the specified character encoding.
String.getBytes()
InputStream
as a character array
using the default character encoding of the platform.
InputStream
as a character array
using the specified character encoding.
Reader
as a character array.
InputStream
as a String
using the default character encoding of the platform.
InputStream
as a String
using the specified character encoding.
Reader
as a String.
String.String(byte[])
String.String(byte[],String)
byte[]
to an OutputStream
.
byte[]
to chars on a Writer
using the default character encoding of the platform.
byte[]
to chars on a Writer
using the specified character encoding.
char[]
to a Writer
using the default character encoding of the platform.
char[]
to bytes on an
OutputStream
.
char[]
to bytes on an
OutputStream
using the specified character encoding.
CharSequence
to a Writer
.
CharSequence
to bytes on an
OutputStream
using the default character encoding of the
platform.
CharSequence
to bytes on an
OutputStream
using the specified character encoding.
String
to a Writer
.
String
to bytes on an
OutputStream
using the default character encoding of the
platform.
String
to bytes on an
OutputStream
using the specified character encoding.
/dev/null
.
/dev/null
.
/dev/null
.
Writer
character stream.
xml
type, application/xml.
SAXParser
to determine
the namespace URI and local name of the root element of an XML file.
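The last entry describes using a SAXParser to determine the namespace URI and local name of an XML file's root element. One way to sketch that with the JDK alone is to parse with a namespace-aware SAXParser and abort at the first start-element event. The class name `RootElementSketch`, the abort-by-exception approach, and the empty-entity resolver are assumptions for illustration, not the library's actual implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import javax.xml.namespace.QName;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class; names are illustrative only.
final class RootElementSketch {
    private RootElementSketch() {
    }

    // Returns the qualified name of the root element, or null if none was found.
    static QName extractRoot(InputStream xml) throws IOException {
        final QName[] root = new QName[1];
        try {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true);
            factory.setValidating(false);
            factory.newSAXParser().parse(xml, new DefaultHandler() {
                @Override
                public void startElement(String uri, String localName,
                        String qName, Attributes attributes) throws SAXException {
                    // The first start-element event is the root; record it and stop.
                    root[0] = new QName(uri, localName);
                    throw new SAXException("root element found, aborting parse");
                }

                @Override
                public InputSource resolveEntity(String publicId, String systemId) {
                    // Resolve every external entity to empty content so that
                    // no network or file access happens during the parse.
                    return new InputSource(new ByteArrayInputStream(new byte[0]));
                }
            });
        } catch (SAXException e) {
            // Either the deliberate abort above or a malformed document;
            // root[0] stays null in the second case.
        } catch (ParserConfigurationException e) {
            return null;
        }
        return root[0];
    }
}
```

For example, `extractRoot` on `<a:doc xmlns:a="urn:x"/>` yields a QName with namespace URI `urn:x` and local part `doc`; input that is not well-formed XML yields null.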