|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
TemporaryResources.close()
method is called.
.../@*
XPath expression.available()
method.
DefaultExtractor
extraction rules and "delegate" as the content handler.
XHTMLContentHandler
object received as parameter.
null
if the stream is not a TikaInputStream.
CharsetDetector
provides a facility for detecting the
charset or encoding of character data in an unknown format..../*...
XPath expression.://translated.by/you/microsoft-s-html-help-chm-format-incomplete/original
/?show-translation-form=1
ClosedInputStream
sentinel.
close()
method.
Reader
.
Channel
.
Writer
.
InputStream
.
OutputStream
.
ID3Tags
in preference order, and when asked for
a given tag, will return it from the first ID3Tags
that has it.ContentHandler
interface.InputStream
to an
OutputStream
.
InputStream
to chars on a
Writer
using the default character encoding of the platform.
InputStream
to chars on a
Writer
using the specified character encoding.
Reader
to a Writer
.
Reader
to bytes on an
OutputStream
using the default character encoding of the
platform, and calling flush.
Reader
to bytes on an
OutputStream
using the specified character encoding, and
calling flush.
InputStream
to an
OutputStream
.
Reader
to a Writer
.
TemporaryResources.close()
method is called.
Detector
implementations
available through the service provider mechanism
.Parser
implementations
available through the
service provider mechanism
.null
if the encoding of the document can not be detected.
POIFSContainerDetector.detect(Set, DirectoryEntry)
and pass the root
entry of the filesystem whose type is to be detected, as a
second argument.
TemporaryResources.close()
method and wraps the potential
IOException
into a TikaException
for convenience
when used within Tika.
ContainerExtractor
instance.QName
s using
a Map
.EmbeddedContentHandler.startDocument()
and EmbeddedContentHandler.endDocument()
events from reaching the decorated handler.EmbeddedContentHandler.startDocument()
and EmbeddedContentHandler.endDocument()
events.
ContentHandler
which will ignore normal
SAX calls to EndDocumentShieldingContentHandler.endDocument()
, and only fire them later.XHTMLContentHandler.NL
character.
*.html
files.TikaException
without even
attempting to parse the given document stream.ExternalParsersConfigReader
.Metadata
map to XMP namespaces.POIXMLTextExtractor.getMetadataTextExtractor()
not yet supported
for OOXML by POI.
Tika.detect(String)
instead
Tika.detect(File)
instead
TikaConfig.getParser()
method instead
InputStream
emulates.
length
bytes from the
given stream.
SAXException
s thrown, wrapping and re-throwing.
NetCDFParser
depends on the NetCDF-Java API,
we are able to use it to parse HDF files as well.Throwable
constructors missing before Java 6.HtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
true
if this parser is configured to listen
for all records instead of just the specified few.
Metadata
to XMP convertersHtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
HtmlMapper
mechanism to customize
the HTML mapping. This method will be removed in Tika 1.0.
mark(int)
method.
markSupported()
method.
true
if the XPath expression matches the named
attribute of the element associated with this evaluation state.
true
if the XPath expression matches
the element associated with this evaluation state.
true
if the XPath expression matches all text
nodes whose parent is the element associated with this evaluation
state.
Metadata
fields.AttributeMetadataHandler
and
ElementMetadataHandler
classes insteadMimeTypesReader
.Mp3Parser
is used to parse ID3 Version 1 Tag information
from an MP3 file, if available..../@name
XPath expression..../name...
XPath
expression.Parser
for NetCDF
files using the UCAR, MIT-licensed NetCDF for Java
API..../node()
XPath expression.InputStream
that emulates
a stream of a specified size.InputStream
that emulates a specified size
which supports marking and does not throw EOFException.
InputStream
that emulates a specified
size with option settings.
root
type, application/octet-stream.
OfflineContentHandler.resolveEntity(String, String)
method to prevent potential
network or other external resources from being accessed by an XML parser.OOXMLExtractor
for the supplied document and
returns it.content.xml
files.meta.xml
files.OpenDocumentParser
class instead.
This class will be removed in Apache Tika 1.0.Parser.parse(InputStream, ContentHandler, Metadata, ParseContext)
method instead
Appendable
.
ContainerExtractor
powered by the regular
Parser
API.Parser
interface.PasswordProvider
on the ParseContext
instead
text
type, text/plain.
Metadata
-object into an XMPMeta
containing the useful
properties.
read()
method.
read(byte[], offset, length)
method.
read()
method.
read(byte[])
method.
read(byte[], int, int)
method.
InputStream
as a list of Strings,
one entry per line, using the default character encoding of the platform.
InputStream
as a list of Strings,
one entry per line, using the specified character encoding.
Reader
as a list of Strings,
one entry per line.
Namespace
information with XMPCore.
reset()
method.
SafeContentHandler.characters(char[], int, int)
or
SafeContentHandler.ignorableWhitespace(char[], int, int)
) passed to the decorated
content handler contain only valid XML characters.TemporaryResources.createTemporaryFile()
method.
skip(long)
method.
...//...
XPath expression.IOException
wrapper that tags the wrapped exception with
a given object reference.SAXException
wrapper that tags the wrapped exception with
a given object reference.TextContentHandler.characters(char[], int, int)
and
(@link TextContentHandler.ignorableWhitespace(char[], int, int)
(plus TextContentHandler.startDocument()
and TextContentHandler.endDocument()
events to
the decorated content handler.TextDetector
which will look at the default number
of bytes from the beginning of the document.
TextDetector
which will look at a given number of
bytes from the beginning of the document.
.../text()
XPath expression.RuntimeException
with the given
throwable as the root cause, thus interrupting the entire service
loading operation.
SAXException
to a corresponding
TikaException
if it's caused by this instance detecting
a zip bomb.
ServiceLoader
class to work correctly in an OSGi environment.Parser
implementations available through the
service provider mechanism
in the given
class loader.
InputStream
as a byte[]
.
Reader
as a byte[]
using the default character encoding of the platform.
Reader
as a byte[]
using the specified character encoding.
String.getBytes()
InputStream
as a character array
using the default character encoding of the platform.
InputStream
as a character array
using the specified character encoding.
Reader
as a character array.
InputStream
as a String
using the default character encoding of the platform.
InputStream
as a String
using the specified character encoding.
Reader
as a String.
String.String(byte[])
String.String(byte[],String)
Logger
created using the given class name.
byte[]
to an OutputStream
.
byte[]
to chars on a Writer
using the default character encoding of the platform.
byte[]
to chars on a Writer
using the specified character encoding.
char[]
to a Writer
using the default character encoding of the platform.
char[]
to bytes on an
OutputStream
.
char[]
to bytes on an
OutputStream
using the specified character encoding.
CharSequence
to a Writer
.
CharSequence
to bytes on an
OutputStream
using the default character encoding of the
platform.
CharSequence
to bytes on an
OutputStream
using the specified character encoding.
String
to a Writer
.
String
to bytes on an
OutputStream
using the default character encoding of the
platform.
String
to bytes on an
OutputStream
using the specified character encoding.
/dev/null
.
/dev/null
.
/dev/null
.
xml
type, application/xml.
SAXParser
to determine
the namespace URI and local name of the root element of an XML file.
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |