Package org.apache.tika.sax
Class BodyContentHandler
java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.ContentHandlerDecorator
org.apache.tika.sax.BodyContentHandler
- All Implemented Interfaces:
ContentHandler
,DTDHandler
,EntityResolver
,ErrorHandler
Content handler decorator that only passes everything inside
the XHTML <body/> tag to the underlying handler. Note that
the <body/> tag itself is not passed on.
-
Constructor Summary
ConstructorDescriptionCreates a content handler that writes XHTML body character events to an internal string buffer.BodyContentHandler
(int writeLimit) Creates a content handler that writes XHTML body character events to an internal string buffer.BodyContentHandler
(Writer writer) Creates a content handler that writes XHTML body character events to the given writer.BodyContentHandler
(ContentHandler handler) Creates a content handler that passes all XHTML body events to the given underlying content handler. -
Method Summary
Methods inherited from class org.apache.tika.sax.ContentHandlerDecorator
characters, endDocument, endElement, endPrefixMapping, error, fatalError, handleException, ignorableWhitespace, processingInstruction, setContentHandler, setDocumentLocator, skippedEntity, startDocument, startElement, startPrefixMapping, toString, warning
Methods inherited from class org.xml.sax.helpers.DefaultHandler
notationDecl, resolveEntity, unparsedEntityDecl
-
Constructor Details
-
BodyContentHandler
Creates a content handler that passes all XHTML body events to the given underlying content handler.- Parameters:
handler
- content handler
-
BodyContentHandler
Creates a content handler that writes XHTML body character events to the given writer.- Parameters:
writer
- writer
-
BodyContentHandler
public BodyContentHandler(int writeLimit) Creates a content handler that writes XHTML body character events to an internal string buffer. The contents of the buffer can be retrieved using theContentHandlerDecorator.toString()
method.The internal string buffer is bounded at the given number of characters. If this write limit is reached, then a
SAXException
is thrown.- Parameters:
writeLimit
- maximum number of characters to include in the string, or -1 to disable the write limit- Since:
- Apache Tika 0.7
-
BodyContentHandler
public BodyContentHandler()Creates a content handler that writes XHTML body character events to an internal string buffer. The contents of the buffer can be retrieved using theContentHandlerDecorator.toString()
method.The internal string buffer is bounded at 100k characters. If this write limit is reached, then a
SAXException
is thrown.
-