Package org.apache.tika.sax
Class BodyContentHandler
- java.lang.Object
-
- org.xml.sax.helpers.DefaultHandler
-
- org.apache.tika.sax.ContentHandlerDecorator
-
- org.apache.tika.sax.BodyContentHandler
-
- All Implemented Interfaces:
ContentHandler
,DTDHandler
,EntityResolver
,ErrorHandler
public class BodyContentHandler extends ContentHandlerDecorator
Content handler decorator that only passes everything inside the XHTML <body/> tag to the underlying handler. Note that the <body/> tag itself is not passed on.
-
-
Constructor Summary
Constructors Constructor Description BodyContentHandler()
Creates a content handler that writes XHTML body character events to an internal string buffer.BodyContentHandler(int writeLimit)
Creates a content handler that writes XHTML body character events to an internal string buffer.BodyContentHandler(OutputStream stream)
Creates a content handler that writes XHTML body character events to the given output stream using the default encoding.BodyContentHandler(Writer writer)
Creates a content handler that writes XHTML body character events to the given writer.BodyContentHandler(ContentHandler handler)
Creates a content handler that passes all XHTML body events to the given underlying content handler.
-
Method Summary
-
Methods inherited from class org.apache.tika.sax.ContentHandlerDecorator
characters, endDocument, endElement, endPrefixMapping, handleException, ignorableWhitespace, processingInstruction, setContentHandler, setDocumentLocator, skippedEntity, startDocument, startElement, startPrefixMapping, toString
-
Methods inherited from class org.xml.sax.helpers.DefaultHandler
error, fatalError, notationDecl, resolveEntity, unparsedEntityDecl, warning
-
-
-
-
Constructor Detail
-
BodyContentHandler
public BodyContentHandler(ContentHandler handler)
Creates a content handler that passes all XHTML body events to the given underlying content handler.- Parameters:
handler
- content handler
-
BodyContentHandler
public BodyContentHandler(Writer writer)
Creates a content handler that writes XHTML body character events to the given writer.- Parameters:
writer
- writer
-
BodyContentHandler
public BodyContentHandler(OutputStream stream)
Creates a content handler that writes XHTML body character events to the given output stream using the default encoding.- Parameters:
stream
- output stream
-
BodyContentHandler
public BodyContentHandler(int writeLimit)
Creates a content handler that writes XHTML body character events to an internal string buffer. The contents of the buffer can be retrieved using theContentHandlerDecorator.toString()
method.The internal string buffer is bounded at the given number of characters. If this write limit is reached, then a
SAXException
is thrown.- Parameters:
writeLimit
- maximum number of characters to include in the string, or -1 to disable the write limit- Since:
- Apache Tika 0.7
-
BodyContentHandler
public BodyContentHandler()
Creates a content handler that writes XHTML body character events to an internal string buffer. The contents of the buffer can be retrieved using theContentHandlerDecorator.toString()
method.The internal string buffer is bounded at 100k characters. If this write limit is reached, then a
SAXException
is thrown.
-
-