| Interface | Description | 
|---|---|
| ContentHandlerFactory | 
 Interface to allow easier injection of code for getting a new ContentHandler 
 | 
| SafeContentHandler.Output | 
 Internal interface that allows both character and
 ignorable whitespace content to be filtered the same way. 
 | 
| Class | Description | 
|---|---|
| AbstractRecursiveParserWrapperHandler | 
 This is a special handler to be used only with the
  
RecursiveParserWrapper. | 
| BasicContentHandlerFactory | 
 Basic factory for creating common types of ContentHandlers 
 | 
| BodyContentHandler | 
 Content handler decorator that only passes everything inside
 the XHTML <body/> tag to the underlying handler. 
 | 
| CleanPhoneText | 
 Class to help de-obfuscate phone numbers in text. 
 | 
| ContentHandlerDecorator | 
 Decorator base class for the  
ContentHandler interface. | 
| DIFContentHandler | |
| ElementMappingContentHandler | 
 Content handler decorator that maps element  
QNames using
 a Map. | 
| ElementMappingContentHandler.TargetElement | |
| EmbeddedContentHandler | 
 Content handler decorator that prevents the  
EmbeddedContentHandler.startDocument()
 and EmbeddedContentHandler.endDocument() events from reaching the decorated handler. | 
| EndDocumentShieldingContentHandler | 
 A wrapper around a  
ContentHandler which will ignore normal
 SAX calls to EndDocumentShieldingContentHandler.endDocument(), and only fire them later. | 
| ExpandedTitleContentHandler | 
 Content handler decorator which wraps a  
TransformerHandler in order to
 allow the TITLE tag to render as <title></title>
 rather than <title/> which is accomplished
 by calling the ContentHandler.characters(char[], int, int) method
 with a length of 1 but a zero length char array. | 
| Link | |
| LinkContentHandler | 
 Content handler that collects links from an XHTML document. 
 | 
| OfflineContentHandler | 
 Content handler decorator that always returns an empty stream from the
  
OfflineContentHandler.resolveEntity(String, String) method to prevent potential
 network or other external resources from being accessed by an XML parser. | 
| PhoneExtractingContentHandler | 
 Class used to extract phone numbers while parsing. 
 | 
| RecursiveParserWrapperHandler | 
 This is the default implementation of  
AbstractRecursiveParserWrapperHandler. | 
| RichTextContentHandler | 
 Content handler for Rich Text, it will extract XHTML <img/>
 tag <alt/> attribute and XHTML <a/> tag <name/>
 attribute into the output. 
 | 
| SafeContentHandler | 
 Content handler decorator that makes sure that the character events
 ( 
SafeContentHandler.characters(char[], int, int) or
 SafeContentHandler.ignorableWhitespace(char[], int, int)) passed to the decorated
 content handler contain only valid XML characters. | 
| SecureContentHandler | 
 Content handler decorator that attempts to prevent denial of service
 attacks against Tika parsers. 
 | 
| StandardOrganizations | 
 This class provides a collection of the most important technical standard organizations. 
 | 
| StandardReference | 
 Class that represents a standard reference. 
 | 
| StandardReference.StandardReferenceBuilder | |
| StandardsExtractingContentHandler | 
 StandardsExtractingContentHandler is a Content Handler used to extract
 standard references while parsing. 
 | 
| StandardsText | 
 StandardText relies on regular expressions to extract standard references
 from text. 
 | 
| TaggedContentHandler | 
 A content handler decorator that tags potential exceptions so that the
 handler that caused the exception can easily be identified. 
 | 
| TeeContentHandler | 
 Content handler proxy that forwards the received SAX events to zero or
 more underlying content handlers. 
 | 
| TextAndAttributeContentHandler | |
| TextContentHandler | 
 Content handler decorator that only passes the
  
TextContentHandler.characters(char[], int, int) and
 (@link TextContentHandler.ignorableWhitespace(char[], int, int)
 (plus TextContentHandler.startDocument() and TextContentHandler.endDocument() events to
 the decorated content handler. | 
| ToHTMLContentHandler | 
 SAX event handler that serializes the HTML document to a character stream. 
 | 
| ToTextContentHandler | 
 SAX event handler that writes all character content out to a character
 stream. 
 | 
| ToXMLContentHandler | 
 SAX event handler that serializes the XML document to a character stream. 
 | 
| WriteOutContentHandler | 
 SAX event handler that writes content up to an optional write
 limit out to a character stream or other decorated handler. 
 | 
| XHTMLContentHandler | 
 Content handler decorator that simplifies the task of producing XHTML
 events for Tika content parsers. 
 | 
| XMPContentHandler | 
 Content handler decorator that simplifies the task of producing XMP output. 
 | 
| Enum | Description | 
|---|---|
| BasicContentHandlerFactory.HANDLER_TYPE | 
 Common handler types for content. 
 | 
| Exception | Description | 
|---|---|
| StoppingEarlyException | 
 Sentinel exception to stop parsing xml once target is found
 while SAX parsing. 
 | 
| TaggedSAXException | 
 A  
SAXException wrapper that tags the wrapped exception with
 a given object reference. | 
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.