Class ToTextContentHandler

  • All Implemented Interfaces:
    ContentHandler, DTDHandler, EntityResolver, ErrorHandler
    Direct Known Subclasses:
    ToXMLContentHandler

    public class ToTextContentHandler
    extends DefaultHandler
    SAX event handler that writes all character content out to a character stream. No escaping or other transformations are made on the character content.

    As of Tika 1.20, this handler ignores content within <script> and <style> tags.

    Since:
    Apache Tika 0.10