org.apache.tika.sax
Class ToXMLContentHandler

java.lang.Object
  extended by org.xml.sax.helpers.DefaultHandler
      extended by org.apache.tika.sax.ToTextContentHandler
          extended by org.apache.tika.sax.ToXMLContentHandler
All Implemented Interfaces:
ContentHandler, DTDHandler, EntityResolver, ErrorHandler
Direct Known Subclasses:
ToHTMLContentHandler

public class ToXMLContentHandler
extends ToTextContentHandler

SAX event handler that serializes the XML document to a character stream. The incoming SAX events are expected to be well-formed (properly nested, etc.) and to explicitly include namespace declaration attributes and corresponding namespace prefixes in element and attribute names.

Since:
Apache Tika 0.10
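
The class description asks for SAX events that carry namespace declarations as attributes and prefixed names as qNames. As a rough illustration using only the JDK, the sketch below turns on the standard SAX namespace-prefixes feature and records what a parser then reports. The class NamespacePrefixEvents and its startTags method are hypothetical names invented for this example and are not part of Tika.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

/** Hypothetical demo class; not part of Tika. */
final class NamespacePrefixEvents {

    /** Records each start tag as qName followed by its attribute list. */
    static List<String> startTags(String xml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        // Ask the parser to report xmlns declarations as ordinary attributes
        // and to supply prefixed qNames for elements and attributes.
        factory.setFeature("http://xml.org/sax/features/namespace-prefixes", true);
        SAXParser parser = factory.newSAXParser();

        List<String> seen = new ArrayList<>();
        DefaultHandler recorder = new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                List<String> attrs = new ArrayList<>();
                for (int i = 0; i < atts.getLength(); i++) {
                    attrs.add(atts.getQName(i) + "=" + atts.getValue(i));
                }
                seen.add(qName + attrs);
            }
        };
        parser.parse(new InputSource(new StringReader(xml)), recorder);
        return seen;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<a:root xmlns:a=\"urn:example\"><a:child id=\"1\"/></a:root>";
        System.out.println(startTags(xml));
    }
}
```

With the feature enabled, the xmlns:a declaration appears in the Attributes of the root element, which is the event shape the description above calls for. A real application would pass a ToXMLContentHandler in place of the recording handler.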

Field Summary
protected  boolean inStartElement
           
protected  Map<String,String> namespaces
           
 
Constructor Summary
ToXMLContentHandler()
           
ToXMLContentHandler(OutputStream stream, String encoding)
          Creates an XML serializer that writes to the given byte stream using the given character encoding.
ToXMLContentHandler(String encoding)
           
 
Method Summary
 void characters(char[] ch, int start, int length)
          Writes the given characters to the given character stream.
 void endElement(String uri, String localName, String qName)
           
 void startDocument()
          Writes the XML declaration that prefixes the serialized document.
 void startElement(String uri, String localName, String qName, Attributes atts)
           
 void startPrefixMapping(String prefix, String uri)
           
protected  void write(char ch)
          Writes the given character as-is.
protected  void write(String string)
          Writes the given string of characters as-is.
 
Methods inherited from class org.apache.tika.sax.ToTextContentHandler
endDocument, ignorableWhitespace, toString
 
Methods inherited from class org.xml.sax.helpers.DefaultHandler
endPrefixMapping, error, fatalError, notationDecl, processingInstruction, resolveEntity, setDocumentLocator, skippedEntity, unparsedEntityDecl, warning
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Field Detail

inStartElement

protected boolean inStartElement

namespaces

protected final Map<String,String> namespaces
Constructor Detail

ToXMLContentHandler

public ToXMLContentHandler(OutputStream stream,
                           String encoding)
                    throws UnsupportedEncodingException
Creates an XML serializer that writes to the given byte stream using the given character encoding.

Parameters:
stream - output stream
encoding - output encoding
Throws:
UnsupportedEncodingException - if the encoding is unsupported
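
To illustrate how an encoding name interacts with a byte stream, here is a minimal JDK-only sketch. It assumes, without confirmation from this page, that the serializer wraps the stream in something like an OutputStreamWriter. The class EncodedStreamSketch and its methods are hypothetical names used only for this example.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UnsupportedEncodingException;
import java.io.Writer;

/** Hypothetical demo class; not part of Tika. */
final class EncodedStreamSketch {

    /** Encodes text to bytes through a writer layered over a byte stream. */
    static byte[] encode(String text, String encoding) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        // OutputStreamWriter throws UnsupportedEncodingException
        // when the named charset is not available in this JVM.
        try (Writer writer = new OutputStreamWriter(bytes, encoding)) {
            writer.write(text);
        }
        return bytes.toByteArray();
    }

    /** Returns true if the JDK rejects the given encoding name. */
    static boolean isRejected(String encoding) {
        try {
            encode("", encoding);
            return false;
        } catch (UnsupportedEncodingException e) {
            return true;
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

The same text can produce different byte counts under different encodings, so the encoding passed to the constructor should match what consumers of the stream expect.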

ToXMLContentHandler

public ToXMLContentHandler(String encoding)

ToXMLContentHandler

public ToXMLContentHandler()
Method Detail

startDocument

public void startDocument()
                   throws SAXException
Writes the XML declaration that prefixes the serialized document.

Specified by:
startDocument in interface ContentHandler
Overrides:
startDocument in class org.xml.sax.helpers.DefaultHandler
Throws:
SAXException

startPrefixMapping

public void startPrefixMapping(String prefix,
                               String uri)
                        throws SAXException
Specified by:
startPrefixMapping in interface ContentHandler
Overrides:
startPrefixMapping in class org.xml.sax.helpers.DefaultHandler
Throws:
SAXException

startElement

public void startElement(String uri,
                         String localName,
                         String qName,
                         Attributes atts)
                  throws SAXException
Specified by:
startElement in interface ContentHandler
Overrides:
startElement in class org.xml.sax.helpers.DefaultHandler
Throws:
SAXException

endElement

public void endElement(String uri,
                       String localName,
                       String qName)
                throws SAXException
Specified by:
endElement in interface ContentHandler
Overrides:
endElement in class org.xml.sax.helpers.DefaultHandler
Throws:
SAXException

characters

public void characters(char[] ch,
                       int start,
                       int length)
                throws SAXException
Description copied from class: ToTextContentHandler
Writes the given characters to the given character stream.

Specified by:
characters in interface ContentHandler
Overrides:
characters in class ToTextContentHandler
Throws:
SAXException

write

protected void write(char ch)
              throws SAXException
Writes the given character as-is.

Parameters:
ch - character to be written
Throws:
SAXException - if the character could not be written

write

protected void write(String string)
              throws SAXException
Writes the given string of characters as-is.

Parameters:
string - string of characters to be written
Throws:
SAXException - if the character string could not be written
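
The phrase "as-is" suggests that the write methods perform no escaping, so a subclass that writes text content through them would need to escape markup characters itself. This page does not say how the class handles escaping, so the sketch below is only an assumption about what such a helper might look like. The class EscapeSketch and its escapeText method are hypothetical and not part of Tika.

```java
/** Hypothetical demo class; not part of Tika. */
final class EscapeSketch {

    /** Replaces the XML markup characters in text content with entity references. */
    static String escapeText(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char ch = text.charAt(i);
            switch (ch) {
                case '<':
                    out.append("&lt;");
                    break;
                case '>':
                    out.append("&gt;");
                    break;
                case '&':
                    out.append("&amp;");
                    break;
                default:
                    // Every other character is copied unchanged.
                    out.append(ch);
            }
        }
        return out.toString();
    }
}
```

A subclass could pass the result of such a helper to write(String) for text content, while passing markup such as tag delimiters unescaped.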


Copyright © 2007-2012 The Apache Software Foundation. All Rights Reserved.