Package org.apache.tika.sax
Class ToXMLContentHandler
- java.lang.Object
-
- org.xml.sax.helpers.DefaultHandler
-
- org.apache.tika.sax.ToTextContentHandler
-
- org.apache.tika.sax.ToXMLContentHandler
-
- All Implemented Interfaces:
ContentHandler
,DTDHandler
,EntityResolver
,ErrorHandler
- Direct Known Subclasses:
ToHTMLContentHandler
public class ToXMLContentHandler extends ToTextContentHandler
SAX event handler that serializes the XML document to a character stream. The incoming SAX events are expected to be well-formed (properly nested, etc.) and to explicitly include namespace declaration attributes and corresponding namespace prefixes in element and attribute names.- Since:
- Apache Tika 0.10
-
-
Field Summary
Fields Modifier and Type Field Description protected boolean
inStartElement
protected Map<String,String>
namespaces
-
Constructor Summary
Constructors Constructor Description ToXMLContentHandler()
ToXMLContentHandler(OutputStream stream, String encoding)
Creates an XML serializer that writes to the given byte stream using the given character encoding.ToXMLContentHandler(String encoding)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
characters(char[] ch, int start, int length)
Writes the given characters to the given character stream.void
endElement(String uri, String localName, String qName)
void
startDocument()
Writes the XML prefix.void
startElement(String uri, String localName, String qName, Attributes atts)
void
startPrefixMapping(String prefix, String uri)
protected void
write(char ch)
Writes the given character as-is.protected void
write(String string)
Writes the given string of character as-is.-
Methods inherited from class org.apache.tika.sax.ToTextContentHandler
endDocument, ignorableWhitespace, toString
-
Methods inherited from class org.xml.sax.helpers.DefaultHandler
endPrefixMapping, error, fatalError, notationDecl, processingInstruction, resolveEntity, setDocumentLocator, skippedEntity, unparsedEntityDecl, warning
-
-
-
-
Constructor Detail
-
ToXMLContentHandler
public ToXMLContentHandler(OutputStream stream, String encoding) throws UnsupportedEncodingException
Creates an XML serializer that writes to the given byte stream using the given character encoding.- Parameters:
stream
- output streamencoding
- output encoding- Throws:
UnsupportedEncodingException
- if the encoding is unsupported
-
ToXMLContentHandler
public ToXMLContentHandler(String encoding)
-
ToXMLContentHandler
public ToXMLContentHandler()
-
-
Method Detail
-
startDocument
public void startDocument() throws SAXException
Writes the XML prefix.- Specified by:
startDocument
in interfaceContentHandler
- Overrides:
startDocument
in classDefaultHandler
- Throws:
SAXException
-
startPrefixMapping
public void startPrefixMapping(String prefix, String uri) throws SAXException
- Specified by:
startPrefixMapping
in interfaceContentHandler
- Overrides:
startPrefixMapping
in classDefaultHandler
- Throws:
SAXException
-
startElement
public void startElement(String uri, String localName, String qName, Attributes atts) throws SAXException
- Specified by:
startElement
in interfaceContentHandler
- Overrides:
startElement
in classToTextContentHandler
- Throws:
SAXException
-
endElement
public void endElement(String uri, String localName, String qName) throws SAXException
- Specified by:
endElement
in interfaceContentHandler
- Overrides:
endElement
in classToTextContentHandler
- Throws:
SAXException
-
characters
public void characters(char[] ch, int start, int length) throws SAXException
Description copied from class:ToTextContentHandler
Writes the given characters to the given character stream.- Specified by:
characters
in interfaceContentHandler
- Overrides:
characters
in classToTextContentHandler
- Throws:
SAXException
-
write
protected void write(char ch) throws SAXException
Writes the given character as-is.- Parameters:
ch
- character to be written- Throws:
SAXException
- if the character could not be written
-
write
protected void write(String string) throws SAXException
Writes the given string of character as-is.- Parameters:
string
- string of character to be written- Throws:
SAXException
- if the character string could not be written
-
-