Class AbstractXML2003Parser
- java.lang.Object
- 
- org.apache.tika.parser.AbstractParser
- 
- org.apache.tika.parser.microsoft.xml.AbstractXML2003Parser
 
 
- 
- All Implemented Interfaces:
- Serializable,- Parser
 - Direct Known Subclasses:
- SpreadsheetMLParser,- WordMLParser
 
 public abstract class AbstractXML2003Parser extends AbstractParser - See Also:
- Serialized Form
 
- 
- 
Constructor SummaryConstructors Constructor Description AbstractXML2003Parser()
 - 
Method SummaryAll Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description protected ContentHandlergetContentHandler(ContentHandler ch, Metadata md, ParseContext context)voidparse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context)Parses a document stream into a sequence of XHTML SAX events.protected abstract voidsetContentType(Metadata contentType)- 
Methods inherited from class org.apache.tika.parser.AbstractParserparse
 - 
Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 - 
Methods inherited from interface org.apache.tika.parser.ParsergetSupportedTypes
 
- 
 
- 
- 
- 
Method Detail- 
parsepublic void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException Description copied from interface:ParserParses a document stream into a sequence of XHTML SAX events. Fills in related document metadata in the given metadata object.The given document stream is consumed but not closed by this method. The responsibility to close the stream remains on the caller. Information about the parsing context can be passed in the context parameter. See the parser implementations for the kinds of context information they expect. - Parameters:
- stream- the document stream (input)
- handler- handler for the XHTML SAX events (output)
- metadata- document metadata (input and output)
- context- parse context
- Throws:
- IOException- if the document stream could not be read
- SAXException- if the SAX events could not be processed
- TikaException- if the document could not be parsed
 
 - 
getContentHandlerprotected ContentHandler getContentHandler(ContentHandler ch, Metadata md, ParseContext context) 
 - 
setContentTypeprotected abstract void setContentType(Metadata contentType) 
 
- 
 
-