Package org.apache.tika.parser.ctakes
Class CTAKESParser
java.lang.Object
org.apache.tika.parser.ParserDecorator
org.apache.tika.parser.ctakes.CTAKESParser
- All Implemented Interfaces:
Serializable,Parser
CTAKESParser decorates a
Parser and leverages on
CTAKESContentHandler to extract biomedical information from
clinical text using Apache cTAKES.
It is normally called by supplying an instance to
AutoDetectParser, such as:
AutoDetectParser parser = new AutoDetectParser(new CTAKESParser());
It can also be used by giving a Tika Config file similar to:
Because this is a Parser Decorator, and not a normal Parser in it's own right, it isn't normally selected via the Parser Service Loader.
- See Also:
-
Constructor Summary
ConstructorsConstructorDescriptionWraps the default ParserCTAKESParser(TikaConfig config) Wraps the default Parser for this ConfigCTAKESParser(Parser parser) Wraps the specified Parser -
Method Summary
Modifier and TypeMethodDescriptionvoidparse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) Delegates the method call to the decorated parser.Methods inherited from class org.apache.tika.parser.ParserDecorator
getSupportedTypes, getWrappedParser, withFallbacks, withoutTypes, withTypes
-
Constructor Details
-
CTAKESParser
public CTAKESParser()Wraps the default Parser -
CTAKESParser
Wraps the default Parser for this Config -
CTAKESParser
Wraps the specified Parser
-
-
Method Details
-
parse
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException Description copied from class:ParserDecoratorDelegates the method call to the decorated parser. Subclasses should override this method (and usesuper.parse()to invoke the decorated parser) to implement extra decoration.- Specified by:
parsein interfaceParser- Overrides:
parsein classParserDecorator- Parameters:
stream- the document stream (input)handler- handler for the XHTML SAX events (output)metadata- document metadata (input and output)context- parse context- Throws:
IOException- if the document stream could not be readSAXException- if the SAX events could not be processedTikaException- if the document could not be parsed
-
getDecorationName
- Overrides:
getDecorationNamein classParserDecorator- Returns:
- A name/description of the decoration, or null if none available
-