Package org.apache.tika.parser.ctakes
Class CTAKESContentHandler
java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.ContentHandlerDecorator
org.apache.tika.parser.ctakes.CTAKESContentHandler
- All Implemented Interfaces:
ContentHandler,DTDHandler,EntityResolver,ErrorHandler
Class used to extract biomedical information while parsing.
This class relies on Apache cTAKES that is a natural language processing system for extraction of information from electronic medical record clinical free-text.
-
Field Summary
Fields -
Constructor Summary
ConstructorsConstructorDescriptionDefault constructor.CTAKESContentHandler(ContentHandler handler, Metadata metadata) Creates a newCTAKESContentHandlerfor the givenContentHandlerand Metadata objects.CTAKESContentHandler(ContentHandler handler, Metadata metadata, CTAKESConfig config) Creates a newCTAKESContentHandlerfor the givenContentHandlerand Metadata objects. -
Method Summary
Modifier and TypeMethodDescriptionvoidcharacters(char[] ch, int start, int length) voidReturns metadata that includes cTAKES annotations.Methods inherited from class org.apache.tika.sax.ContentHandlerDecorator
endElement, endPrefixMapping, error, fatalError, handleException, ignorableWhitespace, processingInstruction, setContentHandler, setDocumentLocator, skippedEntity, startDocument, startElement, startPrefixMapping, toString, warningMethods inherited from class org.xml.sax.helpers.DefaultHandler
notationDecl, resolveEntity, unparsedEntityDecl
-
Field Details
-
CTAKES_META_PREFIX
-
-
Constructor Details
-
CTAKESContentHandler
Creates a newCTAKESContentHandlerfor the givenContentHandlerand Metadata objects.- Parameters:
handler- theContentHandlerobject to be decorated.metadata- theMetadataobject that will be populated using biomedical information extracted by cTAKES.config- theCTAKESConfigobject used to configure the handler.
-
CTAKESContentHandler
Creates a newCTAKESContentHandlerfor the givenContentHandlerand Metadata objects.- Parameters:
handler- theContentHandlerobject to be decorated.metadata- theMetadataobject that will be populated using biomedical information extracted by cTAKES.
-
CTAKESContentHandler
public CTAKESContentHandler()Default constructor.
-
-
Method Details
-
characters
- Specified by:
charactersin interfaceContentHandler- Overrides:
charactersin classContentHandlerDecorator- Throws:
SAXException
-
endDocument
- Specified by:
endDocumentin interfaceContentHandler- Overrides:
endDocumentin classContentHandlerDecorator- Throws:
SAXException
-
getMetadata
Returns metadata that includes cTAKES annotations.- Returns:
Metadataobject that includes cTAKES annotations.
-