Class CoreNLPNERecogniser
java.lang.Object
org.apache.tika.parser.ner.corenlp.CoreNLPNERecogniser
- All Implemented Interfaces:
NERecogniser
This class offers an implementation of
NERecogniser based on
CRF classifiers from Stanford CoreNLP. This NER requires additional setup,
due to runtime binding to Stanford CoreNLP.
See
Tika NER Wiki for configuring this recogniser.- See Also:
-
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final Stringdefault Model pathstatic final Stringstatic final Stringstatic final Stringstatic final StringFields inherited from interface org.apache.tika.parser.ner.NERecogniser
DATE, LOCATION, MISCELLANEOUS, MONEY, ORGANIZATION, PERCENT, PERSON, TIME -
Constructor Summary
ConstructorsConstructorDescriptionCoreNLPNERecogniser(String modelPath) Creates a NERecogniser by loading model from given path -
Method Summary
Modifier and TypeMethodDescriptionGets set of entity types recognised by this recogniserbooleanchecks if this Named Entity recogniser is available for servicestatic voidrecognises names of entities in the text
-
Field Details
-
NER_3CLASS_MODEL
- See Also:
-
NER_4CLASS_MODEL
- See Also:
-
NER_7CLASS_MODEL
- See Also:
-
DEFAULT_MODEL_PATH
default Model path- See Also:
-
MODEL_PROP_NAME
- See Also:
-
ENTITY_TYPES
-
-
Constructor Details
-
CoreNLPNERecogniser
public CoreNLPNERecogniser() -
CoreNLPNERecogniser
Creates a NERecogniser by loading model from given path- Parameters:
modelPath- path to NER model file
-
-
Method Details
-
main
- Throws:
IOExceptioncom.github.openjson.JSONException
-
isAvailable
public boolean isAvailable()Description copied from interface:NERecogniserchecks if this Named Entity recogniser is available for service- Specified by:
isAvailablein interfaceNERecogniser- Returns:
trueif model was available, valid and was able to initialise the classifier. returnsfalsewhen this recogniser is not available for service.
-
getEntityTypes
Gets set of entity types recognised by this recogniser- Specified by:
getEntityTypesin interfaceNERecogniser- Returns:
- set of entity classes/types
-
recognise
recognises names of entities in the text- Specified by:
recognisein interfaceNERecogniser- Parameters:
text- text which possibly contains names- Returns:
- map of entity type -> set of names
-