public interface TikaCoreProperties
Users of Tika who want consistent metadata across file formats can rely
on these properties: where present, each one has the same semantic
meaning regardless of the source format. For example, if one file format
calls a field Title, another Long-Title and another Long-Name, and all
three mean the same thing as defined by DublinCore.TITLE, then all three
are exposed as DublinCore.TITLE.
For now, most of these properties are composite ones including the deprecated non-prefixed String properties from the Metadata class. In Tika 2.0, most of these will revert back to simple assignments.
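
To make the aliasing idea above concrete, here is a minimal, self-contained sketch in plain Java. It does not use Tika and is not how Tika implements its properties. The class name `CanonicalTitleSketch`, the alias table and the `dc:title` key string are assumptions chosen only to show how several format-specific names can resolve to one consistent key.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: models "many format-specific names, one consistent property".
// Nothing here is Tika API; the alias table and key string are hypothetical.
final class CanonicalTitleSketch {

    // Assumed canonical key standing in for DublinCore.TITLE.
    static final String CANONICAL_TITLE = "dc:title";

    // Hypothetical format-specific field names that all mean "title".
    private static final Map<String, String> ALIASES = Map.of(
            "Title", CANONICAL_TITLE,
            "Long-Title", CANONICAL_TITLE,
            "Long-Name", CANONICAL_TITLE);

    // Copies the raw fields into a new map, renaming known aliases to the
    // canonical key and leaving every other field name unchanged.
    static Map<String, String> normalize(Map<String, String> rawFields) {
        Map<String, String> normalized = new LinkedHashMap<>();
        for (Map.Entry<String, String> field : rawFields.entrySet()) {
            String key = ALIASES.getOrDefault(field.getKey(), field.getKey());
            normalized.put(key, field.getValue());
        }
        return normalized;
    }

    public static void main(String[] args) {
        // Two hypothetical formats that name the same concept differently.
        System.out.println(normalize(Map.of("Long-Title", "Annual Report")));
        System.out.println(normalize(Map.of("Long-Name", "Annual Report")));
    }
}
```

With tika-core on the classpath, the corresponding lookup would be `metadata.get(TikaCoreProperties.TITLE)`, whichever parser produced the metadata.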

| Modifier and Type | Interface and Description |
|---|---|
| static class | TikaCoreProperties.EmbeddedResourceType: A file might contain different types of embedded documents. |

| Modifier and Type | Field and Description |
|---|---|
| static Property | ALTITUDE |
| static Property | COMMENTS |
| static Property | CONTAINER_EXCEPTION |
| static Property | CONTENT_TYPE_HINT: This is currently used to identify Content-Type that may be included within a document, such as in html documents (e.g. …) |
| static Property | CONTENT_TYPE_PARSER_OVERRIDE: This is used by parsers to override detection of embedded resources with the override detector. |
| static Property | CONTENT_TYPE_USER_OVERRIDE: This is used by users to override detection with the override detector. |
| static Property | CONTRIBUTOR |
| static Property | COVERAGE |
| static Property | CREATED |
| static Property | CREATOR |
| static Property | CREATOR_TOOL |
| static Property | DESCRIPTION |
| static Property | EMBEDDED_DEPTH |
| static Property | EMBEDDED_EXCEPTION |
| static String | EMBEDDED_RELATIONSHIP_ID |
| static Property | EMBEDDED_RESOURCE_PATH |
| static Property | EMBEDDED_RESOURCE_TYPE: Embedded resource type property |
| static String | EMBEDDED_RESOURCE_TYPE_KEY |
| static String | EMBEDDED_STORAGE_CLASS_ID |
| static Property | FORMAT |
| static Property | HAS_SIGNATURE |
| static Property | IDENTIFIER |
| static Property | LANGUAGE |
| static Property | LATITUDE |
| static Property | LONGITUDE |
| static Property | METADATA_DATE |
| static Property | MODIFIED |
| static Property | MODIFIER |
| static String | NAMESPACE_PREFIX_DELIMITER: The common delimiter used between the namespace abbreviation and the property name |
| static Property | ORIGINAL_RESOURCE_NAME: Some file formats can store information about their original file name/location or about their attachment's original file name/location within the file. |
| static Property | PARSE_TIME_MILLIS |
| static Property | PRINT_DATE |
| static String | PROTECTED |
| static Property | PUBLISHER |
| static Property | RATING |
| static Property | RELATION |
| static String | RESOURCE_NAME_KEY |
| static Property | RIGHTS |
| static Property | SOURCE |
| static Property | SOURCE_PATH: This should be used to store the path (relative or full) of the source file, including the file name, e.g. … |
| static Property | SUBJECT: DublinCore.SUBJECT; should include both subject and keywords if a document format has both. |
| static Property | TIKA_CONTENT |
| static Property | TIKA_CONTENT_HANDLER: Simple class name of the content handler |
| static Property | TIKA_META_EXCEPTION_EMBEDDED_STREAM: Use this to store exceptions caught while trying to read the stream of an embedded resource. |
| static String | TIKA_META_EXCEPTION_PREFIX: Use this to store parse exception information in the Metadata object. |
| static Property | TIKA_META_EXCEPTION_WARNING: Use this to store exceptions caught during a parse that are non-fatal, e.g. … |
| static String | TIKA_META_PREFIX: Use this to prefix metadata properties that store information about the parsing process. |
| static Property | TIKA_PARSED_BY |
| static Property | TITLE |
| static Property | TYPE |
| static Property | WRITE_LIMIT_REACHED |
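
The descriptions of NAMESPACE_PREFIX_DELIMITER and TIKA_META_PREFIX above imply that metadata keys are built from a namespace abbreviation, a delimiter and a property name. The following plain-Java sketch illustrates that composition and how a consumer might separate parse-process keys from document metadata. It is not Tika code: the class `PrefixedKeySketch`, its methods, and the literal values `":"` and `"X-TIKA"` are assumptions made for illustration, so check the actual constants in the Tika version you use.

```java
// Illustrative only: shows how a namespace prefix, a delimiter and a property
// name can combine into one metadata key. The literal values are assumptions,
// not values confirmed by this page.
final class PrefixedKeySketch {

    // Assumed stand-in for NAMESPACE_PREFIX_DELIMITER.
    static final String DELIMITER = ":";

    // Assumed stand-in for a parse-process prefix such as TIKA_META_PREFIX.
    static final String PARSE_INFO_PREFIX = "X-TIKA" + DELIMITER;

    // Joins a namespace abbreviation and a property name with the delimiter.
    static String qualify(String namespace, String propertyName) {
        return namespace + DELIMITER + propertyName;
    }

    // Returns true when the key carries the parse-process prefix.
    static boolean isParseInfo(String key) {
        return key.startsWith(PARSE_INFO_PREFIX);
    }

    public static void main(String[] args) {
        String documentKey = qualify("dc", "title");
        String parseKey = PARSE_INFO_PREFIX + "Parsed-By"; // hypothetical property name
        System.out.println(documentKey + " parse info? " + isParseInfo(documentKey));
        System.out.println(parseKey + " parse info? " + isParseInfo(parseKey));
    }
}
```

A shared prefix makes it cheap to filter out information about the parsing process when only the document's own metadata is wanted.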
static final String NAMESPACE_PREFIX_DELIMITER
static final String TIKA_META_PREFIX
static final Property EMBEDDED_DEPTH
static final Property EMBEDDED_RESOURCE_PATH
static final Property PARSE_TIME_MILLIS
static final Property TIKA_CONTENT_HANDLER
static final Property TIKA_CONTENT
static final String TIKA_META_EXCEPTION_PREFIX
static final Property CONTAINER_EXCEPTION
static final Property EMBEDDED_EXCEPTION
static final Property WRITE_LIMIT_REACHED
static final Property TIKA_META_EXCEPTION_WARNING
static final Property TIKA_META_EXCEPTION_EMBEDDED_STREAM
static final Property TIKA_PARSED_BY
static final String RESOURCE_NAME_KEY
static final String PROTECTED
static final String EMBEDDED_RELATIONSHIP_ID
static final String EMBEDDED_STORAGE_CLASS_ID
static final String EMBEDDED_RESOURCE_TYPE_KEY
static final Property ORIGINAL_RESOURCE_NAME
static final Property SOURCE_PATH
This can also be used for a primary key within a database.
static final Property CONTENT_TYPE_HINT
static final Property CONTENT_TYPE_USER_OVERRIDE
static final Property CONTENT_TYPE_PARSER_OVERRIDE
static final Property FORMAT
See also: DublinCore.FORMAT
static final Property IDENTIFIER
See also: DublinCore.IDENTIFIER
static final Property CONTRIBUTOR
See also: DublinCore.CONTRIBUTOR
static final Property COVERAGE
See also: DublinCore.COVERAGE
static final Property CREATOR
See also: DublinCore.CREATOR
static final Property MODIFIER
See also: Office.LAST_AUTHOR
static final Property CREATOR_TOOL
See also: XMP.CREATOR_TOOL
static final Property LANGUAGE
See also: DublinCore.LANGUAGE
static final Property PUBLISHER
See also: DublinCore.PUBLISHER
static final Property RELATION
See also: DublinCore.RELATION
static final Property RIGHTS
See also: DublinCore.RIGHTS
static final Property SOURCE
See also: DublinCore.SOURCE
static final Property TYPE
See also: DublinCore.TYPE
static final Property TITLE
See also: DublinCore.TITLE
static final Property DESCRIPTION
See also: DublinCore.DESCRIPTION
static final Property SUBJECT
DublinCore.SUBJECT; should include both subject and keywords
if a document format has both. See also Office.KEYWORDS
and OfficeOpenXMLCore.SUBJECT.
static final Property CREATED
See also: DublinCore.DATE
static final Property MODIFIED
See also: DublinCore.MODIFIED, Office.SAVE_DATE
static final Property PRINT_DATE
See also: Office.PRINT_DATE
static final Property METADATA_DATE
See also: XMP.METADATA_DATE
static final Property LATITUDE
See also: Geographic.LATITUDE
static final Property LONGITUDE
See also: Geographic.LONGITUDE
static final Property ALTITUDE
See also: Geographic.ALTITUDE
static final Property RATING
See also: XMP.RATING
static final Property COMMENTS
See also: OfficeOpenXMLExtended.COMMENTS
static final Property EMBEDDED_RESOURCE_TYPE
static final Property HAS_SIGNATURE
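
The exception-related fields listed above (for example TIKA_META_EXCEPTION_WARNING) are described as places to store exceptions caught during a parse that are non-fatal. The plain-Java sketch below illustrates that pattern with a simple multi-valued map standing in for a metadata object. It is not Tika code: the class `WarningRecorderSketch`, its methods, the key string `"X-TIKA:EXCEPTION:warn"`, and the choice to store the full stack trace as text are all assumptions made for illustration.

```java
import java.io.PrintWriter;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: records non-fatal exceptions under one metadata key so
// that processing can continue. The key string and storage format are assumed.
final class WarningRecorderSketch {

    // Assumed stand-in for the key behind TIKA_META_EXCEPTION_WARNING.
    static final String WARNING_KEY = "X-TIKA:EXCEPTION:warn";

    // A multi-valued map standing in for a metadata object.
    private final Map<String, List<String>> metadata = new HashMap<>();

    // Appends the stack trace of a non-fatal exception to the warning key.
    void recordWarning(Throwable nonFatal) {
        StringWriter trace = new StringWriter();
        nonFatal.printStackTrace(new PrintWriter(trace, true));
        metadata.computeIfAbsent(WARNING_KEY, key -> new ArrayList<>())
                .add(trace.toString());
    }

    // Returns the recorded warnings, or an empty list if there are none.
    List<String> warnings() {
        return metadata.getOrDefault(WARNING_KEY, List.of());
    }

    public static void main(String[] args) {
        WarningRecorderSketch recorder = new WarningRecorderSketch();
        try {
            Integer.parseInt("not-a-number"); // stands in for a recoverable parse step
        } catch (NumberFormatException e) {
            recorder.recordWarning(e); // keep going instead of failing the whole parse
        }
        System.out.println("warnings recorded: " + recorder.warnings().size());
    }
}
```

Keeping warnings next to the extracted metadata lets a caller inspect what went wrong after the parse without losing the content that was successfully extracted.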
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.