Package org.apache.tika.metadata
Interface Office
public interface Office
Office Document properties collection. These properties apply to
Office / Productivity Documents of all forms, including (but not limited
to) MS Office and OpenDocument formats.
This is a logical collection of properties, which may be drawn from a
few different external definitions.
- Since:
- Apache Tika 1.2
-
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final PropertyName of the principal author(s) of a documentstatic final PropertyThe number of Characters in the documentstatic final PropertyThe number of Characters in the document, including spacesstatic final PropertyWhen was the document created?static final PropertyThe number of Images in the documentstatic final PropertyName of the initial creator/author of a documentstatic final PropertyKeywords pertaining to a document.static final PropertyName of the last (most recent) author of a documentstatic final PropertyThe number of lines in the documentstatic final Propertystatic final Propertystatic final PropertyMAPI message class.static final Propertystatic final Propertystatic final Stringstatic final PropertyThe number of Objects in the document.static final Propertystatic final PropertyThe number of Pages are there in the (paged) documentstatic final PropertyThe number of individual Paragraphs in the documentstatic final Stringstatic final PropertyWhen was the document last printed?static final PropertyEmbedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DCstatic final PropertyWhen was the document last saved?static final PropertyThe number of Slides are there in the (presentation) documentstatic final PropertyThe number of Tables in the documentstatic final StringFor user defined metadata entries in the document, what prefix should be attached to the key names.static final PropertyThe number of Words in the document
-
Field Details
-
NAMESPACE_URI_DOC_META
- See Also:
-
PREFIX_DOC_META
- See Also:
-
USER_DEFINED_METADATA_NAME_PREFIX
For user defined metadata entries in the document, what prefix should be attached to the key names. eg <meta:user-defined meta:name="Info1">Text1</meta:user-defined> becomes custom:Info1=Text1- See Also:
-
KEYWORDS
Keywords pertaining to a document. Also populatesDublinCore.SUBJECT. -
INITIAL_AUTHOR
Name of the initial creator/author of a document -
LAST_AUTHOR
Name of the last (most recent) author of a document -
AUTHOR
Name of the principal author(s) of a document -
CREATION_DATE
When was the document created? -
SAVE_DATE
When was the document last saved? -
PRINT_DATE
When was the document last printed? -
SLIDE_COUNT
The number of Slides are there in the (presentation) document -
PAGE_COUNT
The number of Pages are there in the (paged) document -
PARAGRAPH_COUNT
The number of individual Paragraphs in the document -
LINE_COUNT
The number of lines in the document -
WORD_COUNT
The number of Words in the document -
CHARACTER_COUNT
The number of Characters in the document -
CHARACTER_COUNT_WITH_SPACES
The number of Characters in the document, including spaces -
TABLE_COUNT
The number of Tables in the document -
IMAGE_COUNT
The number of Images in the document -
OBJECT_COUNT
The number of Objects in the document. These are typically non-Image resources embedded in the document, such as other documents or non-Image media. -
MAPI_MESSAGE_CLASS
MAPI message class. What type of .msg/MAPI file is it? -
MAPI_SENT_BY_SERVER_TYPE
-
MAPI_FROM_REPRESENTING_NAME
-
MAPI_FROM_REPRESENTING_EMAIL
-
MAPI_MESSAGE_CLIENT_SUBMIT_TIME
-
PROG_ID
Embedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DC -
OCX_NAME
-