org.apache.tika.metadata
Interface Office


public interface Office

Office Document properties collection. These properties apply to Office / Productivity Documents of all forms, including (but not limited to) MS Office and OpenDocument formats. This is a logical collection of properties, which may be drawn from several different external definitions. Note that some of the legacy properties from the MSOffice collection still need to be migrated over.

Since:
Apache Tika 1.2

Field Summary
static Property AUTHOR
          Name of the principal author(s) of a document
static Property CHARACTER_COUNT
          The number of Characters in the document
static Property CHARACTER_COUNT_WITH_SPACES
          The number of Characters in the document, including spaces
static Property CREATION_DATE
          When was the document created?
static Property IMAGE_COUNT
          The number of Images in the document
static Property INITIAL_AUTHOR
          Name of the initial creator/author of a document
static Property KEYWORDS
          Keywords pertaining to a document.
static Property LAST_AUTHOR
          Name of the last (most recent) author of a document
static Property LINE_COUNT
          The number of lines in the document
static String NAMESPACE_URI_DOC_META
           
static Property OBJECT_COUNT
          The number of Objects in the document.
static Property PAGE_COUNT
          The number of Pages in the (paged) document
static Property PARAGRAPH_COUNT
          The number of individual Paragraphs in the document
static String PREFIX_DOC_META
           
static Property PRINT_DATE
          When was the document last printed?
static Property SAVE_DATE
          When was the document last saved?
static Property SLIDE_COUNT
          The number of Slides in the (presentation) document
static Property TABLE_COUNT
          The number of Tables in the document
static String USER_DEFINED_METADATA_NAME_PREFIX
          For user defined metadata entries in the document, what prefix should be attached to the key names.
static Property WORD_COUNT
          The number of Words in the document
 

Field Detail

NAMESPACE_URI_DOC_META

static final String NAMESPACE_URI_DOC_META
See Also:
Constant Field Values

PREFIX_DOC_META

static final String PREFIX_DOC_META
See Also:
Constant Field Values

USER_DEFINED_METADATA_NAME_PREFIX

static final String USER_DEFINED_METADATA_NAME_PREFIX
For user defined metadata entries in the document, what prefix should be attached to the key names. For example, <meta:user-defined meta:name="Info1">Text1</meta:user-defined> becomes custom:Info1=Text1

See Also:
Constant Field Values
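
The prefixing rule described above can be illustrated with a minimal sketch. It deliberately does not use the Tika API: the class CustomPrefixSketch, the method prefixUserDefined, and the literal "custom:" are assumptions made for illustration, with the prefix value inferred from the custom:Info1 example rather than read from the Constant Field Values page.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; not part of Tika.
public class CustomPrefixSketch {
    // Assumed value, inferred from the "custom:Info1" example in the field description.
    public static final String ASSUMED_PREFIX = "custom:";

    // Returns a copy of the user defined entries with the prefix attached to each key.
    public static Map<String, String> prefixUserDefined(Map<String, String> userDefined) {
        Map<String, String> result = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : userDefined.entrySet()) {
            result.put(ASSUMED_PREFIX + entry.getKey(), entry.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> userDefined = new LinkedHashMap<>();
        userDefined.put("Info1", "Text1");
        System.out.println(prefixUserDefined(userDefined)); // prints {custom:Info1=Text1}
    }
}
```

Prefixing keeps user defined keys from colliding with the standard property names in the same metadata set.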

KEYWORDS

static final Property KEYWORDS
Keywords pertaining to a document.


INITIAL_AUTHOR

static final Property INITIAL_AUTHOR
Name of the initial creator/author of a document


LAST_AUTHOR

static final Property LAST_AUTHOR
Name of the last (most recent) author of a document


AUTHOR

static final Property AUTHOR
Name of the principal author(s) of a document


CREATION_DATE

static final Property CREATION_DATE
When was the document created?


SAVE_DATE

static final Property SAVE_DATE
When was the document last saved?


PRINT_DATE

static final Property PRINT_DATE
When was the document last printed?
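
As a rough sketch of how the three date properties (CREATION_DATE, SAVE_DATE, PRINT_DATE) might be compared once their values have been read, the following uses only java.time. It assumes the values are available as ISO 8601 UTC strings; the class OfficeDateSketch, its methods, and the sample timestamps are hypothetical and are not part of Tika.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical illustration only; not part of Tika.
public class OfficeDateSketch {
    // True if the document was saved after it was last printed,
    // meaning a printed copy may no longer match the saved file.
    public static boolean savedAfterPrint(String saveDate, String printDate) {
        return Instant.parse(saveDate).isAfter(Instant.parse(printDate));
    }

    // Time elapsed between creation and the last save.
    public static Duration editingSpan(String creationDate, String saveDate) {
        return Duration.between(Instant.parse(creationDate), Instant.parse(saveDate));
    }

    public static void main(String[] args) {
        // Made-up sample values in ISO 8601 UTC form (an assumption).
        String created = "2012-03-01T09:00:00Z";
        String saved = "2012-03-02T10:30:00Z";
        String printed = "2012-03-01T17:00:00Z";

        System.out.println(savedAfterPrint(saved, printed));      // prints true
        System.out.println(editingSpan(created, saved).toHours()); // prints 25
    }
}
```

Any of the three values may be absent for a given document, so real code would need to check for missing values before parsing.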


SLIDE_COUNT

static final Property SLIDE_COUNT
The number of Slides in the (presentation) document


PAGE_COUNT

static final Property PAGE_COUNT
The number of Pages in the (paged) document


PARAGRAPH_COUNT

static final Property PARAGRAPH_COUNT
The number of individual Paragraphs in the document


LINE_COUNT

static final Property LINE_COUNT
The number of lines in the document


WORD_COUNT

static final Property WORD_COUNT
The number of Words in the document


CHARACTER_COUNT

static final Property CHARACTER_COUNT
The number of Characters in the document


CHARACTER_COUNT_WITH_SPACES

static final Property CHARACTER_COUNT_WITH_SPACES
The number of Characters in the document, including spaces
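
The relationship between CHARACTER_COUNT and CHARACTER_COUNT_WITH_SPACES can be shown with a small sketch. These values are normally recorded by the application that produced the document, and its exact counting rules (for example, which whitespace characters count as spaces) are not specified here, so the class CharacterCountSketch and its methods are hypothetical and only illustrate the distinction.

```java
// Hypothetical illustration only; not part of Tika.
public class CharacterCountSketch {
    // Counts every character (UTF-16 code unit), including spaces.
    public static int countWithSpaces(String text) {
        return text.length();
    }

    // Counts characters while skipping anything Java treats as whitespace.
    public static int countWithoutSpaces(String text) {
        int count = 0;
        for (int i = 0; i < text.length(); i++) {
            if (!Character.isWhitespace(text.charAt(i))) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        String text = "Apache Tika 1.2";
        System.out.println(countWithoutSpaces(text)); // prints 13
        System.out.println(countWithSpaces(text));    // prints 15
    }
}
```

The count including spaces is never smaller than the count without them, which can serve as a quick sanity check on extracted values.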


TABLE_COUNT

static final Property TABLE_COUNT
The number of Tables in the document


IMAGE_COUNT

static final Property IMAGE_COUNT
The number of Images in the document


OBJECT_COUNT

static final Property OBJECT_COUNT
The number of Objects in the document. These are typically non-Image resources embedded in the document, such as other documents or non-Image media.



Copyright © 2007-2012 The Apache Software Foundation. All Rights Reserved.