Interface Office


  • public interface Office
    Office Document properties collection. These properties apply to Office / Productivity Documents of all forms, including (but not limited to) MS Office and OpenDocument formats. This is a logical collection of properties, which may be drawn from a few different external definitions.
    Since:
    Apache Tika 1.2
    • Field Detail

      • USER_DEFINED_METADATA_NAME_PREFIX

        static final String USER_DEFINED_METADATA_NAME_PREFIX
        Prefix attached to the key names of user-defined metadata entries in the document. For example, a user-defined entry named Info1 with the value Text1 becomes custom:Info1=Text1
        See Also:
        Constant Field Values
      • INITIAL_AUTHOR

        static final Property INITIAL_AUTHOR
        Name of the initial creator/author of a document
      • LAST_AUTHOR

        static final Property LAST_AUTHOR
        Name of the last (most recent) author of a document
      • AUTHOR

        static final Property AUTHOR
        Name of the principal author(s) of a document
      • CREATION_DATE

        static final Property CREATION_DATE
        Date and time the document was created
      • SAVE_DATE

        static final Property SAVE_DATE
        Date and time the document was last saved
      • PRINT_DATE

        static final Property PRINT_DATE
        Date and time the document was last printed
      • SLIDE_COUNT

        static final Property SLIDE_COUNT
        The number of Slides in the (presentation) document
      • PAGE_COUNT

        static final Property PAGE_COUNT
        The number of Pages in the (paged) document
      • PARAGRAPH_COUNT

        static final Property PARAGRAPH_COUNT
        The number of individual Paragraphs in the document
      • LINE_COUNT

        static final Property LINE_COUNT
        The number of lines in the document
      • WORD_COUNT

        static final Property WORD_COUNT
        The number of Words in the document
      • CHARACTER_COUNT

        static final Property CHARACTER_COUNT
        The number of Characters in the document
      • CHARACTER_COUNT_WITH_SPACES

        static final Property CHARACTER_COUNT_WITH_SPACES
        The number of Characters in the document, including spaces
      • TABLE_COUNT

        static final Property TABLE_COUNT
        The number of Tables in the document
      • IMAGE_COUNT

        static final Property IMAGE_COUNT
        The number of Images in the document
      • OBJECT_COUNT

        static final Property OBJECT_COUNT
        The number of Objects in the document. These are typically non-Image resources embedded in the document, such as other documents or non-Image media.
      • MAPI_MESSAGE_CLASS

        static final Property MAPI_MESSAGE_CLASS
        MAPI message class, identifying the type of .msg/MAPI file
      • MAPI_SENT_BY_SERVER_TYPE

        static final Property MAPI_SENT_BY_SERVER_TYPE
      • MAPI_FROM_REPRESENTING_NAME

        static final Property MAPI_FROM_REPRESENTING_NAME
      • MAPI_FROM_REPRESENTING_EMAIL

        static final Property MAPI_FROM_REPRESENTING_EMAIL
      • MAPI_MESSAGE_CLIENT_SUBMIT_TIME

        static final Property MAPI_MESSAGE_CLIENT_SUBMIT_TIME
      • PROG_ID

        static final Property PROG_ID
        Embedded files may have a "progID" associated with them, such as Word.Document.12 or AcroExch.Document.DC
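
The key-prefixing behaviour described for USER_DEFINED_METADATA_NAME_PREFIX can be illustrated with a minimal, library-free sketch. It is not Tika's implementation: the class name `UserDefinedPrefixSketch` and the helper `prefixKeys` are hypothetical, and the prefix value `"custom:"` is an assumption inferred from the `custom:Info1=Text1` example in the field description.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class UserDefinedPrefixSketch {
    // Assumption: the value "custom:" is inferred from the example
    // "custom:Info1=Text1" in the field description; it is redeclared here
    // so the sketch compiles without the Tika library on the classpath.
    static final String USER_DEFINED_METADATA_NAME_PREFIX = "custom:";

    /**
     * Hypothetical helper: returns a copy of the user-defined entries
     * with the prefix attached to every key. Values are left unchanged.
     */
    static Map<String, String> prefixKeys(Map<String, String> userDefined) {
        Map<String, String> prefixed = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : userDefined.entrySet()) {
            prefixed.put(USER_DEFINED_METADATA_NAME_PREFIX + entry.getKey(),
                         entry.getValue());
        }
        return prefixed;
    }

    public static void main(String[] args) {
        // A user-defined entry named Info1 with the value Text1
        Map<String, String> userDefined = new LinkedHashMap<>();
        userDefined.put("Info1", "Text1");

        System.out.println(prefixKeys(userDefined)); // prints {custom:Info1=Text1}
    }
}
```

Prefixing the keys keeps user-defined entries in their own namespace, so a custom entry that happens to share a name with a standard property cannot overwrite it.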