Package | Description |
---|---|
org.apache.tika.batch | |
org.apache.tika.batch.fs | |
org.apache.tika.embedder | |
org.apache.tika.eval.metadata | |
org.apache.tika.metadata |
Multi-valued metadata container, and set of constant metadata fields.
|
org.apache.tika.parser |
Tika parsers.
|
org.apache.tika.parser.csv | |
org.apache.tika.parser.executable | |
org.apache.tika.parser.image | |
org.apache.tika.parser.mail | |
org.apache.tika.parser.microsoft | |
org.apache.tika.parser.xml | |
org.apache.tika.sax |
SAX utilities.
|
org.apache.tika.utils |
Utilities.
|
org.apache.tika.xmp | |
org.apache.tika.xmp.convert |
Modifier and Type | Field and Description |
---|---|
static Property |
FileResource.FILE_EXTENSION |
Modifier and Type | Field and Description |
---|---|
static Property |
FSProperties.FS_REL_PATH
File's relative path (including file name) from a given source root
|
Modifier and Type | Method and Description |
---|---|
Map<Property,String[]> |
ExternalEmbedder.getMetadataCommandArguments()
Gets the map of Metadata keys to command line parameters.
|
Modifier and Type | Method and Description |
---|---|
void |
ExternalEmbedder.setMetadataCommandArguments(Map<Property,String[]> arguments)
Sets the map of Metadata keys to command line parameters.
|
Modifier and Type | Field and Description |
---|---|
static Property |
TikaEvalMetadataFilter.LANGUAGE |
static Property |
TikaEvalMetadataFilter.LANGUAGE_CONFIDENCE |
static Property |
TikaEvalMetadataFilter.NUM_ALPHA_TOKENS |
static Property |
TikaEvalMetadataFilter.NUM_TOKENS |
static Property |
TikaEvalMetadataFilter.NUM_UNIQUE_ALPHA_TOKENS |
static Property |
TikaEvalMetadataFilter.NUM_UNIQUE_TOKENS |
static Property |
TikaEvalMetadataFilter.OUT_OF_VOCABULARY |
Modifier and Type | Field and Description |
---|---|
static Property |
XMP.ABOUT
Unordered text strings of advisories.
|
static Property |
XMPDM.ABS_PEAK_AUDIO_FILE_PATH
"The absolute path to the file's peak audio file.
|
static Property |
PDF.ACTION_TRIGGER
This specifies where an action or destination would be found/triggered
in the document: on document open, before close, etc.
|
static Property |
IPTC.ADDITIONAL_MODEL_INFO
Information about the ethnicity and other facets of the model(s) in a
model-released image.
|
static Property |
XMP.ADVISORY
Unordered text strings of advisories.
|
static Property |
XMPDM.ALBUM
"The name of the album."
|
static Property |
XMPDM.ALBUM_ARTIST
"The name of the album artist or group for compilation albums."
|
static Property |
XMPDM.ALT_TAPE_NAME
"An alternative tape name, set via the project window or timecode
dialog in Premiere.
|
static Property |
Geographic.ALTITUDE
The WGS84 Altitude of the Point
|
static Property |
TikaCoreProperties.ALTITUDE |
static Property |
OfficeOpenXMLExtended.APP_VERSION |
static Property |
OfficeOpenXMLExtended.APPLICATION |
static Property |
XMPDM.ARTIST
"The name of the artist or artists."
|
static Property |
IPTC.ARTWORK_OR_OBJECT
A set of metadata about artwork or an object in the item
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_COPYRIGHT_NOTICE
Contains any necessary copyright notice for claiming the intellectual
property for artwork or an object in the image and should identify the
current owner of the copyright of this work with associated intellectual
property rights.
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_CREATOR
Contains the name of the artist who has created artwork or an object in the image.
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_DATE_CREATED
Designates the date and optionally the time the artwork or object in the
image was created.
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_SOURCE
The organisation or body holding and registering the artwork or object in
the image for inventory purposes.
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_SOURCE_INVENTORY_NUMBER
The inventory number issued by the organisation or body holding and
registering the artwork or object in the image.
|
static Property |
IPTC.ARTWORK_OR_OBJECT_DETAIL_TITLE
A reference for the artwork or object in the image.
|
static Property |
AccessPermissions.ASSEMBLE_DOCUMENT
Can the user insert/rotate/delete pages.
|
static Property |
XMPDM.AUDIO_CHANNEL_TYPE
"The audio channel type."
|
static Property |
XMPDM.AUDIO_COMPRESSOR
"The audio compression used.
|
static Property |
XMPDM.AUDIO_MOD_DATE
"The date and time when the audio was last modified."
|
static Property |
XMPDM.AUDIO_SAMPLE_RATE
"The audio sample rate.
|
static Property |
XMPDM.AUDIO_SAMPLE_TYPE
"The audio sample type."
|
static Property |
Office.AUTHOR
Name of the principal author(s) of a document
|
static Property |
Photoshop.AUTHORS_POSITION |
static Property |
TIFF.BITS_PER_SAMPLE
"Number of bits per component in each channel."
|
static Property |
QuattroPro.BUILD
Build.
|
static Property |
AccessPermissions.CAN_MODIFY
Can any modifications be made to the document
|
static Property |
AccessPermissions.CAN_MODIFY_ANNOTATIONS
Can the user modify annotations
|
static Property |
AccessPermissions.CAN_PRINT
Can the user print the document
|
static Property |
AccessPermissions.CAN_PRINT_DEGRADED
Can the user print an image-degraded version of the document.
|
static Property |
Photoshop.CAPTION_WRITER |
static Property |
OfficeOpenXMLCore.CATEGORY
A categorization of the content of this package.
|
static Property |
IPTC.CATEGORY
Deprecated.
|
static Property |
Photoshop.CATEGORY |
static Property |
XMPRights.CERTIFICATE
A Web URL for a rights management certificate.
|
static Property |
MSOffice.CHARACTER_COUNT
Deprecated.
|
static Property |
Office.CHARACTER_COUNT
The number of Characters in the document
|
static Property |
MSOffice.CHARACTER_COUNT_WITH_SPACES
Deprecated.
|
static Property |
Office.CHARACTER_COUNT_WITH_SPACES
The number of Characters in the document, including spaces
|
static Property |
PDF.CHARACTERS_PER_PAGE |
static Property |
IPTC.CITY
Name of the city the content is focussing on -- either the place shown
in visual media or referenced by text or audio media.
|
static Property |
Photoshop.CITY |
static Property |
Photoshop.COLOR_MODE |
static Property |
Database.COLUMN_COUNT |
static Property |
Database.COLUMN_NAME |
static Property |
OfficeOpenXMLExtended.COMMENTS |
static Property |
TikaCoreProperties.COMMENTS |
static Property |
OfficeOpenXMLExtended.COMPANY |
static Property |
XMPDM.COMPILATION
"An album created by various artists."
|
static Property |
XMPDM.COMPOSER
"The composer's name."
|
static Property |
IPTC.CONTACT_INFO_ADDRESS
The contact information address part.
|
static Property |
IPTC.CONTACT_INFO_CITY
The contact information city part.
|
static Property |
IPTC.CONTACT_INFO_COUNTRY
The contact information country part.
|
static Property |
IPTC.CONTACT_INFO_EMAIL
The contact information email address part.
|
static Property |
IPTC.CONTACT_INFO_PHONE
The contact information phone number part.
|
static Property |
IPTC.CONTACT_INFO_POSTAL_CODE
The contact information part denoting the local postal code.
|
static Property |
IPTC.CONTACT_INFO_STATE_PROVINCE
The contact information part denoting regional information such as state or province.
|
static Property |
IPTC.CONTACT_INFO_WEB_URL
The contact information web address part.
|
static Property |
OfficeOpenXMLCore.CONTENT_STATUS
The status of the content.
|
static Property |
TikaCoreProperties.CONTENT_TYPE_HINT
This is currently used to identify Content-Type that may be
included within a document, such as in html documents
(e.g.
|
static Property |
TikaCoreProperties.CONTENT_TYPE_OVERRIDE |
static Property |
DublinCore.CONTRIBUTOR
An entity responsible for making contributions to the content of the
resource.
|
static Property |
TikaCoreProperties.CONTRIBUTOR |
static Property |
IPTC.CONTROLLED_VOCABULARY_TERM
A term to describe the content of the image by a value from a Controlled
Vocabulary.
|
static Property |
XMPDM.COPYRIGHT
"The copyright information."
|
static Property |
IPTC.COPYRIGHT_NOTICE
Contains any necessary copyright notice for claiming the intellectual
property for this item and should identify the current owner of the
copyright for the item.
|
static Property |
IPTC.COPYRIGHT_OWNER
Owner or owners of the copyright in the licensed image.
|
static Property |
IPTC.COPYRIGHT_OWNER_ID
The ID of the owner or owners of the copyright in the licensed image.
|
static Property |
IPTC.COPYRIGHT_OWNER_NAME
The name of the owner or owners of the copyright in the licensed image.
|
static Property |
IPTC.COUNTRY
Full name of the country the content is focussing on -- either the
country shown in visual media or referenced in text or audio media.
|
static Property |
Photoshop.COUNTRY |
static Property |
IPTC.COUNTRY_CODE
Code of the country the content is focussing on -- either the country
shown in visual media or referenced in text or audio media.
|
static Property |
DublinCore.COVERAGE
The extent or scope of the content of the resource.
|
static Property |
TikaCoreProperties.COVERAGE |
static Property |
XMP.CREATE_DATE
The date and time the resource was created.
|
static Property |
DublinCore.CREATED
Date of creation of the resource.
|
static Property |
TikaCoreProperties.CREATED |
static Property |
MSOffice.CREATION_DATE
Deprecated.
|
static Property |
Office.CREATION_DATE
When was the document created?
|
static Property |
IPTC.CREATOR
Contains the name of the person who created the content of this item, a
photographer for photos, a graphic artist for graphics, or a writer for
textual news, but in cases where the photographer should not be
identified the name of a company or organisation may be appropriate.
|
static Property |
DublinCore.CREATOR
An entity primarily responsible for making the content of the resource.
|
static Property |
TikaCoreProperties.CREATOR |
static Property |
XMP.CREATOR_TOOL
The name of the first known tool used to create the resource.
|
static Property |
TikaCoreProperties.CREATOR_TOOL |
static Property |
IPTC.CREATORS_CONTACT_INFO
The creator's contact information provides all necessary information to
get in contact with the creator of this item and comprises a set of
sub-properties for proper addressing.
|
static Property |
IPTC.CREATORS_JOB_TITLE
Contains the job title of the person who created the content of this
item.
|
static Property |
Photoshop.CREDIT |
static Property |
IPTC.CREDIT_LINE
The credit to person(s) and/or organisation(s) required by the supplier
of the item to be used when published.
|
static Property |
Metadata.DATE
Deprecated.
use TikaCoreProperties#CREATED
|
static Property |
DublinCore.DATE
A date associated with an event in the life cycle of the resource.
|
static Property |
IPTC.DATE_CREATED
Designates the date and optionally the time the intellectual content was
created rather than the date of the creation of the physical
representation.
|
static Property |
Photoshop.DATE_CREATED |
static Property |
XMPMM.DERIVED_FROM_DOCUMENTID
Document id for the document that this document
was derived from
|
static Property |
XMPMM.DERIVED_FROM_INSTANCEID
Instance id for the document instance that this
document was derived from
|
static Property |
IPTC.DESCRIPTION
A textual description, including captions, of the item's content,
particularly used where the object is not text.
|
static Property |
DublinCore.DESCRIPTION
An account of the content of the resource.
|
static Property |
TikaCoreProperties.DESCRIPTION |
static Property |
IPTC.DESCRIPTION_WRITER
Identifier or the name of the person involved in writing, editing or
correcting the description of the content.
|
static Property |
IPTC.DIGITAL_IMAGE_GUID
Globally unique identifier for the item.
|
static Property |
IPTC.DIGITAL_SOURCE_FILE_TYPE
Deprecated.
|
static Property |
IPTC.DIGITAL_SOURCE_TYPE
The type of the source of this digital image
|
static Property |
XMPDM.DISC_NUMBER
"The disc number for part of an album set."
|
static Property |
PDF.DOC_INFO_CREATED |
static Property |
PDF.DOC_INFO_CREATOR |
static Property |
PDF.DOC_INFO_CREATOR_TOOL |
static Property |
PDF.DOC_INFO_KEY_WORDS |
static Property |
PDF.DOC_INFO_MODIFICATION_DATE |
static Property |
PDF.DOC_INFO_PRODUCER |
static Property |
PDF.DOC_INFO_SUBJECT |
static Property |
PDF.DOC_INFO_TITLE |
static Property |
PDF.DOC_INFO_TRAPPED |
static Property |
OfficeOpenXMLExtended.DOC_SECURITY |
static Property |
OfficeOpenXMLExtended.DOC_SECURITY_STRING |
static Property |
XMPMM.DOCUMENTID
The common identifier for all versions and renditions of a resource.
|
static Property |
XMPDM.DURATION
"The duration of the media file."
|
static Property |
RTFMetadata.EMB_APP_VERSION
if an application and version is given as part of the
embedded object, this is the literal string
|
static Property |
RTFMetadata.EMB_CLASS |
static Property |
RTFMetadata.EMB_ITEM |
static Property |
RTFMetadata.EMB_TOPIC |
static Property |
TikaCoreProperties.EMBEDDED_RESOURCE_TYPE
Embedded resource type property
|
static Property |
WordPerfect.ENCRYPTED
Is encrypted?.
|
static Property |
XMPDM.ENGINEER
"The engineer's name."
|
static Property |
TIFF.EQUIPMENT_MAKE
"Manufacturer of the recording equipment."
|
static Property |
TIFF.EQUIPMENT_MODEL
"Model name or number of the recording equipment."
|
static Property |
IPTC.EVENT
Names or describes the specific event the content relates to.
|
static Property |
TIFF.EXIF_PAGE_COUNT |
static Property |
TIFF.EXPOSURE_TIME
"Exposure time in seconds."
|
static Property |
AccessPermissions.EXTRACT_CONTENT
Should content be extracted, generally.
|
static Property |
AccessPermissions.EXTRACT_FOR_ACCESSIBILITY
Should content be extracted for the purposes
of accessibility.
|
static Property |
TIFF.F_NUMBER
"F-Number."
The f-number is the focal length divided by the "effective" aperture
diameter.
|
static Property |
XMPDM.FILE_DATA_RATE
"The file data rate in megabytes per second.
|
static Property |
WordPerfect.FILE_ID
File identifier.
|
static Property |
WordPerfect.FILE_SIZE
File size as defined in document header.
|
static Property |
WordPerfect.FILE_TYPE
File type.
|
static Property |
AccessPermissions.FILL_IN_FORM
Can the user fill in a form
|
static Property |
TIFF.FLASH_FIRED
Did the Flash fire when taking this image?
|
static Property |
TIFF.FOCAL_LENGTH
"Focal length of the lens, in millimeters."
|
static Property |
Font.FONT_NAME
Basic name of a font used in a file
|
static Property |
DublinCore.FORMAT
Typically, Format may include the media-type or dimensions of the
resource.
|
static Property |
TikaCoreProperties.FORMAT |
static Property |
XMPDM.GENRE
"The name of the genre."
|
static Property |
PDF.HAS_ACROFORM_FIELDS
Has > 0 AcroForm fields
|
static Property |
PDF.HAS_MARKED_CONTENT |
static Property |
TikaCoreProperties.HAS_SIGNATURE |
static Property |
PDF.HAS_XFA
Has XFA
|
static Property |
PDF.HAS_XMP
Has XMP, whether or not it is valid
|
static Property |
IPTC.HEADLINE
A brief synopsis of the caption.
|
static Property |
Photoshop.HEADLINE |
static Property |
OfficeOpenXMLExtended.HIDDEN_SLIDES |
static Property |
XMPMM.HISTORY_ACTION
Action in the XMPMM's history section
|
static Property |
XMPMM.HISTORY_EVENT_INSTANCEID
Instance id in the XMPMM's history section
|
static Property |
XMPMM.HISTORY_SOFTWARE_AGENT
Software agent that created the action in the XMPMM's
history section
|
static Property |
XMPMM.HISTORY_WHEN
When the action occurred in the XMPMM's history section
|
static Property |
QuattroPro.ID
ID.
|
static Property |
XMP.IDENTIFIER
An unordered array of text strings that unambiguously identify the resource
within a given context.
|
static Property |
DublinCore.IDENTIFIER
Recommended best practice is to identify the resource by means of
a string or number conforming to a formal identification system.
|
static Property |
TikaCoreProperties.IDENTIFIER |
static Property |
MSOffice.IMAGE_COUNT
Deprecated.
|
static Property |
Office.IMAGE_COUNT
The number of Images in the document
|
static Property |
IPTC.IMAGE_CREATOR
Creator or creators of the image.
|
static Property |
IPTC.IMAGE_CREATOR_ID
The ID of the creator or creators of the image.
|
static Property |
IPTC.IMAGE_CREATOR_NAME
The name of the creator or creators of the image.
|
static Property |
TIFF.IMAGE_LENGTH
"Image height in pixels."
|
static Property |
IPTC.IMAGE_REGISTRY_ENTRY
Both a Registry Item Id and a Registry Organisation Id to record any
registration of this item with a registry.
|
static Property |
IPTC.IMAGE_SUPPLIER
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
|
static Property |
IPTC.IMAGE_SUPPLIER_ID
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
|
static Property |
IPTC.IMAGE_SUPPLIER_IMAGE_ID
Optional identifier assigned by the Image Supplier to the image.
|
static Property |
IPTC.IMAGE_SUPPLIER_NAME
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
|
static Property |
TIFF.IMAGE_WIDTH
"Image width in pixels."
|
static Property |
Office.INITIAL_AUTHOR
Name of the initial creator/author of a document
|
static Property |
XMPMM.INSTANCEID
An identifier for a specific incarnation of a resource, updated
each time a file is saved.
|
static Property |
IPTC.INSTRUCTIONS
Any of a number of instructions from the provider or creator to the
receiver of the item.
|
static Property |
Photoshop.INSTRUCTIONS |
static Property |
XMPDM.INSTRUMENT
"The musical instrument."
|
static Property |
IPTC.INTELLECTUAL_GENRE
Describes the nature, intellectual, artistic or journalistic
characteristic of a item, not specifically its content.
|
static Property |
IPTC.IPTC_LAST_EDITED
The date and optionally time when any of the IPTC photo metadata fields
has been last edited
|
static Property |
PDF.IS_ENCRYPTED |
static Property |
TIFF.ISO_SPEED_RATINGS
"ISO Speed and ISO Latitude of the input device as specified in ISO 12232"
|
static Property |
IPTC.JOB_ID
Number or identifier for the purpose of improved workflow handling.
|
static Property |
XMPDM.KEY
"The audio's musical key."
|
static Property |
IPTC.KEYWORDS
Keywords to express the subject of the content.
|
static Property |
Office.KEYWORDS
Keywords pertaining to a document.
|
static Property |
TikaCoreProperties.KEYWORDS
DublinCore.SUBJECT ; should include both subject and keywords
if a document format has both. |
static Property |
XMP.LABEL
A word or short phrase that identifies a resource as a member of a userdefined collection.
|
static Property |
DublinCore.LANGUAGE
A language of the intellectual content of the resource.
|
static Property |
TikaCoreProperties.LANGUAGE |
static Property |
Office.LAST_AUTHOR
Name of the last (most recent) author of a document
|
static Property |
HttpHeaders.LAST_MODIFIED |
static Property |
OfficeOpenXMLCore.LAST_MODIFIED_BY
The user who performed the last modification.
|
static Property |
OfficeOpenXMLCore.LAST_PRINTED
The date and time of the last printing.
|
static Property |
MSOffice.LAST_PRINTED
Deprecated.
|
static Property |
MSOffice.LAST_SAVED
Deprecated.
|
static Property |
Geographic.LATITUDE
The WGS84 Latitude of the Point
|
static Property |
TikaCoreProperties.LATITUDE |
static Property |
IPTC.LICENSOR
A person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_CITY
The city of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_COUNTRY
The country of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_EMAIL
The email of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_EXTENDED_ADDRESS
The extended address of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_ID
The ID of the person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_NAME
The name of the person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_POSTAL_CODE
The postal code of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_REGION
The region of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_STREET_ADDRESS
The street address of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_TELEPHONE_1
The phone number of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_TELEPHONE_2
The phone number of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
IPTC.LICENSOR_URL
The URL of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
|
static Property |
MSOffice.LINE_COUNT
Deprecated.
|
static Property |
Office.LINE_COUNT
The number of lines in the document
|
static Property |
IPTC.LOCATION_CREATED
The location the content of the item was created.
|
static Property |
IPTC.LOCATION_CREATED_CITY
Name of the city of a location.
|
static Property |
IPTC.LOCATION_CREATED_COUNTRY_CODE
The ISO code of a country of a location.
|
static Property |
IPTC.LOCATION_CREATED_COUNTRY_NAME
The name of a country of a location.
|
static Property |
IPTC.LOCATION_CREATED_PROVINCE_OR_STATE
The name of a subregion of a country - a province or state - of a
location.
|
static Property |
IPTC.LOCATION_CREATED_SUBLOCATION
Name of a sublocation.
|
static Property |
IPTC.LOCATION_CREATED_WORLD_REGION
The name of a world region of a location.
|
static Property |
IPTC.LOCATION_SHOWN
A location the content of the item is about.
|
static Property |
IPTC.LOCATION_SHOWN_CITY
Name of the city of a location.
|
static Property |
IPTC.LOCATION_SHOWN_COUNTRY_CODE
The ISO code of a country of a location.
|
static Property |
IPTC.LOCATION_SHOWN_COUNTRY_NAME
The name of a country of a location.
|
static Property |
IPTC.LOCATION_SHOWN_PROVINCE_OR_STATE
The name of a subregion of a country - a province or state - of a
location.
|
static Property |
IPTC.LOCATION_SHOWN_SUBLOCATION
Name of a sublocation.
|
static Property |
IPTC.LOCATION_SHOWN_WORLD_REGION
The name of a world region of a location.
|
static Property |
XMPDM.LOG_COMMENT
"User's log comments."
|
static Property |
Geographic.LONGITUDE
The WGS84 Longitude of the Point
|
static Property |
TikaCoreProperties.LONGITUDE |
static Property |
XMPDM.LOOP
"When true, the clip can be looped seamlessly."
|
static Property |
QuattroPro.LOWEST_VERSION
Lowest version.
|
static Property |
WordPerfect.MAJOR_VERSION
Major version.
|
static Property |
OfficeOpenXMLExtended.MANAGER |
static Property |
Office.MAPI_FROM_REPRESENTING_EMAIL |
static Property |
Office.MAPI_FROM_REPRESENTING_NAME |
static Property |
Office.MAPI_MESSAGE_CLASS
MAPI message class.
|
static Property |
Office.MAPI_MESSAGE_CLIENT_SUBMIT_TIME |
static Property |
Office.MAPI_SENT_BY_SERVER_TYPE |
static Property |
XMPRights.MARKED
When true, indicates that this is a rights-managed resource.
|
static Property |
IPTC.MAX_AVAIL_HEIGHT
The maximum available height in pixels of the original photo from which
this photo has been derived by downsizing.
|
static Property |
IPTC.MAX_AVAIL_WIDTH
The maximum available width in pixels of the original photo from which
this photo has been derived by downsizing.
|
static Property |
Message.MESSAGE_BCC_DISPLAY_NAME |
static Property |
Message.MESSAGE_BCC_EMAIL
Where possible, this records the email value in the bcc field.
|
static Property |
Message.MESSAGE_BCC_NAME
In Outlook messages, there are sometimes separate fields for "bcc-name" and
"bcc-display-name" name.
|
static Property |
Message.MESSAGE_CC_DISPLAY_NAME |
static Property |
Message.MESSAGE_CC_EMAIL
Where possible, this records the email value in the cc field.
|
static Property |
Message.MESSAGE_CC_NAME
In Outlook messages, there are sometimes separate fields for "cc-name" and
"cc-display-name" name.
|
static Property |
Message.MESSAGE_FROM_EMAIL
Where possible, this records the value from the name field.
|
static Property |
Message.MESSAGE_FROM_NAME
Where possible, this records the value from the name field.
|
static Property |
Message.MESSAGE_TO_DISPLAY_NAME |
static Property |
Message.MESSAGE_TO_EMAIL
Where possible, this records the email value in the to field.
|
static Property |
Message.MESSAGE_TO_NAME
In Outlook messages, there are sometimes separate fields for "to-name" and
"to-display-name" name.
|
static Property |
XMP.METADATA_DATE
The date and time that any metadata for this resource was last changed.
|
static Property |
TikaCoreProperties.METADATA_DATE |
static Property |
XMPDM.METADATA_MOD_DATE
"The date and time when the metadata was last modified."
|
static Property |
IPTC.MINOR_MODEL_AGE_DISCLOSURE
Age of the youngest model pictured in the image, at the time that the
image was made.
|
static Property |
WordPerfect.MINOR_VERSION
Minor version.
|
static Property |
IPTC.MODEL_AGE
Age of the human model(s) at the time this image was taken in a model
released image.
|
static Property |
IPTC.MODEL_RELEASE_ID
Optional identifier associated with each Model Release.
|
static Property |
IPTC.MODEL_RELEASE_STATUS
Summarizes the availability and scope of model releases authorizing usage
of the likenesses of persons appearing in the photograph.
|
static Property |
DublinCore.MODIFIED
Date on which the resource was changed.
|
static Property |
TikaCoreProperties.MODIFIED |
static Property |
TikaCoreProperties.MODIFIER |
static Property |
XMP.MODIFY_DATE
The date and time the resource was last modified.
|
static Property |
PagedText.N_PAGES
"The number of pages in the document (including any in contained
documents)."
|
static Property |
XMP.NICKNAME
A word or short phrase that represents the nick name fo the file
|
static Property |
OfficeOpenXMLExtended.NOTES |
static Property |
XMPDM.NUMBER_OF_BEATS
"The number of beats."
|
static Property |
MSOffice.OBJECT_COUNT
Deprecated.
|
static Property |
Office.OBJECT_COUNT
The number of Objects in the document.
|
static Property |
IPTC.ORGANISATION_CODE
A set of metadata about artwork or an object in the item
|
static Property |
IPTC.ORGANISATION_NAME
Name of the organisation or company which is featured in the content.
|
static Property |
TIFF.ORIENTATION
"The Orientation of the image."
1 = 0th row at top, 0th column at left
2 = 0th row at top, 0th column at right
3 = 0th row at bottom, 0th column at right
4 = 0th row at bottom, 0th column at left
5 = 0th row at left, 0th column at top
6 = 0th row at right, 0th column at top
7 = 0th row at right, 0th column at bottom
8 = 0th row at left, 0th column at bottom
|
static Property |
TIFF.ORIGINAL_DATE
"Date and time when original image was generated"
|
static Property |
XMPMM.ORIGINAL_DOCUMENTID
The common identifier for the original resource from which
the current resource is derived.
|
static Property |
TikaCoreProperties.ORIGINAL_RESOURCE_NAME
Some file formats can store information about their original
file name/location or about their attachment's original file name/location.
|
static Property |
XMPRights.OWNER
A list of legal owners of the resource.
|
static Property |
MSOffice.PAGE_COUNT
Deprecated.
|
static Property |
Office.PAGE_COUNT
The number of Pages are there in the (paged) document
|
static Property |
MSOffice.PARAGRAPH_COUNT
Deprecated.
|
static Property |
Office.PARAGRAPH_COUNT
The number of individual Paragraphs in the document
|
static Property |
PDF.PDF_EXTENSION_VERSION |
static Property |
PDF.PDF_VERSION |
static Property |
PDF.PDFA_VERSION |
static Property |
PDF.PDFAID_CONFORMANCE |
static Property |
PDF.PDFAID_PART |
static Property |
IPTC.PERSON
Name of a person the content of the item is about.
|
static Property |
IPTC.PLUS_VERSION
The version number of the PLUS standards in place at the time of the
transaction.
|
static Property |
PDF.PREFLIGHT_ICC_PROFILE |
static Property |
PDF.PREFLIGHT_INCREMENTAL_UPDATES |
static Property |
PDF.PREFLIGHT_IS_LINEARIZED |
static Property |
PDF.PREFLIGHT_IS_VALID |
static Property |
PDF.PREFLIGHT_PARSE_EXCEPTION |
static Property |
PDF.PREFLIGHT_SPECIFICATION |
static Property |
PDF.PREFLIGHT_TRAILER_COUNT |
static Property |
PDF.PREFLIGHT_VALIDATION_ERRORS |
static Property |
PDF.PREFLIGHT_XREF_TYPE |
static Property |
OfficeOpenXMLExtended.PRESENTATION_FORMAT |
static Property |
Office.PRINT_DATE
When was the document last printed?
|
static Property |
TikaCoreProperties.PRINT_DATE |
static Property |
PDF.PRODUCER |
static Property |
WordPerfect.PRODUCT_TYPE
Product type.
|
static Property[] |
IPTC.PROPERTY_GROUP_IPTC_CORE |
static Property[] |
IPTC.PROPERTY_GROUP_IPTC_EXT |
static Property |
IPTC.PROPERTY_RELEASE_ID
Optional identifier associated with each Property Release.
|
static Property |
IPTC.PROPERTY_RELEASE_STATUS
Summarises the availability and scope of property releases authorizing
usage of the properties appearing in the photograph.
|
static Property |
IPTC.PROVINCE_OR_STATE
Name of the subregion of a country -- either called province or state or
anything else -- the content is focussing on -- either the subregion
shown in visual media or referenced by text or audio media.
|
static Property |
DublinCore.PUBLISHER
An entity responsible for making the resource available.
|
static Property |
TikaCoreProperties.PUBLISHER |
static Property |
XMPDM.PULL_DOWN
"The sampling phase of film to be converted to video (pull-down)."
|
static Property |
XMP.RATING
A user-assigned rating for this file.
|
static Property |
TikaCoreProperties.RATING |
static Property |
IPTC.REGISTRY_ENTRY_CREATED_ITEM_ID
A unique identifier created by a registry and applied by the creator of
the item.
|
static Property |
IPTC.REGISTRY_ENTRY_CREATED_ORGANISATION_ID
An identifier for the registry which issued the corresponding Registry Image Id.
|
static Property |
DublinCore.RELATION
A reference to a related resource.
|
static Property |
TikaCoreProperties.RELATION |
static Property |
XMPDM.RELATIVE_PEAK_AUDIO_FILE_PATH
"The relative path to the file's peak audio file.
|
static Property |
XMPDM.RELEASE_DATE
"The date the title was released."
|
static Property |
XMPMM.RENDITION_CLASS
The rendition class name for this resource.
|
static Property |
XMPMM.RENDITION_PARAMS
Can be used to provide additional rendition parameters that
are too complex or verbose to encode in xmpMM:RenditionClass
|
static Property |
TIFF.RESOLUTION_HORIZONTAL
"Horizontal resolution in pixels per unit."
|
static Property |
TIFF.RESOLUTION_UNIT
"Units used for Horizontal and Vertical Resolutions."
One of "Inch" or "cm"
|
static Property |
TIFF.RESOLUTION_VERTICAL
"Vertical resolution in pixels per unit."
|
static Property |
OfficeOpenXMLCore.REVISION
The revision number.
|
static Property |
DublinCore.RIGHTS
Information about rights held in and over the resource.
|
static Property |
TikaCoreProperties.RIGHTS |
static Property |
IPTC.RIGHTS_USAGE_TERMS
The licensing parameters of the item expressed in free-text.
|
static Property |
Database.ROW_COUNT |
static Property |
TIFF.SAMPLES_PER_PIXEL
"Number of components per pixel."
|
static Property |
Office.SAVE_DATE
When was the document last saved?
|
static Property |
XMPDM.SCALE_TYPE
"The musical scale used in the music.
|
static Property |
XMPDM.SCENE
"The name of the scene."
|
static Property |
IPTC.SCENE_CODE
Describes the scene of a news content.
|
static Property |
XMPIdq.SCHEME
A qualifier providing the name of the formal identification
scheme used for an item in the xmp:Identifier array.
|
static Property |
HTML.SCRIPT_SOURCE
If a script element contains a src value, this value
is set in the embedded document's metadata
|
static Property |
XMPDM.SHOT_DATE
"The date and time when the video was shot."
|
static Property |
XMPDM.SHOT_LOCATION
"The name of the location where the video was shot.
|
static Property |
XMPDM.SHOT_NAME
"The name of the shot or take."
|
static Property |
MSOffice.SLIDE_COUNT
Deprecated.
|
static Property |
Office.SLIDE_COUNT
The number of Slides are there in the (presentation) document
|
static Property |
TIFF.SOFTWARE
"Software or firmware used to generate the image."
|
static Property |
IPTC.SOURCE
Identifies the original owner of the copyright for the intellectual
content of the item.
|
static Property |
DublinCore.SOURCE
A reference to a resource from which the present resource is derived.
|
static Property |
Photoshop.SOURCE |
static Property |
TikaCoreProperties.SOURCE |
static Property |
XMPDM.SPEAKER_PLACEMENT
"A description of the speaker angles from center front in degrees.
|
static Property |
Photoshop.STATE |
static Property |
XMPDM.STRETCH_MODE
"The audio stretch mode."
|
static Property |
OfficeOpenXMLCore.SUBJECT
The document's subject.
|
static Property |
DublinCore.SUBJECT
The topic of the content of the resource.
|
static Property |
IPTC.SUBJECT_CODE
Specifies one or more Subjects from the IPTC Subject-NewsCodes taxonomy
to categorise the content.
|
static Property |
IPTC.SUBLOCATION
Name of a sublocation the content is focussing on -- either the
location shown in visual media or referenced by text or audio media.
|
static Property |
IPTC.SUPPLEMENTAL_CATEGORIES
Deprecated.
|
static Property |
Photoshop.SUPPLEMENTAL_CATEGORIES |
static Property |
MSOffice.TABLE_COUNT
Deprecated.
|
static Property |
Office.TABLE_COUNT
The number of Tables in the document
|
static Property |
Database.TABLE_NAME |
static Property |
XMPDM.TAPE_NAME
"The name of the tape from which the clip was captured, as set during
the capture process."
|
static Property |
OfficeOpenXMLExtended.TEMPLATE |
static Property |
XMPDM.TEMPO
"The audio's tempo."
|
static Property |
RTFMetadata.THUMBNAIL
if set to true, this means that an image file is probably a "thumbnail"
any time a pict/emf/wmf is in an object
|
static Property |
TikaCoreProperties.TIKA_META_EXCEPTION_EMBEDDED_STREAM
Use this to store exceptions caught while trying to read the
stream of an embedded resource.
|
static Property |
TikaCoreProperties.TIKA_META_EXCEPTION_WARNING
Use this to store exceptions caught during a parse that are
non-fatal, e.g.
|
static Property |
XMPDM.TIME_SIGNATURE
"The time signature of the music."
|
static Property |
IPTC.TITLE
A shorthand reference for the item.
|
static Property |
DublinCore.TITLE
A name given to the resource.
|
static Property |
TikaCoreProperties.TITLE |
static Property |
OfficeOpenXMLExtended.TOTAL_TIME |
static Property |
XMPDM.TRACK_NUMBER
"A numeric value indicating the order of the audio file within its
original recording."
|
static Property |
TikaCoreProperties.TRANSITION_KEYWORDS_TO_DC_SUBJECT
Deprecated.
use TikaCoreProperties#KEYWORDS
|
static Property |
TikaCoreProperties.TRANSITION_SUBJECT_TO_DC_DESCRIPTION
Deprecated.
use TikaCoreProperties#DESCRIPTION
|
static Property |
TikaCoreProperties.TRANSITION_SUBJECT_TO_DC_TITLE
Deprecated.
use TikaCoreProperties#TITLE
|
static Property |
TikaCoreProperties.TRANSITION_SUBJECT_TO_OO_SUBJECT
Deprecated.
use OfficeOpenXMLCore#SUBJECT
|
static Property |
Photoshop.TRANSMISSION_REFERENCE |
static Property |
DublinCore.TYPE
The nature or genre of the content of the resource.
|
static Property |
TikaCoreProperties.TYPE |
static Property |
PDF.UNMAPPED_UNICODE_CHARS_PER_PAGE |
static Property |
IPTC.URGENCY
Deprecated.
|
static Property |
Photoshop.URGENCY |
static Property |
XMPRights.USAGE_TERMS
A word or short phrase that identifies a resource as a member of a userdefined collection.
|
static Property |
QuattroPro.VERSION
Version.
|
static Property |
OfficeOpenXMLCore.VERSION
The version number.
|
static Property |
XMPDM.VIDEO_ALPHA_MODE
"The alpha mode."
|
static Property |
XMPDM.VIDEO_ALPHA_UNITY_IS_TRANSPARENT
"When true, unity is clear, when false, it is opaque."
|
static Property |
XMPDM.VIDEO_COLOR_SPACE
"The color space."
|
static Property |
XMPDM.VIDEO_COMPRESSOR
"Video compression used.
|
static Property |
XMPDM.VIDEO_FIELD_ORDER
"The field order for video."
|
static Property |
XMPDM.VIDEO_FRAME_RATE
"The video frame rate."
|
static Property |
XMPDM.VIDEO_MOD_DATE
"The date and time when the video was last modified."
|
static Property |
XMPDM.VIDEO_PIXEL_ASPECT_RATIO
"The aspect ratio, expressed as wd/ht.
|
static Property |
XMPDM.VIDEO_PIXEL_DEPTH
"The size in bits of each color component of a pixel.
|
static Property |
XMPRights.WEB_STATEMENT
A Web URL for a statement of the ownership and usage rights for this resource.
|
static Property |
MSOffice.WORD_COUNT
Deprecated.
|
static Property |
Office.WORD_COUNT
The number of Words in the document
|
static Property |
PDF.XMP_LOCATION
If xmp is extracted by, e.g.
|
Modifier and Type | Method and Description |
---|---|
static Property |
Property.composite(Property primaryProperty,
Property[] secondaryExtractProperties)
Constructs a new composite property from the given primary and array of secondary properties.
|
static Property |
Property.externalBoolean(String name) |
static Property |
Property.externalClosedChoise(String name,
String... choices) |
static Property |
Property.externalDate(String name) |
static Property |
Property.externalInteger(String name) |
static Property |
Property.externalOpenChoise(String name,
String... choices) |
static Property |
Property.externalReal(String name) |
static Property |
Property.externalText(String name) |
static Property |
Property.externalTextBag(String name) |
static Property |
Property.get(String key)
Retrieve the property object that corresponds to the given key
|
Property |
Property.getPrimaryProperty()
Gets the primary property for a composite property
|
Property[] |
Property.getSecondaryExtractProperties()
Gets the secondary properties for a composite property
|
static Property |
Property.internalBoolean(String name) |
static Property |
Property.internalClosedChoise(String name,
String... choices) |
static Property |
Property.internalDate(String name) |
static Property |
Property.internalInteger(String name) |
static Property |
Property.internalIntegerSequence(String name) |
static Property |
Property.internalOpenChoise(String name,
String... choices) |
static Property |
Property.internalRational(String name) |
static Property |
Property.internalReal(String name) |
static Property |
Property.internalText(String name) |
static Property |
Property.internalTextBag(String name) |
static Property |
Property.internalURI(String name) |
Modifier and Type | Method and Description |
---|---|
static SortedSet<Property> |
Property.getProperties(String prefix) |
Modifier and Type | Method and Description |
---|---|
void |
Metadata.add(Property property,
int value)
Adds the integer value of the identified metadata property.
|
void |
Metadata.add(Property property,
String value)
Add a metadata property/value mapping.
|
int |
Property.compareTo(Property o) |
static Property |
Property.composite(Property primaryProperty,
Property[] secondaryExtractProperties)
Constructs a new composite property from the given primary and array of secondary properties.
|
static Property |
Property.composite(Property primaryProperty,
Property[] secondaryExtractProperties)
Constructs a new composite property from the given primary and array of secondary properties.
|
String |
Metadata.get(Property property)
Returns the value (if any) of the identified metadata property.
|
Date |
Metadata.getDate(Property property)
Returns the value of the identified Date based metadata property.
|
Integer |
Metadata.getInt(Property property)
Returns the value of the identified Integer based metadata property.
|
int[] |
Metadata.getIntValues(Property property)
Gets the array of ints of the identified "seq" integer metadata property.
|
String[] |
Metadata.getValues(Property property)
Get the values associated to a metadata name.
|
boolean |
Metadata.isMultiValued(Property property)
Returns true if named value is multivalued.
|
void |
Metadata.set(Property property,
Calendar date)
Sets the date value of the identified metadata property.
|
void |
Metadata.set(Property property,
Date date)
Sets the date value of the identified metadata property.
|
void |
Metadata.set(Property property,
double value)
Sets the real or rational value of the identified metadata property.
|
void |
Metadata.set(Property property,
int value)
Sets the integer value of the identified metadata property.
|
void |
Metadata.set(Property property,
String value)
Sets the value of the identified metadata property.
|
void |
Metadata.set(Property property,
String[] values)
Sets the values of the identified metadata property.
|
Modifier and Type | Field and Description |
---|---|
static Property |
RecursiveParserWrapper.EMBEDDED_EXCEPTION
Deprecated.
|
static Property |
RecursiveParserWrapper.EMBEDDED_RESOURCE_LIMIT_REACHED
|
static Property |
RecursiveParserWrapper.EMBEDDED_RESOURCE_PATH
Deprecated.
|
static Property |
RecursiveParserWrapper.PARSE_TIME_MILLIS
Deprecated.
|
static Property |
RecursiveParserWrapper.TIKA_CONTENT
Deprecated.
|
static Property |
RecursiveParserWrapper.WRITE_LIMIT_REACHED
Deprecated.
|
Modifier and Type | Field and Description |
---|---|
static Property |
TextAndCSVParser.DELIMITER_PROPERTY |
Modifier and Type | Field and Description |
---|---|
static Property |
MachineMetadata.ARCHITECTURE_BITS |
static Property |
MachineMetadata.ENDIAN |
static Property |
MachineMetadata.MACHINE_TYPE |
static Property |
MachineMetadata.PLATFORM |
Modifier and Type | Method and Description |
---|---|
static boolean |
MetadataFields.isMetadataField(Property property) |
Modifier and Type | Method and Description |
---|---|
static void |
MailUtil.addPersonAndEmail(String string,
Property personProperty,
Property emailProperty,
Metadata metadata)
This tries to split a "from" or "to" value into a person field and an email field.
|
static void |
MailUtil.setPersonAndEmail(String string,
Property personProperty,
Property emailProperty,
Metadata metadata)
This tries to split a "from" or "to" value into a person field and an email field.
|
Modifier and Type | Field and Description |
---|---|
static Property |
JackcessParser.MDB_PW |
Modifier and Type | Method and Description |
---|---|
static void |
OutlookExtractor.addEvenIfNull(Property property,
String value,
Metadata metadata) |
static void |
SummaryExtractor.addMulti(Metadata metadata,
Property property,
String string) |
Modifier and Type | Field and Description |
---|---|
static Property |
XMLProfiler.ENTITY_LOCAL_NAMES |
static Property |
XMLProfiler.ENTITY_URIS |
static Property |
XMLProfiler.ROOT_ENTITY |
Constructor and Description |
---|
AttributeMetadataHandler(String uri,
String localName,
Metadata metadata,
Property property) |
ElementMetadataHandler(String uri,
String localName,
Metadata metadata,
Property targetProperty)
Constructor for Property metadata keys.
|
ElementMetadataHandler(String uri,
String localName,
Metadata metadata,
Property targetProperty,
boolean allowDuplicateValues,
boolean allowEmptyValues)
Constructor for Property metadata keys which allows change of behavior
for duplicate and empty entry values.
|
MetadataHandler(Metadata metadata,
Property property)
Deprecated.
|
Modifier and Type | Field and Description |
---|---|
static Property |
AbstractRecursiveParserWrapperHandler.CONTAINER_EXCEPTION |
static Property |
AbstractRecursiveParserWrapperHandler.EMBEDDED_DEPTH |
static Property |
AbstractRecursiveParserWrapperHandler.EMBEDDED_EXCEPTION |
static Property |
AbstractRecursiveParserWrapperHandler.EMBEDDED_RESOURCE_LIMIT_REACHED |
static Property |
AbstractRecursiveParserWrapperHandler.EMBEDDED_RESOURCE_PATH |
static Property |
AbstractRecursiveParserWrapperHandler.PARSE_TIME_MILLIS |
static Property |
AbstractRecursiveParserWrapperHandler.TIKA_CONTENT |
static Property |
AbstractRecursiveParserWrapperHandler.TIKA_CONTENT_HANDLER
Simple class name of the content handler
|
static Property |
AbstractRecursiveParserWrapperHandler.WRITE_LIMIT_REACHED |
Modifier and Type | Field and Description |
---|---|
static Property |
ParserUtils.EMBEDDED_EXCEPTION |
static Property |
ParserUtils.EMBEDDED_PARSER |
Modifier and Type | Method and Description |
---|---|
String |
XMPMetadata.get(Property property) |
Date |
XMPMetadata.getDate(Property property) |
Integer |
XMPMetadata.getInt(Property property) |
String[] |
XMPMetadata.getValues(Property property) |
boolean |
XMPMetadata.isMultiValued(Property property) |
void |
XMPMetadata.remove(Property property) |
void |
XMPMetadata.set(Property property,
Date date) |
void |
XMPMetadata.set(Property property,
double value) |
void |
XMPMetadata.set(Property property,
int value) |
void |
XMPMetadata.set(Property property,
String value) |
void |
XMPMetadata.set(Property property,
String[] values)
Sets array properties.
|
Modifier and Type | Method and Description |
---|---|
protected void |
AbstractConverter.createArrayProperty(Property metadataProperty,
String nsDc,
String arrayProperty,
int arrayType) |
protected void |
AbstractConverter.createCommaSeparatedArray(Property metadataProperty,
String nsDc,
String arrayProperty,
int arrayType) |
protected void |
AbstractConverter.createLangAltProperty(Property metadataProperty,
String ns,
String propertyName) |
protected void |
AbstractConverter.createProperty(Property metadataProperty,
String ns,
String propertyName) |
Copyright © 2007–2020 The Apache Software Foundation. All rights reserved.