Package org.apache.tika.parser.image
Class ImageMetadataExtractor
java.lang.Object
org.apache.tika.parser.image.ImageMetadataExtractor
Uses the Metadata Extractor library
to read EXIF and IPTC image metadata and map to Tika fields.
As of 2.4.0 the library supports jpeg and tiff.
As of 2.8.0 the library supports webp.
-
Constructor Summary
ConstructorsConstructorDescriptionImageMetadataExtractor(Metadata metadata) ImageMetadataExtractor(Metadata metadata, org.apache.tika.parser.image.ImageMetadataExtractor.DirectoryHandler... handlers) -
Method Summary
Modifier and TypeMethodDescriptionprotected voidhandle(com.drew.metadata.Metadata metadataExtractor) Copies extracted tags to tika metadata using registered handlers.protected voidCopies extracted tags to tika metadata using registered handlers.voidparseHeif(InputStream is) voidvoidparseRawExif(byte[] exifData) voidparseRawExif(InputStream stream, int length, boolean needsExifHeader) voidparseRawXMP(byte[] xmpData) voidvoid
-
Constructor Details
-
ImageMetadataExtractor
- Parameters:
metadata- to extract to, using default directory handlers
-
ImageMetadataExtractor
public ImageMetadataExtractor(Metadata metadata, org.apache.tika.parser.image.ImageMetadataExtractor.DirectoryHandler... handlers) - Parameters:
metadata- to extract tohandlers- handlers in order, note that handlers may override values from earlier handlers
-
-
Method Details
-
parseJpeg
- Throws:
IOExceptionSAXExceptionTikaException
-
parseTiff
- Throws:
IOExceptionSAXExceptionTikaException
-
parseWebP
- Throws:
IOExceptionTikaException
-
parseHeif
- Throws:
IOExceptionTikaException
-
parseRawExif
public void parseRawExif(InputStream stream, int length, boolean needsExifHeader) throws IOException, SAXException, TikaException - Throws:
IOExceptionSAXExceptionTikaException
-
parseRawExif
- Throws:
IOExceptionSAXExceptionTikaException
-
parseRawXMP
- Throws:
IOExceptionSAXExceptionTikaException
-
handle
protected void handle(com.drew.metadata.Metadata metadataExtractor) throws com.drew.metadata.MetadataException Copies extracted tags to tika metadata using registered handlers.- Parameters:
metadataExtractor- Tag directories from a Metadata Extractor "reader"- Throws:
com.drew.metadata.MetadataException- This method does not handle exceptions from Metadata Extractor
-
handle
protected void handle(Iterator<com.drew.metadata.Directory> directories) throws com.drew.metadata.MetadataException Copies extracted tags to tika metadata using registered handlers.- Parameters:
directories- Metadata ExtractorDirectoryinstances.- Throws:
com.drew.metadata.MetadataException- This method does not handle exceptions from Metadata Extractor
-