Package org.apache.tika.parser.gdal
Class GDALParser
- java.lang.Object
-
- org.apache.tika.parser.gdal.GDALParser
-
- All Implemented Interfaces:
Serializable, Parser
public class GDALParser extends Object implements Parser
Wraps execution of the Geospatial Data Abstraction Library (GDAL) gdalinfo tool, which extracts geospatial information from hundreds of geospatial file formats. The parser requires GDAL to be installed and gdalinfo to be located on the path. Basic information (Size, Coordinate System, Bounding Box, Driver, and resource info) is extracted as metadata, and the remaining metadata patterns are extracted and added. The output of the command is available from the provided ContentHandler in the parse(InputStream, ContentHandler, Metadata, ParseContext) method.
- See Also:
- Serialized Form
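Because the parser depends on gdalinfo being on the path, it can help to check for the executable before parsing. The following is a minimal sketch using only the Java standard library; the class GdalinfoPathCheck and the method findOnPath are hypothetical names for illustration and are not part of Tika.

```java
import java.io.File;
import java.nio.file.Files;
import java.nio.file.InvalidPathException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Tika: looks for an executable on PATH.
final class GdalinfoPathCheck {

    /** Returns the first executable file named toolName found in a PATH directory. */
    static Optional<Path> findOnPath(String toolName) {
        String pathVar = System.getenv("PATH");
        if (pathVar == null || pathVar.isEmpty()) {
            return Optional.empty();
        }
        for (String dir : pathVar.split(Pattern.quote(File.pathSeparator))) {
            if (dir.isEmpty()) {
                continue;
            }
            // Windows executables usually carry an .exe suffix, so try both names.
            for (String name : new String[] {toolName, toolName + ".exe"}) {
                try {
                    Path candidate = Paths.get(dir, name);
                    if (Files.isRegularFile(candidate) && Files.isExecutable(candidate)) {
                        return Optional.of(candidate);
                    }
                } catch (InvalidPathException e) {
                    // Skip malformed PATH entries rather than failing the whole search.
                }
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        Optional<Path> tool = findOnPath("gdalinfo");
        System.out.println(tool.isPresent()
                ? "gdalinfo found at " + tool.get()
                : "gdalinfo not found on PATH; GDALParser will not be able to run it");
    }
}
```

A check like this only confirms that a file with the expected name exists and is executable; it does not verify the GDAL version or that the tool runs correctly.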
-
-
Field Summary
Fields
| Modifier and Type | Field | Description |
|---|---|---|
| static long | DEFAULT_TIMEOUT_MS | |
-
Constructor Summary
Constructors
| Constructor | Description |
|---|---|
| GDALParser() | |
-
Method Summary
| Modifier and Type | Method | Description |
|---|---|---|
| String | getCommand() | |
| Set<MediaType> | getSupportedTypes(ParseContext context) | Returns the set of media types supported by this parser when used with the given parse context. |
| void | parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) | Parses a document stream into a sequence of XHTML SAX events. |
| String | processCommand(InputStream stream) | |
| void | setCommand(String command) | |
| void | setMaxStdErr(int maxStdErr) | |
| void | setMaxStdOut(int maxStdOut) | |
| void | setTimeoutMs(long timeoutMs) | |
-
-
-
Field Detail
-
DEFAULT_TIMEOUT_MS
public static final long DEFAULT_TIMEOUT_MS
- See Also:
- Constant Field Values
-
-
Method Detail
-
getCommand
public String getCommand()
-
setCommand
public void setCommand(String command)
-
processCommand
public String processCommand(InputStream stream)
-
getSupportedTypes
public Set<MediaType> getSupportedTypes(ParseContext context)
Description copied from interface: Parser
Returns the set of media types supported by this parser when used with the given parse context.
- Specified by:
getSupportedTypes in interface Parser
- Parameters:
context - parse context
- Returns:
- immutable set of media types
-
parse
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException
Description copied from interface: Parser
Parses a document stream into a sequence of XHTML SAX events. Fills in related document metadata in the given metadata object.
The given document stream is consumed but not closed by this method. The responsibility to close the stream remains on the caller.
Information about the parsing context can be passed in the context parameter. See the parser implementations for the kinds of context information they expect.
- Specified by:
parse in interface Parser
- Parameters:
stream - the document stream (input)
handler - handler for the XHTML SAX events (output)
metadata - document metadata (input and output)
context - parse context
- Throws:
IOException - if the document stream could not be read
SAXException - if the SAX events could not be processed
TikaException - if the document could not be parsed
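The stream-ownership rule stated for parse (the stream is consumed but not closed, and the caller closes it) can be illustrated without Tika on the classpath. In the sketch below, consume is a hypothetical stand-in for a parse call and TrackingStream is a hypothetical wrapper that records whether close() was called; neither is part of Tika.

```java
import java.io.ByteArrayInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Illustration of the "consumed but not closed" contract; not Tika code.
final class StreamOwnershipSketch {

    /** Hypothetical wrapper that remembers whether close() was called. */
    static final class TrackingStream extends FilterInputStream {
        boolean closed = false;

        TrackingStream(InputStream in) {
            super(in);
        }

        @Override
        public void close() throws IOException {
            closed = true;
            super.close();
        }
    }

    /** Hypothetical stand-in for a parse call: reads to the end, never closes. */
    static long consume(InputStream stream) throws IOException {
        byte[] buffer = new byte[4096];
        long total = 0;
        int n;
        while ((n = stream.read(buffer)) != -1) {
            total += n;
        }
        return total;
    }

    /** Returns {closed after consume, closed after the try block}. */
    static boolean[] demo() {
        TrackingStream tracked = new TrackingStream(new ByteArrayInputStream(new byte[10]));
        boolean closedAfterConsume;
        // The caller owns the stream, so the caller's try-with-resources closes it.
        try (TrackingStream stream = tracked) {
            consume(stream);
            closedAfterConsume = stream.closed;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new boolean[] {closedAfterConsume, tracked.closed};
    }

    public static void main(String[] args) {
        boolean[] result = demo();
        System.out.println("closed after consume: " + result[0]);
        System.out.println("closed after try block: " + result[1]);
    }
}
```

With the real parser, the call inside the try block would be parse(stream, handler, metadata, context), and the same try-with-resources pattern on the caller's side keeps the stream from leaking when parse throws.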
-
setTimeoutMs
@Field public void setTimeoutMs(long timeoutMs)
-
setMaxStdErr
@Field public void setMaxStdErr(int maxStdErr)
-
setMaxStdOut
@Field public void setMaxStdOut(int maxStdOut)
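This page gives no descriptions for setTimeoutMs, setMaxStdErr, and setMaxStdOut. Judging by their names alone, they presumably bound how long the external command may run and how much of its output is kept; that reading is an assumption. The sketch below shows the general technique with the Java standard library. It is not Tika's implementation, and BoundedCommandSketch, readCapped, and runBounded are hypothetical names.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.concurrent.TimeUnit;

// General technique only; assumed behavior, not taken from GDALParser's source.
final class BoundedCommandSketch {

    /** Reads at most maxBytes from the stream and decodes them as UTF-8. */
    static String readCapped(InputStream in, int maxBytes) {
        ByteArrayOutputStream kept = new ByteArrayOutputStream();
        byte[] buffer = new byte[1024];
        try {
            while (kept.size() < maxBytes) {
                int wanted = Math.min(buffer.length, maxBytes - kept.size());
                int n = in.read(buffer, 0, wanted);
                if (n == -1) {
                    break; // end of stream before the cap was reached
                }
                kept.write(buffer, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // A cap that falls inside a multi-byte character decodes to a replacement character.
        return new String(kept.toByteArray(), StandardCharsets.UTF_8);
    }

    /** Runs a command, kills it after timeoutMs, and returns at most maxStdOut bytes of output. */
    static String runBounded(List<String> command, long timeoutMs, int maxStdOut)
            throws IOException, InterruptedException {
        Path output = Files.createTempFile("command-output", ".txt");
        try {
            // stderr is merged into stdout for brevity; a separate stderr cap needs a second file.
            Process process = new ProcessBuilder(command)
                    .redirectErrorStream(true)
                    .redirectOutput(output.toFile())
                    .start();
            if (!process.waitFor(timeoutMs, TimeUnit.MILLISECONDS)) {
                process.destroyForcibly().waitFor();
                throw new IOException("command timed out after " + timeoutMs + " ms");
            }
            try (InputStream in = Files.newInputStream(output)) {
                return readCapped(in, maxStdOut);
            }
        } finally {
            Files.deleteIfExists(output);
        }
    }
}
```

Redirecting output to a temporary file avoids the deadlock that can occur when a child process fills its output pipe while the parent is blocked in waitFor.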
-
-