public abstract class AbstractDBParser extends AbstractParser
Constructor and Description |
---|
AbstractDBParser() |
Modifier and Type | Method and Description |
---|---|
protected void |
close()
Override this for any special handling of closing the connection.
|
protected Connection |
getConnection(InputStream stream,
Metadata metadata,
ParseContext context)
Override this for special configuration of the connection, such as limiting
the number of rows to be held in memory.
|
protected abstract String |
getConnectionString(InputStream stream,
Metadata metadata,
ParseContext parseContext)
Implement for db specific connection information, e.g.
|
protected abstract String |
getJDBCClassName()
JDBC class name, e.g.
|
Set<MediaType> |
getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used
with the given parse context.
|
protected abstract List<String> |
getTableNames(Connection connection,
Metadata metadata,
ParseContext context)
Returns the names of the tables to process
|
protected abstract JDBCTableReader |
getTableReader(Connection connection,
String tableName,
EmbeddedDocumentUtil embeddedDocumentUtil)
Given a connection and a table name, return the JDBCTableReader for this db.
|
protected abstract JDBCTableReader |
getTableReader(Connection connection,
String tableName,
ParseContext parseContext)
Deprecated.
|
void |
parse(InputStream stream,
ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses a document stream into a sequence of XHTML SAX events.
|
parse
public Set<MediaType> getSupportedTypes(ParseContext context)
Parser
context
- parse contextpublic void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException
Parser
The given document stream is consumed but not closed by this method. The responsibility to close the stream remains on the caller.
Information about the parsing context can be passed in the context parameter. See the parser implementations for the kinds of context information they expect.
stream
- the document stream (input)handler
- handler for the XHTML SAX events (output)metadata
- document metadata (input and output)context
- parse contextIOException
- if the document stream could not be readSAXException
- if the SAX events could not be processedTikaException
- if the document could not be parsedprotected void close() throws SQLException, IOException
SQLException
IOException
protected Connection getConnection(InputStream stream, Metadata metadata, ParseContext context) throws IOException, TikaException
stream
- stream to usemetadata
- metadata that could be used in parameterizing the connectioncontext
- parsecontext that could be used in parameterizing the connectionIOException
TikaException
protected abstract String getConnectionString(InputStream stream, Metadata metadata, ParseContext parseContext) throws IOException
stream
- stream for processingmetadata
- metadata might be useful in determining connection infoparseContext
- context to use to help create connectionStringgetConnection(java.io.InputStream, org.apache.tika.metadata.Metadata, org.apache.tika.parser.ParseContext)
.IOException
protected abstract String getJDBCClassName()
protected abstract List<String> getTableNames(Connection connection, Metadata metadata, ParseContext context) throws SQLException
connection
- Connection to use to make the sql call(s) to get the names of the tablesmetadata
- Metadata to use (potentially) in decision about which tables to extractcontext
- ParseContext to use (potentially) in decision about which tables to extractSQLException
@Deprecated protected abstract JDBCTableReader getTableReader(Connection connection, String tableName, ParseContext parseContext)
getTableReader(Connection, String, EmbeddedDocumentUtil)
connection
- tableName
- protected abstract JDBCTableReader getTableReader(Connection connection, String tableName, EmbeddedDocumentUtil embeddedDocumentUtil)
connection
- tableName
- embeddedDocumentUtil
- embedded doc utilCopyright © 2007–2023 The Apache Software Foundation. All rights reserved.