Class JDBCPipesReporter
- java.lang.Object
-
- org.apache.tika.pipes.PipesReporter
-
- org.apache.tika.pipes.PipesReporterBase
-
- org.apache.tika.pipes.reporters.jdbc.JDBCPipesReporter
-
- All Implemented Interfaces:
Closeable,AutoCloseable,Initializable
public class JDBCPipesReporter extends PipesReporterBase implements Initializable
This is an initial draft of a JDBCPipesReporter. This will drop the tika_status table with each run. If you'd like different behavior, please open a ticket on our JIRA!
-
-
Field Summary
Fields Modifier and Type Field Description static StringTABLE_NAME-
Fields inherited from class org.apache.tika.pipes.PipesReporter
NO_OP_REPORTER
-
-
Constructor Summary
Constructors Constructor Description JDBCPipesReporter()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidcheckInitialization(InitializableProblemHandler problemHandler)voidclose()No-op implementation.voiderror(String msg)This is called if the process has crashed.voiderror(Throwable t)This is called if the process has crashed.StringgetReportSql()List<String>getReportVariables()StringgetTableName()voidinitialize(Map<String,Param> params)booleanisCreateTable()voidreport(FetchEmitTuple t, PipesResult result, long elapsed)voidsetCacheSize(int cacheSize)Commit the reports if the cache is greater than or equal to this size.voidsetConnection(String connection)voidsetCreateTable(boolean createTable)The default is true.voidsetPostConnection(String postConnection)This sql will be called immediately after the connection is made.voidsetReportSql(String reportSql)This is the sql for the prepared statement to execute to store the report record. the default is:insert into tika_status (id, status, timestamp) values (?voidsetReportVariables(List<String> variables)ADVANCED: This is used to set the variables in the prepared statement for the report.voidsetReportWithinMs(long reportWithinMs)Commit the reports if the amount of time elapsed since the last report commit exceeds this value.voidsetTableName(String tableName)The default isTABLE_NAME-
Methods inherited from class org.apache.tika.pipes.PipesReporterBase
accept, setExcludes, setIncludes
-
Methods inherited from class org.apache.tika.pipes.PipesReporter
report, supportsTotalCount
-
-
-
-
Field Detail
-
TABLE_NAME
public static final String TABLE_NAME
- See Also:
- Constant Field Values
-
-
Method Detail
-
initialize
public void initialize(Map<String,Param> params) throws TikaConfigException
- Specified by:
initializein interfaceInitializable- Overrides:
initializein classPipesReporterBase- Parameters:
params- params to use for initialization- Throws:
TikaConfigException
-
checkInitialization
public void checkInitialization(InitializableProblemHandler problemHandler) throws TikaConfigException
- Specified by:
checkInitializationin interfaceInitializable- Overrides:
checkInitializationin classPipesReporterBase- Parameters:
problemHandler- if there is a problem and no custom initializableProblemHandler has been configured via Initializable parameters, this is called to respond.- Throws:
TikaConfigException
-
setCacheSize
@Field public void setCacheSize(int cacheSize)
Commit the reports if the cache is greater than or equal to this size. Default isDEFAULT_CACHE_SIZE. The reports will be committed if the cache size triggers reporting or if the amount of time since last reported (reportWithinMs) triggers reporting.- Parameters:
cacheSize-
-
setCreateTable
@Field public void setCreateTable(boolean createTable)
The default is true. In a distributed setting with multiple servers, this should be set to false, and you'll need to set up the table on your own. NOTE The default behavior is to drop the table if it exists and then create it. Make sure to set this to false if you do not want to drop the table.- Parameters:
createTable-
-
setTableName
@Field public void setTableName(String tableName)
The default isTABLE_NAME- Parameters:
tableName-
-
setReportSql
@Field public void setReportSql(String reportSql)
This is the sql for the prepared statement to execute to store the report record. the default is:insert into tika_status (id, status, timestamp) values (?,?,?)This can be modified for specific dialects of SQL or to run an upsert, merge or update instead of the default insert. Users need to coordinate this withsetReportVariables(List)- Parameters:
reportSql-
-
getTableName
public String getTableName()
-
getReportSql
public String getReportSql()
-
isCreateTable
public boolean isCreateTable()
-
setReportVariables
@Field public void setReportVariables(List<String> variables)
ADVANCED: This is used to set the variables in the prepared statement for the report. This needs to be coordinated withsetReportSql(String). The available variables are "id, status, timestamp". If you're modifying to an update statement like "update table tika_status set status=?, timestamp=? where id = ?" then the values for this would be ["status", "timestamp", "id"]. The default for the insert is ["id", "status", "timestamp"]- Parameters:
variables-
-
setReportWithinMs
@Field public void setReportWithinMs(long reportWithinMs)
Commit the reports if the amount of time elapsed since the last report commit exceeds this value. Default isDEFAULT_REPORT_WITHIN_MS. The reports will be committed if the cache size triggers reporting or if the amount of time since last reported triggers reporting.- Parameters:
reportWithinMs-
-
setPostConnection
@Field public void setPostConnection(String postConnection)
This sql will be called immediately after the connection is made. This was initially added for setting pragmas on sqlite3, but may be used for other connection configuration in other dbs. Note: This is called before the table is created if it needs to be created.- Parameters:
postConnection-
-
report
public void report(FetchEmitTuple t, PipesResult result, long elapsed)
- Specified by:
reportin classPipesReporter
-
error
public void error(Throwable t)
Description copied from class:PipesReporterThis is called if the process has crashed. Implementers should not rely on close() to be called after this.- Specified by:
errorin classPipesReporter
-
error
public void error(String msg)
Description copied from class:PipesReporterThis is called if the process has crashed. Implementers should not rely on close() to be called after this.- Specified by:
errorin classPipesReporter
-
close
public void close() throws IOExceptionDescription copied from class:PipesReporterNo-op implementation. Override for custom behavior- Specified by:
closein interfaceAutoCloseable- Specified by:
closein interfaceCloseable- Overrides:
closein classPipesReporter- Throws:
IOException
-
-