public class OpenSearchEmitter extends AbstractEmitter implements Initializable
| Modifier and Type | Class and Description |
|---|---|
static class |
OpenSearchEmitter.AttachmentStrategy |
static class |
OpenSearchEmitter.UpdateStrategy |
| Modifier and Type | Field and Description |
|---|---|
static String |
DEFAULT_EMBEDDED_FILE_FIELD_NAME |
| Constructor and Description |
|---|
OpenSearchEmitter() |
| Modifier and Type | Method and Description |
|---|---|
void |
checkInitialization(InitializableProblemHandler problemHandler) |
void |
emit(List<? extends EmitData> emitData)
The default behavior is to call
Emitter.emit(String, List) on each item. |
void |
emit(String emitKey,
List<Metadata> metadataList) |
int |
getCommitWithin() |
void |
initialize(Map<String,Param> params) |
void |
setAttachmentStrategy(String attachmentStrategy)
Options: SEPARATE_DOCUMENTS, PARENT_CHILD.
|
void |
setAuthScheme(String authScheme) |
void |
setCommitWithin(int commitWithin) |
void |
setConnectionTimeout(int connectionTimeout) |
void |
setEmbeddedFileFieldName(String embeddedFileFieldName)
If using the
OpenSearchEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name
used to store the child documents. |
void |
setIdField(String idField)
Specify the field in the first Metadata that should be
used as the id field for the document.
|
void |
setOpenSearchUrl(String openSearchUrl) |
void |
setPassword(String password) |
void |
setProxyHost(String proxyHost) |
void |
setProxyPort(int proxyPort) |
void |
setSocketTimeout(int socketTimeout) |
void |
setUpdateStrategy(OpenSearchEmitter.UpdateStrategy updateStrategy) |
void |
setUpdateStrategy(String strategy) |
void |
setUserName(String userName) |
getName, setNamepublic static String DEFAULT_EMBEDDED_FILE_FIELD_NAME
public OpenSearchEmitter()
throws TikaConfigException
TikaConfigExceptionpublic void emit(List<? extends EmitData> emitData) throws IOException, TikaEmitterException
AbstractEmitterEmitter.emit(String, List) on each item.
Some implementations, e.g. Solr/ES/vespa, can benefit from subclassing this and
emitting a bunch of docs at once.emit in interface Emitteremit in class AbstractEmitterIOExceptionTikaEmitterExceptionpublic void emit(String emitKey, List<Metadata> metadataList) throws IOException, TikaEmitterException
emit in interface EmitterIOExceptionTikaEmitterException@Field public void setAttachmentStrategy(String attachmentStrategy)
HandlerConfig's parseMode to
HandlerConfig.PARSE_MODE.CONCATENATE
in your FetchEmitTuple or in the
<parseMode> element in your PipesIterator
configuration.@Field public void setConnectionTimeout(int connectionTimeout)
@Field public void setSocketTimeout(int socketTimeout)
public int getCommitWithin()
@Field public void setCommitWithin(int commitWithin)
@Field public void setIdField(String idField)
idField - @Field public void setProxyPort(int proxyPort)
public void setUpdateStrategy(OpenSearchEmitter.UpdateStrategy updateStrategy)
public void setUpdateStrategy(String strategy) throws TikaConfigException
TikaConfigException@Field public void setEmbeddedFileFieldName(String embeddedFileFieldName)
OpenSearchEmitter.AttachmentStrategy.PARENT_CHILD, this is the field name
used to store the child documents. Note that we artificially flatten all embedded
documents, no matter how nested in the container document, into direct children
of the root document.embeddedFileFieldName - public void initialize(Map<String,Param> params) throws TikaConfigException
initialize in interface Initializableparams - params to use for initializationTikaConfigExceptionpublic void checkInitialization(InitializableProblemHandler problemHandler) throws TikaConfigException
checkInitialization in interface InitializableproblemHandler - if there is a problem and no
custom initializableProblemHandler has been configured
via Initializable parameters,
this is called to respond.TikaConfigExceptionCopyright © 2007–2023 The Apache Software Foundation. All rights reserved.