JSON Plugin
The JSON plugin (tika-pipes-json) provides an iterator that reads work items from a JSON-lines file (one JSON object per line). It is iterator-only — pair it with a fetcher and emitter.
| Interface | Component name | Class |
|---|---|---|
Iterator |
|
|
JSON Iterator (json-pipes-iterator)
Reads each line of a JSONL file as a work item and emits one FetchEmitTuple per object.
{
"pipes-iterator": {
"json-pipes-iterator": {
"jsonPath": "/data/work-items.jsonl",
"fetcherId": "fsf",
"emitterId": "fse"
}
}
}
Configuration
| Field | Default | Description |
|---|---|---|
|
required |
Path to the JSONL file on disk. |
|
required |
IDs of the fetcher and emitter to bind to each emitted tuple. See Pipes Iterators for the shared iterator contract. |
Notes
-
The file format is JSON-lines (also called NDJSON) — one valid JSON object per line, no surrounding array brackets.
-
Each line’s JSON object is parsed and its fields become the metadata of the emitted
FetchEmitTuple. The keys used for fetch / emit identification come from the tuple-shaped fields (e.g.,fetchKey,emitKey). -
For columnar work items in a CSV, use the CSV iterator instead.