Pipes Configuration
The pipes section of the JSON config controls the pipeline process itself:
how many forked JVMs to run, timeouts, memory management, and parse behavior.
{
  "pipes": {
    "numClients": 4,
    "socketTimeoutMs": 60000,
    "maxFilesProcessedPerProcess": 10000,
    "parseMode": "RMETA",
    "onParseException": "EMIT",
    "forkedJvmArgs": ["-Xmx512m"]
  }
}
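Because every forked JVM receives the same forkedJvmArgs, the heap ceiling across all forks scales with numClients. The following is a minimal sketch of that arithmetic, not part of the pipeline itself: the helper name `max_heap_mb` is hypothetical, it only understands `-Xmx` values with a k, m, or g suffix, and it ignores non-heap JVM memory.

```python
import re


def max_heap_mb(jvm_args):
    """Return the value of the first -Xmx flag in MiB, or None if absent.

    Hypothetical helper: only handles -Xmx<digits><k|m|g>.
    """
    units = {"k": 1 / 1024, "m": 1, "g": 1024}
    for arg in jvm_args:
        match = re.fullmatch(r"-Xmx(\d+)([kKmMgG])", arg)
        if match:
            return int(match.group(1)) * units[match.group(2).lower()]
    return None


# Values taken from the example config above.
pipes = {"numClients": 4, "forkedJvmArgs": ["-Xmx512m"]}

per_fork = max_heap_mb(pipes["forkedJvmArgs"])
total = pipes["numClients"] * per_fork
print(f"{total:.0f} MiB")  # 2048 MiB
```

With the example values, four forks at 512 MiB each can claim up to 2048 MiB of heap, in addition to whatever the parent process uses.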
Process Management
| Field | Default | Description |
|---|---|---|
| numClients | | Number of parallel forked JVM processes. Each process handles one document at a time. See Forked-JVM CPU Sizing for guidance on choosing this value relative to host CPU count. |
| forkedJvmArgs | | JVM arguments for forked processes (e.g., the -Xmx512m heap limit used in the example above). |
| | | Path to the Java executable for forked processes. |
| maxFilesProcessedPerProcess | | Restart forked processes after this many files. Prevents slow-building memory leaks in parsing libraries. |
| | system default | Directory for temporary files. Consider a RAM-backed filesystem (e.g., …). |
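The restart-after-N-files policy can be pictured with a small simulation. This is only a sketch of the idea under stated assumptions: `ForkedWorker` and `RecyclingSlot` are hypothetical stand-ins, no JVM is actually launched, and the real pipeline may decide when to restart differently.

```python
class ForkedWorker:
    """Stand-in for a forked JVM; it only counts the files it has handled."""

    def __init__(self, worker_id):
        self.worker_id = worker_id
        self.files_processed = 0

    def parse(self, path):
        self.files_processed += 1
        return f"worker-{self.worker_id} parsed {path}"


class RecyclingSlot:
    """Replaces its worker once the worker has handled max_files files."""

    def __init__(self, max_files_per_process):
        self.max_files = max_files_per_process
        self.restarts = 0
        self.worker = ForkedWorker(0)

    def submit(self, path):
        # Discard the old worker (and any memory it leaked) before the
        # next file once the per-process limit has been reached.
        if self.worker.files_processed >= self.max_files:
            self.restarts += 1
            self.worker = ForkedWorker(self.restarts)
        return self.worker.parse(path)


slot = RecyclingSlot(max_files_per_process=3)
results = [slot.submit(f"doc{i}.pdf") for i in range(7)]
print(slot.restarts)  # 2
print(results[-1])    # worker-2 parsed doc6.pdf
```

A lower limit bounds how much leaked memory can accumulate in one process, at the cost of paying JVM startup time more often.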
Timeouts
See also Timeouts for the full timeout model.
| Field | Default | Description |
|---|---|---|
| socketTimeoutMs | | Maximum time (ms) to wait for data from a forked process. If no heartbeat or result is received within this window, the parse is considered hung. |
| | | Interval (ms) between heartbeats sent from the forked process. Must be significantly less than socketTimeoutMs, the data-wait window in the row above. |
| | | Maximum time (ms) to wait for a forked process to start up. |
| | | Shut down an idle forked process after this many milliseconds of inactivity. |
| | | Maximum time (ms) to wait for an available forked process when all are busy. |
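The relationship between the heartbeat interval and the data-wait timeout can be sketched as follows. This is an illustrative model, not the pipeline's actual watchdog: `first_hang_time` is a hypothetical helper, the 60000 ms timeout comes from the example config, and the heartbeat intervals are made-up values.

```python
def first_hang_time(message_times_ms, socket_timeout_ms, end_ms):
    """Return the time at which the parent would give up, or None.

    message_times_ms: sorted arrival times of heartbeats or results,
    measured from the moment the parse was handed to the forked process.
    """
    last = 0
    for t in message_times_ms:
        if t - last > socket_timeout_ms:
            return last + socket_timeout_ms
        last = t
    if end_ms - last > socket_timeout_ms:
        return last + socket_timeout_ms
    return None


SOCKET_TIMEOUT_MS = 60000  # from the example config

# Heartbeat every second (hypothetical interval): never flagged.
healthy = list(range(1000, 120001, 1000))
print(first_hang_time(healthy, SOCKET_TIMEOUT_MS, end_ms=120000))  # None

# Heartbeats stop after 30 s: flagged once the timeout elapses.
stalled = list(range(1000, 30001, 1000))
print(first_hang_time(stalled, SOCKET_TIMEOUT_MS, end_ms=120000))  # 90000

# Interval of 50 s is too close to the timeout: one heartbeat that
# arrives 15 s late is enough to flag a parse that is still alive.
slow = [50000, 115000]
print(first_hang_time(slow, SOCKET_TIMEOUT_MS, end_ms=120000))  # 110000
```

The last case shows why the interval should sit well below the timeout: with little headroom, ordinary scheduling or GC delays can look like a hung parse.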
Parse Behavior
| Field | Default | Description |
|---|---|---|
| parseMode | | How embedded documents are handled (the example above uses RMETA). |
| onParseException | | What to do when a parse fails (the example above uses EMIT). |
| | | When … |
Async / Emit Batching
These settings control how parsed results are batched before sending to emitters.
| Field | Default | Description |
|---|---|---|
| | | Number of emitter threads. |
| | | Size of the fetch/emit tuple queue. |
| | | Flush the emit batch if nothing has been emitted within this many milliseconds, even if the batch is not full. |
| | | Flush the emit batch when the estimated size reaches this many bytes. |
| | | Emit partial results as they become available (rather than waiting for the full parse to complete). |
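The two flush triggers described above (estimated batch size and time since the last emit) can be sketched together. This is a simplified, single-threaded model under stated assumptions: `EmitBatcher`, its parameter names, and the explicit `tick` call are all hypothetical and do not reflect the pipeline's actual classes or threading.

```python
class EmitBatcher:
    """Collects items and flushes on size or on time since the last emit."""

    def __init__(self, max_bytes, idle_flush_ms, emit):
        self.max_bytes = max_bytes
        self.idle_flush_ms = idle_flush_ms
        self.emit = emit  # callable that receives a list of items
        self.batch = []
        self.batch_bytes = 0
        self.last_emit_ms = 0

    def add(self, item, estimated_bytes, now_ms):
        self.batch.append(item)
        self.batch_bytes += estimated_bytes
        # Size trigger: flush as soon as the estimate reaches the limit.
        if self.batch_bytes >= self.max_bytes:
            self.flush(now_ms)

    def tick(self, now_ms):
        # Time trigger: flush a partial batch if nothing was emitted lately.
        if self.batch and now_ms - self.last_emit_ms >= self.idle_flush_ms:
            self.flush(now_ms)

    def flush(self, now_ms):
        if self.batch:
            self.emit(list(self.batch))
        self.batch = []
        self.batch_bytes = 0
        self.last_emit_ms = now_ms


emitted = []
batcher = EmitBatcher(max_bytes=100, idle_flush_ms=500, emit=emitted.append)
batcher.add("a", 60, now_ms=0)
batcher.add("b", 60, now_ms=10)   # 120 bytes >= 100: size flush
batcher.add("c", 10, now_ms=20)
batcher.tick(now_ms=300)          # only 290 ms since last emit: no flush
batcher.tick(now_ms=600)          # 590 ms since last emit: time flush
print(emitted)  # [['a', 'b'], ['c']]
```

Larger batches reduce the number of calls to the emitter, while the time trigger keeps a slow trickle of documents from sitting in a partial batch indefinitely.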
Shared Server Mode (Experimental)
| Field | Default | Description |
|---|---|---|
| | | When … |
See Shared Server Mode for details.