public class ParserUtils extends Object
| Modifier and Type | Field and Description |
|---|---|
| static Property | EMBEDDED_PARSER |
| Constructor and Description |
|---|
| ParserUtils() |
| Modifier and Type | Method and Description |
|---|---|
| static Metadata | cloneMetadata(Metadata m)<br>Does a deep clone of a Metadata object. |
| static InputStream | ensureStreamReReadable(InputStream stream, TemporaryResources tmp)<br>Ensures that the Stream will be able to be re-read, by buffering to a temporary file if required. |
| static String | getParserClassname(Parser parser)<br>Identifies the real class name of the Parser, unwrapping any ParserDecorator decorations on top of it. |
| static void | recordParserDetails(Parser parser, Metadata metadata) |
| static void | recordParserFailure(Parser parser, Throwable failure, Metadata metadata) |
| static InputStream | streamResetForReRead(InputStream stream, TemporaryResources tmp)<br>Resets the given TikaInputStream (checked by ensureStreamReReadable(InputStream, TemporaryResources)) so that it can be re-read. |
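
The idea behind the two stream helpers in the summary above, buffering to a temporary file so the data can be read more than once, can be sketched with the standard library alone. This is only an illustration of the technique and not Tika's implementation: `SpooledSource`, `open()` and `readAll()` are hypothetical names that do not exist in Tika, and the real methods work with TikaInputStream and TemporaryResources instead.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical stand-in, not part of Tika: spools a one-shot stream to a
// temporary file so that its contents can be read any number of times.
final class SpooledSource implements AutoCloseable {
    private final Path spoolFile;

    SpooledSource(InputStream oneShot) throws IOException {
        // createTempFile already creates an empty file, so the copy must
        // be allowed to replace it.
        spoolFile = Files.createTempFile("reread-", ".tmp");
        Files.copy(oneShot, spoolFile, StandardCopyOption.REPLACE_EXISTING);
    }

    // Each call opens a fresh stream positioned at the start of the data.
    InputStream open() throws IOException {
        return Files.newInputStream(spoolFile);
    }

    // Removes the temporary file once no further re-reads are needed.
    @Override
    public void close() throws IOException {
        Files.deleteIfExists(spoolFile);
    }

    // Convenience helper for demonstrations: drain a stream as UTF-8 text.
    static String readAll(InputStream in) throws IOException {
        try (InputStream s = in) {
            return new String(s.readAllBytes(), StandardCharsets.UTF_8);
        }
    }
}
```

In this sketch a caller would construct one `SpooledSource`, call `open()` once per parsing attempt, and close it when finished. Opening a new stream per read is a simplification; resetting an existing stream in place, as streamResetForReRead is documented to do, avoids handing out multiple stream objects.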
public static final Property EMBEDDED_PARSER
public static Metadata cloneMetadata(Metadata m)
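
What "deep clone" means for a multi-valued key/value store can be shown with a small self-contained sketch. `MultiValueStore` and its methods are hypothetical stand-ins invented for this illustration; they are not Tika's Metadata class, and the sketch makes no claim about how cloneMetadata is implemented internally.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for a multi-valued metadata store; not Tika's Metadata.
final class MultiValueStore {
    private final Map<String, String[]> values = new LinkedHashMap<>();

    // Appends a value under the given name, keeping earlier values.
    void add(String name, String value) {
        String[] old = values.getOrDefault(name, new String[0]);
        String[] grown = Arrays.copyOf(old, old.length + 1);
        grown[old.length] = value;
        values.put(name, grown);
    }

    // Returns a defensive copy so callers cannot alter internal state.
    String[] getValues(String name) {
        return values.getOrDefault(name, new String[0]).clone();
    }

    // Deep clone: copy every key and every value array, so that later
    // changes to either store are invisible to the other.
    MultiValueStore deepClone() {
        MultiValueStore copy = new MultiValueStore();
        for (Map.Entry<String, String[]> e : values.entrySet()) {
            copy.values.put(e.getKey(), e.getValue().clone());
        }
        return copy;
    }
}
```

The property that matters is independence: after cloning, adding a value to the copy leaves the original with the values it had before.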
public static String getParserClassname(Parser parser)
Identifies the real class name of the Parser, unwrapping any ParserDecorator decorations on top of it.
public static void recordParserFailure(Parser parser, Throwable failure, Metadata metadata)
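
The decorator unwrapping that getParserClassname is documented to perform can be pictured as a simple loop. The types below (`Handler`, `HandlerDecorator`, `PlainHandler`, `Unwrapper`) are hypothetical stand-ins written for this sketch; Tika's own Parser and ParserDecorator are richer, and this is not a copy of the library's code.

```java
// Hypothetical stand-ins for a parser interface and its decorator.
interface Handler {}

class HandlerDecorator implements Handler {
    private final Handler wrapped;

    HandlerDecorator(Handler wrapped) {
        this.wrapped = wrapped;
    }

    Handler getWrapped() {
        return wrapped;
    }
}

final class PlainHandler implements Handler {}

final class Unwrapper {
    // Peels decorators off one layer at a time until a non-decorator
    // remains, then reports that object's class name.
    static String realClassName(Handler h) {
        Handler current = h;
        while (current instanceof HandlerDecorator) {
            current = ((HandlerDecorator) current).getWrapped();
        }
        return current.getClass().getName();
    }
}
```

A name obtained this way identifies the component that did the work rather than the outermost wrapper, which is the more useful value to store alongside a failure when recording diagnostics.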
public static InputStream ensureStreamReReadable(InputStream stream, TemporaryResources tmp) throws IOException
Ensures that the Stream will be able to be re-read, by buffering to a temporary file if required. Streams that are automatically OK include TikaInputStreams created from Files or InputStreamFactories, and RereadableInputStream.
Throws:
IOException
public static InputStream streamResetForReRead(InputStream stream, TemporaryResources tmp) throws IOException
Resets the given TikaInputStream (checked by ensureStreamReReadable(InputStream, TemporaryResources)) so that it can be re-read.
Throws:
IOException
Copyright © 2007–2022 The Apache Software Foundation. All rights reserved.