Class | Description |
---|---|
AdvancedTypeDetector | |
ContentHandlerExample |
Examples of using different Content Handlers to
get different parts of the file's contents
|
CustomMimeInfo | |
DescribeMetadata |
Print the supported Tika Metadata models and their fields.
|
DirListParser |
Parses the output of /bin/ls and counts the number of files and the number of
executables using Tika.
|
DisplayMetInstance |
Grabs a PDF file from a URL and prints its
Metadata |
DumpTikaConfigExample |
This class shows how to dump a TikaConfig object to a configuration file.
|
EncryptedPrescriptionDetector | |
EncryptedPrescriptionParser | |
ExtractEmbeddedFiles | |
GrabPhoneNumbersExample |
Class to demonstrate how to use the
PhoneExtractingContentHandler
to get a list of all of the phone numbers from every file in a directory. |
ImportContextImpl |
ImportContextImpl ... |
InterruptableParsingExample |
This example demonstrates how to interrupt document parsing if
some condition is met.
|
Language | |
LanguageDetectingParser | |
LanguageDetectorExample | |
LazyTextExtractorField |
LazyTextExtractorField implements a Lucene field with a String
value that is lazily initialized from a given Reader . |
LuceneIndexer | |
LuceneIndexerExtended | |
MediaTypeExample | |
MetadataAwareLuceneIndexer |
Builds on the LuceneIndexer from Chapter 5 and adds indexing of Metadata.
|
MyFirstTika | |
ParsingExample | |
Pharmacy | |
PrescriptionParser | |
RecentFiles |
Builds on top of the LuceneIndexer and the Metadata discussions in Chapter 6
to output an RSS (or RDF) feed of files crawled by the LuceneIndexer within
the last N minutes.
|
RollbackSoftware |
Demonstrates Tika and its ability to sense symlinks.
|
SimpleTextExtractor | |
SimpleTypeDetector | |
SpringExample | |
TIAParsingExample | |
TranslatorExample | |
TrecDocumentGenerator |
Generates document summaries for corpus analysis in the Open Relevance
project.
|
ZipListFiles |
Example code listing from Chapter 1.
|
Copyright © 2007–2018 The Apache Software Foundation. All rights reserved.