Interface DocumentSelector

  • All Known Implementing Classes:

    public interface DocumentSelector
    Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance. An implementation of this interface defines some specific selection criteria to be applied against the document metadata passed to the select(Metadata) method.
    Apache Tika 0.8
    • Method Detail

      • select

        boolean select​(Metadata metadata)
        Checks if a document with the given metadata matches the specified selection criteria.
        metadata - document metadata
        true if the document matches the selection criteria, false otherwise