Class RecursiveMetadataResource

java.lang.Object
org.apache.tika.server.core.resource.RecursiveMetadataResource

@Path("/rmeta") public class RecursiveMetadataResource extends Object
  • Field Details

  • Constructor Details

    • RecursiveMetadataResource

      public RecursiveMetadataResource()
  • Method Details

    • parseMetadata

      public static List<Metadata> parseMetadata(InputStream is, Metadata metadata, jakarta.ws.rs.core.MultivaluedMap<String,String> httpHeaders, jakarta.ws.rs.core.UriInfo info, HandlerConfig handlerConfig) throws Exception
      Throws:
      Exception
    • getMetadataFromMultipart

      @POST @Consumes("multipart/form-data") @Produces("application/json") @Path("form{handler : (\\w+)?}") public jakarta.ws.rs.core.Response getMetadataFromMultipart(org.apache.cxf.jaxrs.ext.multipart.Attachment att, @Context jakarta.ws.rs.core.UriInfo info, @PathParam("handler") String handlerTypeName) throws Exception
      Returns an InputStream that can be deserialized as a list of Metadata objects. The first in the list represents the main document, and the rest represent metadata for the embedded objects. This works recursively through all descendants of the main document, not just the immediate children.

      The extracted text content is stored with the key TikaCoreProperties.TIKA_CONTENT.

      Specify the handler for the content (xml, html, text, ignore) in the path:
      /rmeta/form (default: xml)
      /rmeta/form/xml (store the content as xml)
      /rmeta/form/text (store the content as text)
      /rmeta/form/ignore (don't record any content)

      Parameters:
      att - attachment
      info - uri info
      handlerTypeName - which type of handler to use
      Returns:
      InputStream that can be deserialized as a list of Metadata objects
      Throws:
      Exception
    • getMetadata

      @PUT @Produces("application/json") @Path("{handler : (\\w+)?}") public jakarta.ws.rs.core.Response getMetadata(InputStream is, @Context jakarta.ws.rs.core.HttpHeaders httpHeaders, @Context jakarta.ws.rs.core.UriInfo info, @PathParam("handler") String handlerTypeName) throws Exception
      Returns an InputStream that can be deserialized as a list of Metadata objects. The first in the list represents the main document, and the rest represent metadata for the embedded objects. This works recursively through all descendants of the main document, not just the immediate children.

      The extracted text content is stored with the key TikaCoreProperties.TIKA_CONTENT.

      Specify the handler for the content (xml, html, text, ignore) in the path:
      /rmeta (default: xml)
      /rmeta/xml (store the content as xml)
      /rmeta/text (store the content as text)
      /rmeta/ignore (don't record any content)

      Parameters:
      info - uri info
      handlerTypeName - which type of handler to use
      Returns:
      InputStream that can be deserialized as a list of Metadata objects
      Throws:
      Exception