Class PaginatedLocator

java.lang.Object
org.apache.tika.inference.locator.PaginatedLocator

public class PaginatedLocator extends Object
Locator for paginated documents (PDF, PPTX, DOCX, etc.).

The bounding box coordinates are normalized to [0, 1] relative to page dimensions, making them resolution-independent. The bbox array is [x0, y0, x1, y1] where (x0, y0) is the top-left corner and (x1, y1) is the bottom-right corner.
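
As a minimal sketch of what resolution independence means in practice, the same normalized bbox can be scaled to any rendered page size. The class `BboxPixels` and its method `toPixels` are hypothetical helpers, not part of Tika, and the sketch assumes a top-left origin with y increasing downward, in line with the corner order described above.

```java
import java.util.Arrays;

/** Hypothetical helper (not part of Tika): scales a normalized bbox to pixels. */
final class BboxPixels {

    /**
     * Converts a normalized [x0, y0, x1, y1] bbox to pixel coordinates.
     * Assumes the origin is the top-left corner of the rendered page.
     */
    static int[] toPixels(float[] bbox, int pageWidthPx, int pageHeightPx) {
        if (bbox == null || bbox.length != 4) {
            throw new IllegalArgumentException("bbox must be [x0, y0, x1, y1]");
        }
        return new int[] {
            Math.round(bbox[0] * pageWidthPx),   // left
            Math.round(bbox[1] * pageHeightPx),  // top
            Math.round(bbox[2] * pageWidthPx),   // right
            Math.round(bbox[3] * pageHeightPx)   // bottom
        };
    }

    public static void main(String[] args) {
        float[] bbox = {0.25f, 0.5f, 0.75f, 1.0f};
        // The same normalized box at two render resolutions.
        System.out.println(Arrays.toString(toPixels(bbox, 800, 1000)));   // [200, 500, 600, 1000]
        System.out.println(Arrays.toString(toPixels(bbox, 1600, 2000)));  // [400, 1000, 1200, 2000]
    }
}
```

Because the stored values are fractions of the page dimensions, only the render size passed to the helper changes between the two calls.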

  • Constructor Details

    • PaginatedLocator

      public PaginatedLocator(int page, float[] bbox)
      Parameters:
      page - 1-based page number
      bbox - normalized bounding box [x0, y0, x1, y1], or null if the entire page is referenced
    • PaginatedLocator

      public PaginatedLocator(int page)
      Parameters:
      page - 1-based page number
  • Method Details

    • getPage

      public int getPage()
      Returns:
      1-based page number
    • getBbox

      public float[] getBbox()
      Returns:
      normalized bounding box [x0, y0, x1, y1], or null if the entire page is referenced
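
Since a null bbox stands for the whole page, callers may want to normalize that case before doing geometry. The following is a sketch only: `WholePageBbox`, `effectiveBbox`, and `coveredFraction` are hypothetical names, not part of Tika, and treating the whole page as [0, 0, 1, 1] is an assumption that follows from the normalized coordinate range.

```java
/** Hypothetical helper (not part of Tika): treats a null bbox as the full page. */
final class WholePageBbox {

    /** Returns the bbox itself (copied), or [0, 0, 1, 1] when bbox is null. */
    static float[] effectiveBbox(float[] bbox) {
        return bbox == null ? new float[] {0f, 0f, 1f, 1f} : bbox.clone();
    }

    /** Fraction of the page area covered by the bbox, in [0, 1] for valid input. */
    static float coveredFraction(float[] bbox) {
        float[] b = effectiveBbox(bbox);
        return (b[2] - b[0]) * (b[3] - b[1]);
    }

    public static void main(String[] args) {
        System.out.println(coveredFraction(null));                                   // 1.0
        System.out.println(coveredFraction(new float[] {0.25f, 0.5f, 0.75f, 1.0f})); // 0.25
    }
}
```

In real use, the argument would be the value returned by getBbox(); the sketch takes a plain float[] so that it runs without the Tika classes on the classpath.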