Class PaginatedLocator
java.lang.Object
org.apache.tika.inference.locator.PaginatedLocator
Locator for paginated documents (PDF, PPTX, DOCX, etc.).
The bounding box coordinates are normalized to [0, 1] relative to
page dimensions, making them resolution-independent.
The bbox array is [x0, y0, x1, y1] where (x0, y0) is the
top-left corner and (x1, y1) is the bottom-right corner.
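Because the coordinates are normalized, the same bbox can be mapped onto a page rendered at any resolution. The following is a minimal sketch of that mapping, not part of Tika itself: the `BboxToPixels` class and its `toPixels` helper are hypothetical names, and rounding to the nearest pixel is an assumption made for illustration.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Tika: maps a normalized bbox onto a rendered page.
final class BboxToPixels {

    /**
     * Scales a normalized [x0, y0, x1, y1] box to integer pixel coordinates
     * for a page rendered at the given size. Rounding to the nearest pixel
     * is an assumption of this sketch.
     */
    static int[] toPixels(float[] bbox, int pageWidthPx, int pageHeightPx) {
        if (bbox == null || bbox.length != 4) {
            throw new IllegalArgumentException("expected [x0, y0, x1, y1]");
        }
        return new int[] {
            Math.round(bbox[0] * pageWidthPx),   // left edge (x0)
            Math.round(bbox[1] * pageHeightPx),  // top edge (y0)
            Math.round(bbox[2] * pageWidthPx),   // right edge (x1)
            Math.round(bbox[3] * pageHeightPx)   // bottom edge (y1)
        };
    }

    public static void main(String[] args) {
        float[] bbox = {0.25f, 0.5f, 0.75f, 1.0f};
        // The same normalized box at two render sizes.
        System.out.println(Arrays.toString(toPixels(bbox, 800, 1000)));   // [200, 500, 600, 1000]
        System.out.println(Arrays.toString(toPixels(bbox, 1600, 2000)));  // [400, 1000, 1200, 2000]
    }
}
```

Doubling the render size doubles every pixel coordinate while the normalized box stays unchanged, which is what resolution independence means in practice.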
Constructor Details
PaginatedLocator
public PaginatedLocator(int page, float[] bbox)
Parameters:
page - 1-based page number
bbox - normalized bounding box [x0, y0, x1, y1], or null if the entire page is referenced
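Callers that start from absolute coordinates (pixels or points) need to normalize them before passing a bbox to this constructor. The sketch below shows one way to do that. `BboxNormalizer`, `normalize`, and `isValid` are hypothetical names, and the range and ordering checks are caller-side assumptions: this page does not state whether the constructor validates its arguments.

```java
// Hypothetical caller-side helper, not part of Tika.
final class BboxNormalizer {

    /** Divides absolute page coordinates by the page size to get [x0, y0, x1, y1] in [0, 1]. */
    static float[] normalize(float left, float top, float right, float bottom,
                             float pageWidth, float pageHeight) {
        if (pageWidth <= 0 || pageHeight <= 0) {
            throw new IllegalArgumentException("page dimensions must be positive");
        }
        float[] bbox = {
            left / pageWidth, top / pageHeight,
            right / pageWidth, bottom / pageHeight
        };
        if (!isValid(bbox)) {
            throw new IllegalArgumentException("box lies outside the page or is inverted");
        }
        return bbox;
    }

    /** Assumed sanity check: four values in [0, 1], top-left not past bottom-right. */
    static boolean isValid(float[] bbox) {
        if (bbox == null || bbox.length != 4) {
            return false;
        }
        for (float v : bbox) {
            if (!(v >= 0f && v <= 1f)) {  // written this way so NaN is rejected too
                return false;
            }
        }
        return bbox[0] <= bbox[2] && bbox[1] <= bbox[3];
    }

    public static void main(String[] args) {
        // A region on a 1000 x 800 page, in the same units as the page size.
        float[] bbox = normalize(100f, 200f, 300f, 400f, 1000f, 800f);
        System.out.println(java.util.Arrays.toString(bbox));
        // With Tika on the classpath, the result would be passed as:
        // new PaginatedLocator(3, bbox)
    }
}
```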
PaginatedLocator
public PaginatedLocator(int page)
Method Details
getPage
public int getPage()
getBbox
public float[] getBbox()
Returns:
normalized bbox [x0, y0, x1, y1], or null if whole page
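Since getBbox() may return null, callers should check for the whole-page case before indexing into the array. A minimal sketch follows: the `BboxDescriber` class and its `describe` method are hypothetical, and they take the page number and bbox as plain arguments so the example compiles without Tika on the classpath.

```java
import java.util.Locale;

// Hypothetical helper, not part of Tika: formats a page reference for display.
final class BboxDescriber {

    /**
     * Builds a human-readable description from the values a locator exposes,
     * for example describe(locator.getPage(), locator.getBbox()).
     */
    static String describe(int page, float[] bbox) {
        if (bbox == null) {
            // A null bbox means the whole page is referenced.
            return "page " + page + " (entire page)";
        }
        return String.format(Locale.ROOT,
                "page %d, region (%.2f, %.2f) to (%.2f, %.2f)",
                page, bbox[0], bbox[1], bbox[2], bbox[3]);
    }

    public static void main(String[] args) {
        System.out.println(describe(2, null));
        System.out.println(describe(1, new float[] {0.1f, 0.2f, 0.5f, 0.6f}));
    }
}
```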