Class PdfDocumentParser

java.lang.Object
    ai.doctruth.PdfDocumentParser

public final class PdfDocumentParser extends Object
Layer 1 entry point: reads a PDF file from disk into a ParsedDocument, preserving the source location of each detected layout block. PDFBox owns raw glyph extraction; PdfPageBlockExtractor owns page-level grouping and visual classification.
Since:
0.1.0
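
To illustrate what "source locations preserved per detected layout block" can mean in practice, here is a minimal, self-contained sketch. It is not the ai.doctruth API: the types TextLine and Block, the method groupIntoBlocks, and the gap threshold are hypothetical names invented for this example, and the vertical-gap rule is an assumed stand-in for whatever grouping PdfPageBlockExtractor actually performs. The sketch also assumes lines arrive in reading order with y coordinates increasing down the page.

```java
import java.util.ArrayList;
import java.util.List;

public class LayoutBlockSketch {

    /** Hypothetical: one extracted text line with its page and vertical extent. */
    record TextLine(int page, double top, double bottom, String text) {}

    /** Hypothetical: a layout block that keeps the page and extent it came from. */
    record Block(int page, double top, double bottom, String text) {}

    /**
     * Groups consecutive lines into blocks. A new block starts on a page change
     * or when the vertical gap to the previous line exceeds maxGap.
     * Assumes the input is already in reading order.
     */
    static List<Block> groupIntoBlocks(List<TextLine> lines, double maxGap) {
        List<Block> blocks = new ArrayList<>();
        Block current = null;
        for (TextLine line : lines) {
            boolean startsNew = current == null
                    || line.page() != current.page()
                    || line.top() - current.bottom() > maxGap;
            if (startsNew) {
                if (current != null) {
                    blocks.add(current);
                }
                current = new Block(line.page(), line.top(), line.bottom(), line.text());
            } else {
                // Extend the current block downward and append the line's text.
                current = new Block(current.page(), current.top(), line.bottom(),
                        current.text() + "\n" + line.text());
            }
        }
        if (current != null) {
            blocks.add(current);
        }
        return blocks;
    }

    public static void main(String[] args) {
        List<TextLine> lines = List.of(
                new TextLine(1, 72.0, 84.0, "Quarterly Report"),
                new TextLine(1, 110.0, 122.0, "Revenue grew in the third quarter."),
                new TextLine(1, 124.0, 136.0, "Costs stayed flat."),
                new TextLine(2, 72.0, 84.0, "Appendix"));
        for (Block b : groupIntoBlocks(lines, 6.0)) {
            System.out.println("page " + b.page() + ", y " + b.top() + " to " + b.bottom()
                    + ": " + b.text().replace("\n", " / "));
        }
    }
}
```

Because each Block carries its page number and vertical extent alongside its text, a downstream consumer can trace any block back to the region of the PDF it was read from, which is the property the class description attributes to ParsedDocument.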