Package org.apache.pdfbox.pdfparser


package org.apache.pdfbox.pdfparser
The pdfparser package contains classes to parse PDF documents and objects within the document.
  • Class
    Description
    This class is used to contain parsing logic that will be used by both the PDFParser and the COSStreamParser.
     
    PDFParser which first reads startxref and xref tables in order to know valid objects and parse only these objects.
    This will parse a PDF 1.5 object stream and extract all of the objects from the stream.
    This class will handle the parsing of the PDF document.
    This will parse a PDF byte stream and extract operands and such.
     
     
    This will parse a PDF 1.5 (or better) Xref stream and extract the xref information from the stream.
     
    This class will collect all XRef/trailer objects and creates correct xref/trailer information after all objects are read using startxref and 'Prev' information (unused XRef/trailer objects are discarded).