Class PDFHighlighter


public class PDFHighlighter extends PDFTextStripper
Highlighting of words in a PDF document with an XML file.
Version:
$Revision: 1.7 $
Author:
slagraulet (slagraulet@cardiweb.com), Ben Litchfield
See Also:
  • Constructor Details

    • PDFHighlighter

      public PDFHighlighter() throws IOException
      Default constructor.
      Throws:
      IOException - If there is an error constructing this class.
  • Method Details

    • generateXMLHighlight

      public void generateXMLHighlight(PDDocument pdDocument, String highlightWord, Writer xmlOutput) throws IOException
      Generate an XML highlight string based on the PDF.
      Parameters:
      pdDocument - The PDF to find words in.
      highlightWord - The word to search for.
      xmlOutput - The resulting output xml file.
      Throws:
      IOException - If there is an error reading from the PDF, or writing to the XML.
    • generateXMLHighlight

      public void generateXMLHighlight(PDDocument pdDocument, String[] sWords, Writer xmlOutput) throws IOException
      Generate an XML highlight string based on the PDF.
      Parameters:
      pdDocument - The PDF to find words in.
      sWords - The words to search for.
      xmlOutput - The resulting output xml file.
      Throws:
      IOException - If there is an error reading from the PDF, or writing to the XML.
    • endPage

      protected void endPage(PDPage pdPage) throws IOException
      End a page. Default implementation is to do nothing. Subclasses may provide additional information.
      Overrides:
      endPage in class PDFTextStripper
      Parameters:
      pdPage - The page we are about to process.
      Throws:
      IOException - If there is any error writing to the stream.
    • main

      public static void main(String[] args) throws IOException
      Command line application.
      Parameters:
      args - The command line arguments to the application.
      Throws:
      IOException - If there is an error generating the highlight file.