public final class PDFDocument extends Object
Lucene Field Name | Description |
path | File system path if loaded from a file |
url | URL to PDF document |
contents | Entire contents of PDF document, indexed but not stored |
summary | First 500 characters of content |
modified | The modified date/time according to the url or path |
uid | A unique identifier for the Lucene document. |
CreationDate | From PDF meta-data if available |
Creator | From PDF meta-data if available |
Keywords | From PDF meta-data if available |
ModificationDate | From PDF meta-data if available |
Producer | From PDF meta-data if available |
Subject | From PDF meta-data if available |
Trapped | From PDF meta-data if available |
Modifier and Type | Method and Description |
---|---|
static org.apache.lucene.document.Document |
getDocument(Resource res)
This will get a lucene document from a PDF file.
|
static org.apache.lucene.document.Document |
getDocument(StringBuffer content,
InputStream is)
This will get a lucene document from a PDF file.
|
public static org.apache.lucene.document.Document getDocument(StringBuffer content, InputStream is)
is
- The stream to read the PDF from.IOException
- If there is an error parsing or indexing the document.public static org.apache.lucene.document.Document getDocument(Resource res)
res
- The file to get the document for.IOException
- If there is an error parsing or indexing the document.Copyright © 2015 Lucee