| Class | Description |
|---|---|
| AccessChecker |
Checks whether or not a document allows extraction generally
or extraction for accessibility only.
|
| PDFMarkedContent2XHTML |
This was added in Tika 1.24 as an alpha version of a text extractor
that builds the text from the marked text tree and includes/normalizes
some of the structural tags.
|
| PDFParser |
PDF parser.
|
| PDFParserConfig |
Config for PDFParser.
|
| PDFPreflightParser | Deprecated
This will be removed in 2.x.
|
| Enum | Description |
|---|---|
| PDFParserConfig.OCR_STRATEGY |
Copyright © 2010 - 2023 Adobe. All Rights Reserved