| Interface | Description |
|---|---|
| ContentHandlerFactory | Interface to allow easier injection of code for getting a new ContentHandler. |
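
The factory idea above can be illustrated with a minimal, stand-alone sketch that uses only the JDK's SAX types. `HandlerFactorySketch`, `TextCollector`, and `FactoryDemo` are hypothetical names introduced for illustration; they are not the signatures of ContentHandlerFactory, which this page does not list.

```java
import java.util.ArrayList;
import java.util.List;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for a handler factory (assumption, not the real interface).
interface HandlerFactorySketch {
    ContentHandler newHandler();
}

// Simple handler that accumulates character data.
class TextCollector extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();

    @Override
    public void characters(char[] ch, int start, int length) {
        text.append(ch, start, length);
    }

    @Override
    public String toString() {
        return text.toString();
    }
}

class FactoryDemo {
    // Feeds each input string to a fresh handler obtained from the factory,
    // so no state leaks from one "document" to the next.
    static List<String> extractAll(HandlerFactorySketch factory, String... docs) {
        List<String> results = new ArrayList<>();
        for (String doc : docs) {
            ContentHandler handler = factory.newHandler();
            char[] chars = doc.toCharArray();
            try {
                handler.startDocument();
                handler.characters(chars, 0, chars.length);
                handler.endDocument();
            } catch (SAXException e) {
                throw new IllegalStateException(e);
            }
            results.add(handler.toString());
        }
        return results;
    }
}
```

Because the caller depends only on the factory, a different handler type can be injected (for example `FactoryDemo.extractAll(TextCollector::new, "alpha", "beta")`) without changing the processing loop.
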

| Class | Description |
|---|---|
| AbstractRecursiveParserWrapperHandler | Special handler to be used only with the RecursiveParserWrapper. |
| BasicContentHandlerFactory | Basic factory for creating common types of ContentHandlers. |
| BodyContentHandler | Content handler decorator that only passes everything inside the XHTML <body/> tag to the underlying handler. |
| CleanPhoneText | Class to help de-obfuscate phone numbers in text. |
| ContentHandlerDecorator | Decorator base class for the ContentHandler interface. |
| DIFContentHandler | |
| ElementMappingContentHandler | Content handler decorator that maps element QNames using a Map. |
| ElementMappingContentHandler.TargetElement | |
| EmbeddedContentHandler | Content handler decorator that prevents the EmbeddedContentHandler.startDocument() and EmbeddedContentHandler.endDocument() events from reaching the decorated handler. |
| EndDocumentShieldingContentHandler | Wrapper around a ContentHandler that ignores normal SAX calls to EndDocumentShieldingContentHandler.endDocument() and only fires them later. |
| ExpandedTitleContentHandler | Content handler decorator that wraps a TransformerHandler so that the TITLE tag renders as <title></title> rather than <title/>. This is accomplished by calling the ContentHandler.characters(char[], int, int) method with a length of 1 but a zero-length char array. |
| Link | |
| LinkContentHandler | Content handler that collects links from an XHTML document. |
| OfflineContentHandler | Content handler decorator that always returns an empty stream from the OfflineContentHandler.resolveEntity(String, String) method to prevent potential network or other external resources from being accessed by an XML parser. |
| PhoneExtractingContentHandler | Class used to extract phone numbers while parsing. |
| RecursiveParserWrapperHandler | Default implementation of AbstractRecursiveParserWrapperHandler. |
| RichTextContentHandler | Content handler for rich text. It extracts the alt attribute of XHTML <img/> tags and the name attribute of XHTML <a/> tags into the output. |
| SafeContentHandler | Content handler decorator that makes sure that the character events (SafeContentHandler.characters(char[], int, int) or SafeContentHandler.ignorableWhitespace(char[], int, int)) passed to the decorated content handler contain only valid XML characters. |
| SecureContentHandler | Content handler decorator that attempts to prevent denial of service attacks against Tika parsers. |
| StandardOrganizations | Provides a collection of the most important technical standard organizations. |
| StandardReference | Class that represents a standard reference. |
| StandardReference.StandardReferenceBuilder | |
| StandardsExtractingContentHandler | Content handler used to extract standard references while parsing. |
| StandardsText | StandardsText relies on regular expressions to extract standard references from text. |
| TaggedContentHandler | Content handler decorator that tags potential exceptions so that the handler that caused the exception can easily be identified. |
| TeeContentHandler | Content handler proxy that forwards the received SAX events to zero or more underlying content handlers. |
| TextContentHandler | Content handler decorator that only passes the TextContentHandler.characters(char[], int, int) and TextContentHandler.ignorableWhitespace(char[], int, int) events (plus TextContentHandler.startDocument() and TextContentHandler.endDocument()) to the decorated content handler. |
| ToHTMLContentHandler | SAX event handler that serializes the HTML document to a character stream. |
| ToTextContentHandler | SAX event handler that writes all character content out to a character stream. |
| ToXMLContentHandler | SAX event handler that serializes the XML document to a character stream. |
| WriteOutContentHandler | SAX event handler that writes content up to an optional write limit out to a character stream or other decorated handler. |
| XHTMLContentHandler | Content handler decorator that simplifies the task of producing XHTML events for Tika content parsers. |
| XMPContentHandler | Content handler decorator that simplifies the task of producing XMP output. |
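
Many of the classes above are decorators: they wrap another ContentHandler and filter or transform the SAX events before passing them on. The following is a minimal sketch of that pattern using only JDK classes. `ForwardingHandler`, `BodyOnlyHandler`, `TextSink`, and `DecoratorDemo` are hypothetical names, and the depth-counting filter is an illustrative assumption rather than how BodyContentHandler is actually implemented.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Decorator base: forwards the events used in this sketch to a delegate.
class ForwardingHandler extends DefaultHandler {
    protected final ContentHandler delegate;

    ForwardingHandler(ContentHandler delegate) {
        this.delegate = delegate;
    }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts)
            throws SAXException {
        delegate.startElement(uri, local, qName, atts);
    }

    @Override
    public void endElement(String uri, String local, String qName) throws SAXException {
        delegate.endElement(uri, local, qName);
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        delegate.characters(ch, start, length);
    }
}

// Forwards only the events that occur inside the <body> element.
class BodyOnlyHandler extends ForwardingHandler {
    private int depth = 0; // 0 = outside <body>, 1 = directly inside <body>

    BodyOnlyHandler(ContentHandler delegate) {
        super(delegate);
    }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts)
            throws SAXException {
        if (depth > 0) {
            depth++;
            super.startElement(uri, local, qName, atts);
        } else if ("body".equals(qName)) {
            depth = 1; // the <body> tag itself is not forwarded
        }
    }

    @Override
    public void endElement(String uri, String local, String qName) throws SAXException {
        if (depth > 1) {
            super.endElement(uri, local, qName);
            depth--;
        } else if (depth == 1) {
            depth = 0; // leaving <body>
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        if (depth > 0) {
            super.characters(ch, start, length);
        }
    }
}

// Innermost handler: collects whatever character data reaches it.
class TextSink extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();

    @Override
    public void characters(char[] ch, int start, int length) {
        text.append(ch, start, length);
    }

    @Override
    public String toString() {
        return text.toString();
    }
}

class DecoratorDemo {
    // Parses an XHTML string and returns only the text found inside <body>.
    static String bodyText(String xhtml) {
        TextSink sink = new TextSink();
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xhtml)), new BodyOnlyHandler(sink));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
        return sink.toString();
    }
}
```

Decorators built this way can be stacked, since each one only needs a ContentHandler to delegate to. The sink never sees the title text because the filter drops every event that arrives outside the body element.
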

| Enum | Description |
|---|---|
| BasicContentHandlerFactory.HANDLER_TYPE | Common handler types for content. |

| Exception | Description |
|---|---|
| TaggedSAXException | A SAXException wrapper that tags the wrapped exception with a given object reference. |
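
The tagging idea behind TaggedSAXException and TaggedContentHandler can be sketched with plain JDK classes as follows. `TaggedExceptionSketch`, `TaggingHandler`, and `TaggedDemo` are hypothetical names, and the sketch only tags `characters` events; it is an assumption-laden illustration, not the library's implementation.

```java
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Wraps a SAXException together with an object reference that identifies its origin.
class TaggedExceptionSketch extends SAXException {
    private final Object tag;

    TaggedExceptionSketch(SAXException cause, Object tag) {
        super(cause.getMessage(), cause);
        this.tag = tag;
    }

    Object getTag() {
        return tag;
    }
}

// Decorator that tags exceptions thrown by its delegate with itself.
class TaggingHandler extends DefaultHandler {
    private final ContentHandler delegate;

    TaggingHandler(ContentHandler delegate) {
        this.delegate = delegate;
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        try {
            delegate.characters(ch, start, length);
        } catch (SAXException e) {
            throw new TaggedExceptionSketch(e, this);
        }
    }

    // True if the exception was tagged by this particular handler instance.
    boolean isCauseOf(SAXException e) {
        return e instanceof TaggedExceptionSketch
                && ((TaggedExceptionSketch) e).getTag() == this;
    }
}

class TaggedDemo {
    // Sends one event through two tagged handlers and reports which one failed.
    static String identify() {
        ContentHandler failing = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) throws SAXException {
                throw new SAXException("boom");
            }
        };
        TaggingHandler first = new TaggingHandler(new DefaultHandler());
        TaggingHandler second = new TaggingHandler(failing);
        char[] data = "x".toCharArray();
        try {
            first.characters(data, 0, data.length);
            second.characters(data, 0, data.length);
            return "none";
        } catch (SAXException e) {
            if (first.isCauseOf(e)) {
                return "first";
            }
            if (second.isCauseOf(e)) {
                return "second";
            }
            return "unknown";
        }
    }
}
```

Comparing the tag by identity lets the caller tell apart failures from several handlers even when the underlying exceptions carry the same message.
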
Copyright © 2010 - 2023 Adobe. All Rights Reserved