public class ToTextContentHandler
extends org.xml.sax.helpers.DefaultHandler
As of Tika 1.20, this handler ignores content within <script> and <style> tags.
| Constructor and Description |
|---|
ToTextContentHandler()
Creates a content handler that writes character events
to an internal string buffer.
|
ToTextContentHandler(java.io.OutputStream stream)
Creates a content handler that writes character events to
the given output stream using the platform default encoding.
|
ToTextContentHandler(java.io.OutputStream stream,
java.lang.String encoding)
Creates a content handler that writes character events to
the given output stream using the given encoding.
|
ToTextContentHandler(java.io.Writer writer)
Creates a content handler that writes character events to
the given writer.
|
| Modifier and Type | Method and Description |
|---|---|
void |
characters(char[] ch,
int start,
int length)
Writes the given characters to the given character stream.
|
void |
endDocument()
Flushes the character stream so that no characters are forgotten
in internal buffers.
|
void |
endElement(java.lang.String uri,
java.lang.String localName,
java.lang.String qName) |
void |
ignorableWhitespace(char[] ch,
int start,
int length)
Writes the given ignorable characters to the given character stream.
|
void |
startElement(java.lang.String uri,
java.lang.String localName,
java.lang.String qName,
org.xml.sax.Attributes atts) |
java.lang.String |
toString()
Returns the contents of the internal string buffer where
all the received characters have been collected.
|
public ToTextContentHandler(java.io.Writer writer)
writer - writerpublic ToTextContentHandler(java.io.OutputStream stream)
stream - output streampublic ToTextContentHandler(java.io.OutputStream stream,
java.lang.String encoding)
throws java.io.UnsupportedEncodingException
stream - output streamencoding - output encodingjava.io.UnsupportedEncodingException - if the encoding is unsupportedpublic ToTextContentHandler()
toString()
method to access the collected character content.public void characters(char[] ch,
int start,
int length)
throws org.xml.sax.SAXException
characters in interface org.xml.sax.ContentHandlercharacters in class org.xml.sax.helpers.DefaultHandlerorg.xml.sax.SAXExceptionpublic void ignorableWhitespace(char[] ch,
int start,
int length)
throws org.xml.sax.SAXException
characters(char[], int, int) method.ignorableWhitespace in interface org.xml.sax.ContentHandlerignorableWhitespace in class org.xml.sax.helpers.DefaultHandlerorg.xml.sax.SAXExceptionpublic void endDocument()
throws org.xml.sax.SAXException
endDocument in interface org.xml.sax.ContentHandlerendDocument in class org.xml.sax.helpers.DefaultHandlerorg.xml.sax.SAXException - if the stream can not be flushedpublic void startElement(java.lang.String uri,
java.lang.String localName,
java.lang.String qName,
org.xml.sax.Attributes atts)
throws org.xml.sax.SAXException
startElement in interface org.xml.sax.ContentHandlerstartElement in class org.xml.sax.helpers.DefaultHandlerorg.xml.sax.SAXExceptionpublic void endElement(java.lang.String uri,
java.lang.String localName,
java.lang.String qName)
throws org.xml.sax.SAXException
endElement in interface org.xml.sax.ContentHandlerendElement in class org.xml.sax.helpers.DefaultHandlerorg.xml.sax.SAXExceptionpublic java.lang.String toString()
StringWriter to the
other constructor.toString in class java.lang.ObjectCopyright © 2010 - 2023 Adobe. All Rights Reserved