Constructs an element object.
Constructs an element object.
Constructs an element object.
Constructs an element object.
Returns true if the specified element is accepted.
Unsynchronized list of elements.
Returns true if the specified element should be removed.
This class is a document filter capable of removing specified
elements from the processing stream.
Element information as a contiguous list.
Element information organized by first letter.
Returns a set of empty attributes.
Returns true if the objects are equal.
Property identifier: error handler.
Defines an error reporter for reporting HTML errors.
Expanded system identifier.
Expands a system id and returns the system id as a URI, if
it can be expanded.
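The expansion described above can be sketched with the JDK's java.net.URI alone. This is a minimal illustration, not the library's implementation: the class name SystemIdExpander, the expand method, and the choice to return null when expansion fails are all assumptions made for the example.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper: resolves a system id against a base system id.
final class SystemIdExpander {

    /** Returns the expanded id as a URI string, or null if it cannot be expanded. */
    static String expand(String systemId, String baseSystemId) {
        if (systemId == null || systemId.isEmpty()) {
            return null;
        }
        try {
            URI uri = new URI(systemId);
            if (uri.isAbsolute()) {
                return uri.toString();      // already a full URI
            }
            if (baseSystemId == null) {
                return null;                // relative id with nothing to resolve against
            }
            return new URI(baseSystemId).resolve(uri).toString();
        } catch (URISyntaxException e) {
            return null;                    // not a valid URI reference
        }
    }

    public static void main(String[] args) {
        System.out.println(expand("img/logo.png", "http://example.org/docs/index.html"));
    }
}
```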
Include infoset augmentations.
Length of bytes read into byte buffer.
Offset into byte buffer during playback.
The playback byte stream.
The current entity stack.
Doctype declaration public identifier.
Doctype declaration system identifier.
Document fragment balancing only.
DTD content model handler.
Name of element whose content needs to be scanned as text.
Fix Microsoft Windows® character entity references.
Auto-detected IANA encoding.
Ignore specified character set.
Filters property identifier.
Filters property identifier (static field FILTERS in class org.cyberneko.html.filters.Writer).
True if within a CDATA section.
True if inside a CDATA section.
Insert document type declaration.
True if the encoding matches "ISO-8859-*".
Fix Microsoft Windows® character entity references.
Fixes a platform dependent filename to standard URI form.
Fixes Microsoft Windows® specific characters.
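One common way to fix Windows-specific characters is to remap code points 0x80 through 0x9F, which some authors write as numeric references, to the Unicode characters that windows-1252 assigns to those bytes. The sketch below assumes that is the intended fix; the class WindowsCharFixer and its fix method are hypothetical names, and the table is the standard windows-1252 mapping rather than anything taken from the library.

```java
// Hypothetical helper: remaps windows-1252 code points 0x80-0x9F to Unicode.
final class WindowsCharFixer {

    // Index 0 is 0x80. A value of 0 marks a byte that windows-1252 leaves undefined.
    private static final char[] CP1252 = {
        0x20AC, 0,      0x201A, 0x0192, 0x201E, 0x2026, 0x2020, 0x2021,
        0x02C6, 0x2030, 0x0160, 0x2039, 0x0152, 0,      0x017D, 0,
        0,      0x2018, 0x2019, 0x201C, 0x201D, 0x2022, 0x2013, 0x2014,
        0x02DC, 0x2122, 0x0161, 0x203A, 0x0153, 0,      0x017E, 0x0178
    };

    /** Returns the Unicode code point for a Windows-specific value, else the input unchanged. */
    static int fix(int codePoint) {
        if (codePoint < 0x80 || codePoint > 0x9F) {
            return codePoint;
        }
        char mapped = CP1252[codePoint - 0x80];
        return mapped == 0 ? codePoint : mapped;
    }

    public static void main(String[] args) {
        // &#147; is often intended as a left double quotation mark.
        System.out.println(Integer.toHexString(fix(147)));
    }
}
```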
Auto-detected Java encoding.
The levels of the entries.
Modify HTML attribute names.
Modify HTML attribute names.
Modify HTML attribute names.
Modify HTML element names.
Modify HTML element names.
Modify HTML element names.
Normalize character content.
Notify character entity references.
Notify HTML built-in general entity references.
Notify XML built-in general entity references.
Format message without reporting error.
Format message without reporting error.
Override doctype declaration public and system identifiers.
The print writer used for serializing the document with the
appropriate character encoding.
Public identifier of doctype declaration.
The element depth at element removal.
The current scanner state.
True if the current element is a <script> element.
Strip CDATA delimiters from SCRIPT tags.
Strip comment delimiters from SCRIPT tags.
True if seen <body> element.
True if root element has been seen.
True if the doctype declaration was seen.
True if seen <head> element.
Seen http-equiv directive.
True if root element has been seen.
True if root element was seen.
True if seen the end of the document element.
Special scanner used for elements whose content needs to be scanned
as plain text, ignoring markup such as elements and entity references.
Strip CDATA delimiters from STYLE tags.
Strip comment delimiters from STYLE tags.
Synthesized namespace binding count.
System identifier of doctype declaration.
True if the current element is a <textarea> element.
Returns the name associated with the given character, or null if
the character is not known.
Returns the character associated with the given entity name, or
-1 if the name is not known.
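The two lookups above (name to character, with -1 for unknown names, and character to name, with null for unknown characters) can be pictured as a pair of maps. The following is a small sketch under that assumption; EntityTable and its three sample entries are illustrative and do not reproduce the library's entity list.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical two-way entity table with the same return conventions.
final class EntityTable {
    private final Map<String, Integer> byName = new HashMap<>();
    private final Map<Integer, String> byChar = new HashMap<>();

    EntityTable() {
        add("amp", '&');
        add("lt", '<');
        add("nbsp", 0x00A0);
    }

    private void add(String name, int c) {
        byName.put(name, c);
        byChar.put(c, name);
    }

    /** Returns the character for the entity name, or -1 if the name is not known. */
    int get(String name) {
        Integer c = byName.get(name);
        return c == null ? -1 : c;
    }

    /** Returns the entity name for the character, or null if the character is not known. */
    String get(int c) {
        return byChar.get(c);
    }

    public static void main(String[] args) {
        EntityTable table = new EntityTable();
        System.out.println(table.get("amp") + " " + table.get('<'));
    }
}
```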
Returns the base system identifier.
Returns the column number of the beginning of this event.
Returns the column number of the beginning of this event.
Returns the column number of the beginning of this event.
Returns the line number of the beginning of this event.
Returns the line number of the beginning of this event.
Returns the line number of the beginning of this event.
Returns the current character offset.
Returns the current column number.
Get declared prefix count.
Returns the document handler.
Returns the document handler.
Returns the document handler.
Returns the document handler.
Returns the document source.
Returns the document source.
Returns the document source.
Returns the DTD content model handler.
Returns the element information for the specified element code.
Returns the element information for the specified element name.
Returns the element information for the specified element name.
Returns the depth of the open tag associated with the specified
element name or -1 if no matching element is found.
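A depth lookup of this kind can be sketched as a search from the top of the open-element stack downward. The example assumes that depth is counted from the innermost open element (1 means the top of the stack) and that names are compared case-insensitively; both conventions, and the OpenElements class itself, are assumptions for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stack of currently open element names.
final class OpenElements {
    private final List<String> names = new ArrayList<>();

    void push(String name) {
        names.add(name.toLowerCase(Locale.ROOT));
    }

    String pop() {
        return names.remove(names.size() - 1);
    }

    /** Returns 1 for the innermost open element, 2 for its parent, and so on; -1 if not open. */
    int depthOf(String name) {
        String wanted = name.toLowerCase(Locale.ROOT);
        for (int i = names.size() - 1; i >= 0; i--) {
            if (names.get(i).equals(wanted)) {
                return names.size() - i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        OpenElements open = new OpenElements();
        open.push("html");
        open.push("body");
        System.out.println(open.depthOf("html"));
    }
}
```

A tag balancer can use such a depth to decide how many open elements to close when it meets an end tag that does not match the innermost element.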
Returns the column number of the end of this event.
Returns the column number of the end of this event.
Returns the column number of the end of this event.
Returns the line number of the end of this event.
Returns the line number of the end of this event.
Returns the line number of the end of this event.
Returns the entity resolver.
Return the current error handler.
Returns the error handler.
Returns the expanded system identifier.
Query the state of a feature.
Returns the default state for a feature, or null if this
component does not want to report a default value for this
feature.
Returns the default state for a feature, or null if this
component does not want to report a default value for this
feature.
Returns the default state for a feature.
Returns the default state for a feature.
Returns the default state for a feature, or null if this
component does not want to report a default value for this
feature.
Get information identified by a key from the Augmentations structure.
Returns the current line number.
Returns the literal system identifier.
Converts HTML names string value to constant value.
Converts HTML names string value to constant value.
Converts HTML names string value to constant value.
Returns the depth of the open tag associated with the specified
element parent names or -1 if no matching element is found.
Query the value of a property.
Returns the default value for a property, or null if this
component does not want to report a default value for this
property.
Returns the default value for a property, or null if this
component does not want to report a default value for this
property.
Returns the default value for a property.
Returns the default value for a property.
Returns the default value for a property, or null if this
component does not want to report a default value for this
property.
Returns the public identifier.
Returns a list of feature identifiers that are recognized by
this component.
Returns recognized features.
Returns recognized features.
Returns a list of feature identifiers that are recognized by
this component.
Returns a list of property identifiers that are recognized by
this component.
Returns recognized properties.
Returns recognized properties.
Returns a list of property identifiers that are recognized by
this component.
Returns the value of the specified attribute, ignoring case.
Returns a hash code for this object.
HTML 4.01 frameset public identifier ("-//W3C//DTD HTML 4.01 Frameset//EN").
HTML 4.01 frameset system identifier ("http://www.w3.org/TR/html4/frameset.dtd").
HTML 4.01 strict public identifier ("-//W3C//DTD HTML 4.01//EN").
HTML 4.01 strict system identifier ("http://www.w3.org/TR/html4/strict.dtd").
HTML 4.01 transitional public identifier ("-//W3C//DTD HTML 4.01 Transitional//EN").
HTML 4.01 transitional system identifier ("http://www.w3.org/TR/html4/loose.dtd").
This class is here to overcome the XNI changes to the Augmentations
interface.
This interface extends the XNI XMLComponent interface
to add methods that allow the preferred default values for features
and properties to be queried.
An XNI-based parser configuration that can be used to parse HTML
documents.
Collection of HTML element information.
Pre-defined HTML entities.
Defines an error reporter for reporting HTML errors.
This interface is used to pass augmented information to the
application through the XNI pipeline.
Balances tags in an HTML document.
This filter restores the original document event stream generated
by the HTML scanner by removing the events that are synthesized
by the tag balancer.
Ignore the charset specified in the <meta http-equiv='Content-Type'
content='text/html;charset=…'> tag.
Element info for each start element.
Creates an element information object.
Creates an element information object.
Unsynchronized stack of element information.
Insert document type declaration.
Insert namespace binding URIs.
Returns true if this element is a block element.
Returns true if this element is a container element.
Returns true if this element is an empty element.
Returns true if this element is an inline element.
Returns true if this element is special -- if its content
should be parsed ignoring markup.
Returns true if the corresponding event was synthesized.
Returns true if the corresponding event was synthesized.
Returns true if the corresponding event was synthesized.
Modify HTML attribute names: { "upper", "lower", "default" }.
Modify HTML attribute names: { "upper", "lower", "default" }.
Modify HTML attribute names: { "upper", "lower", "default" }.
Modify HTML attribute names: { "upper", "lower", "default" }.
Modify HTML element names: { "upper", "lower", "default" }.
Modify HTML element names: { "upper", "lower", "default" }.
Modify HTML element names: { "upper", "lower", "default" }.
Modify HTML element names: { "upper", "lower", "default" }.
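The three settings listed above can be read as a simple case transformation applied to each name. The sketch below assumes that "default" leaves the name as it was written; that reading, and the NameCase class, are assumptions for illustration rather than documented behavior.

```java
import java.util.Locale;

// Hypothetical helper applying an "upper", "lower", or "default" name setting.
final class NameCase {

    /** Returns the name transformed according to the mode; unknown modes are rejected. */
    static String modify(String name, String mode) {
        switch (mode) {
            case "upper":
                return name.toUpperCase(Locale.ROOT);
            case "lower":
                return name.toLowerCase(Locale.ROOT);
            case "default":
                return name;    // assumption: keep the name as written
            default:
                throw new IllegalArgumentException("unknown mode: " + mode);
        }
    }

    public static void main(String[] args) {
        System.out.println(modify("Div", "lower"));
    }
}
```

Locale.ROOT is used so that the result does not depend on the platform locale (for example, the Turkish dotless i).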
Match HTML element names.
This filter binds namespaces if namespace processing is turned on
by setting the feature "http://xml.org/sax/features/namespaces"
to true.
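The feature identifier named above is the standard SAX2 namespaces feature. As an illustration of how it is switched on through the SAX API, the sketch below uses the XML parser bundled with the JDK as a stand-in (not the HTML parser this index documents); NamespaceFeatureDemo and rootNamespace are hypothetical names.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo: enables namespace processing and reports the root element's namespace.
final class NamespaceFeatureDemo {
    static final String NAMESPACES = "http://xml.org/sax/features/namespaces";

    static String rootNamespace(String xml) {
        final String[] uri = { null };
        try {
            XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            reader.setFeature(NAMESPACES, true);    // turn namespace processing on
            reader.setContentHandler(new DefaultHandler() {
                @Override
                public void startElement(String ns, String local, String qName, Attributes atts) {
                    if (uri[0] == null) {
                        uri[0] = ns;                // remember only the root element's URI
                    }
                }
            });
            reader.parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
        return uri[0];
    }

    public static void main(String[] args) {
        System.out.println(rootNamespace("<a xmlns='urn:example'/>"));
    }
}
```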
This namespace context object implements the old and new XNI
NamespaceContext interface methods so that it can
be used across all versions of Xerces2.
Notify character entity references (e.g.
Notify character entity references.
Notify handler of built-in entity references (e.g.
Notify built-in entity references.
Notify handler of built-in entity references (e.g.
A SAX parser for HTML documents.
Scans part of the document.
Scans an attribute, pseudo or real.
Scans an entity reference.
Scans a processing instruction.
Scans a pseudo attribute.
Strip XHTML CDATA delimiters ("<![CDATA[" and "]]>") from
SCRIPT tag contents.
Strip HTML comment delimiters ("<!--" and
"-->") from SCRIPT tag contents.
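Stripping either pair of delimiters amounts to removing a known prefix and suffix from the element's text content. The sketch below assumes the delimiters are removed only when both are present around the trimmed content, and that the text is otherwise returned unchanged; DelimiterStripper and its methods are hypothetical.

```java
// Hypothetical helper: strips CDATA or comment delimiters that wrap script/style content.
final class DelimiterStripper {

    static String stripCdata(String text) {
        return strip(text, "<![CDATA[", "]]>");
    }

    static String stripComment(String text) {
        return strip(text, "<!--", "-->");
    }

    /** Removes open/close only when both wrap the trimmed text; otherwise returns text as is. */
    private static String strip(String text, String open, String close) {
        String trimmed = text.trim();
        boolean wrapped = trimmed.length() >= open.length() + close.length()
                && trimmed.startsWith(open)
                && trimmed.endsWith(close);
        if (!wrapped) {
            return text;
        }
        return trimmed.substring(open.length(), trimmed.length() - close.length());
    }

    public static void main(String[] args) {
        System.out.println(stripCdata("<![CDATA[var x = 1;]]>"));
    }
}
```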
Reverse mapping from characters to names.
Sets the document handler.
Sets the document handler.
Sets the document handler.
Sets the document handler.
Sets the document source.
Sets the document source.
Sets the document source.
Sets the DTD content model handler.
Sets the entity resolver.
Allow an application to register an error event handler.
Sets the state of a feature.
Set the state of any feature in a SAX2 parser.
Sets the input source for the document to parse.
Sets the value of a property.
Set the value of any property in a SAX2 parser.
Sets the values of this item.
Returns true if the specified text is present and is skipped.
Skips newlines and returns the number of newlines skipped.
Skips newlines and returns the number of newlines skipped.
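A newline-skipping routine of this shape can be sketched as a loop that advances past line terminators and counts them. The example assumes that "\r\n" counts as a single newline and that a lone "\r" or "\n" counts as one each; NewlineSkipper and countNewlines are hypothetical names, and the sketch only returns the count rather than updating scanner state.

```java
// Hypothetical helper: counts consecutive newlines starting at an offset.
final class NewlineSkipper {

    /** Returns the number of newlines found at offset; "\r\n" is counted once. */
    static int countNewlines(CharSequence text, int offset) {
        int count = 0;
        int i = offset;
        while (i < text.length()) {
            char c = text.charAt(i);
            if (c == '\r') {
                i++;
                if (i < text.length() && text.charAt(i) == '\n') {
                    i++;        // treat CR LF as one newline
                }
                count++;
            } else if (c == '\n') {
                i++;
                count++;
            } else {
                break;          // first non-newline character ends the run
            }
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(countNewlines("\r\n\nabc", 0));
    }
}
```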
Special scanner used for elements whose content needs to be scanned
as plain text, ignoring markup such as elements and entity references.
Strip XHTML CDATA delimiters ("<![CDATA[" and "]]>") from
STYLE tag contents.
Strip HTML comment delimiters ("<!--" and
"-->") from STYLE tag contents.
Synthesize namespace binding.
Returns true if the information provided is synthesized.
Synthesized event info item.
Synthesized event info item.
Synthesized event info item.
Synthesized namespace binding prefix.
Returns an augmentations object with a synthesized item added.
Returns an augmentations object with a synthesized item added.
Returns an augmentations object with a synthesized item added.
Synthesized infoset item.