au.id.jericho.lib.html
Class NumericCharacterReference
All Implemented Interfaces: CharSequence, Comparable
public class NumericCharacterReference
static String | encode(CharSequence unencodedText) - Encodes the specified text, escaping special characters into numeric character references.
static String | encodeDecimal(CharSequence unencodedText) - Encodes the specified text, escaping special characters into decimal character references.
static String | encodeHexadecimal(CharSequence unencodedText) - Encodes the specified text, escaping special characters into hexadecimal character references.
String | getCharacterReferenceString() - Returns the correct encoded form of this numeric character reference.
static String | getCharacterReferenceString(int codePoint) - Returns the numeric character reference encoded form of the specified unicode code point.
String | getDebugInfo() - Returns a string representation of this object useful for debugging purposes.
boolean | isDecimal() - Indicates whether this numeric character reference specifies the unicode code point in decimal format.
boolean | isHexadecimal() - Indicates whether this numeric character reference specifies the unicode code point in hexadecimal format.
Methods inherited from class CharacterReference: decode, decode, decodeCollapseWhiteSpace, encode, encodeWithWhiteSpaceFormatting, getChar, getCharacterReferenceString, getCharacterReferenceString, getCodePoint, getCodePointFromCharacterReferenceString, getDecimalCharacterReferenceString, getDecimalCharacterReferenceString, getHexadecimalCharacterReferenceString, getHexadecimalCharacterReferenceString, getUnicodeText, getUnicodeText, isTerminated, parse, reencode, requiresEncoding
Methods inherited from class Segment: charAt, compareTo, encloses, encloses, equals, extractText, extractText, findAllCharacterReferences, findAllComments, findAllElements, findAllElements, findAllElements, findAllStartTags, findAllStartTags, findAllStartTags, findAllTags, findAllTags, findFormControls, findFormFields, findWords, getBegin, getChildElements, getDebugInfo, getEnd, getSourceText, getSourceTextNoWhitespace, hashCode, ignoreWhenParsing, isComment, isWhiteSpace, isWhiteSpace, length, parseAttributes, subSequence, toString
encode
public static String encode(CharSequence unencodedText)
Encodes the specified text, escaping special characters into numeric character references.
- encode in class CharacterReference
Parameters: unencodedText - the text to encode.
encodeDecimal
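public static String encodeDecimal(CharSequence unencodedText)
Encodes the specified text, escaping special characters into decimal character references.
Parameters: unencodedText - the text to encode.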
encodeHexadecimal
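public static String encodeHexadecimal(CharSequence unencodedText)
Encodes the specified text, escaping special characters into hexadecimal character references.
Parameters: unencodedText - the text to encode.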
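The difference between the decimal and hexadecimal encodings can be illustrated with a minimal standalone sketch that does not use the library. The class NumericEncodeSketch and its methods are hypothetical, and the rule for which characters count as "special" (non-ASCII plus &, <, > and the double quote) is an assumption; the library's own rule may differ, as may its choice of lowercase hexadecimal digits.

```java
final class NumericEncodeSketch {
    // Assumption: a character needs escaping if it is non-ASCII or one of & < > "
    // The library's actual criterion is not shown here and may differ.
    static boolean needsEncoding(int codePoint) {
        return codePoint > 127 || codePoint == '&' || codePoint == '<'
            || codePoint == '>' || codePoint == '"';
    }

    // Escapes special characters as decimal (hex == false) or
    // hexadecimal (hex == true) numeric character references.
    static String encode(CharSequence text, boolean hex) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = Character.codePointAt(text, i);
            i += Character.charCount(cp); // step over surrogate pairs as one code point
            if (!needsEncoding(cp)) {
                out.appendCodePoint(cp);
            } else if (hex) {
                out.append("&#x").append(Integer.toHexString(cp)).append(';');
            } else {
                out.append("&#").append(cp).append(';');
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(encode("a < b", false)); // prints a &#60; b
        System.out.println(encode("a < b", true));  // prints a &#x3c; b
    }
}
```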
getCharacterReferenceString
public String getCharacterReferenceString()
Returns the correct encoded form of this numeric character reference.
The returned string uses the same radix as the original character reference in the source document, i.e. decimal format if isDecimal() is true, and hexadecimal format if isHexadecimal() is true.
Note that the returned string is not necessarily the same as the original source text used to create this object. This library recognises certain invalid forms of character references, as detailed in the decode(CharSequence) method.
To retrieve the original source text, use the toString() method instead.
Example: CharacterReference.parse("&#62").getCharacterReferenceString() returns "&#62;"
- getCharacterReferenceString in class CharacterReference
Returns: the correct encoded form of this numeric character reference.
See Also: CharacterReference.getCharacterReferenceString(int codePoint)
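The radix-preserving behaviour described for this method can be sketched without the library. The helper below is hypothetical: it accepts a numeric character reference that may be missing its terminating semicolon and re-emits it in terminated form using the same radix. Treating an unterminated reference as the only tolerated invalid form, and emitting lowercase hexadecimal digits, are assumptions rather than a statement of what the library does.

```java
final class NormalizeReferenceSketch {
    // Hypothetical helper: returns a terminated numeric character reference
    // in the same radix (decimal or hexadecimal) as the source text.
    static String normalize(String source) {
        if (!source.startsWith("&#")) {
            throw new IllegalArgumentException("not a numeric character reference: " + source);
        }
        String digits = source.substring(2);
        if (digits.endsWith(";")) {
            digits = digits.substring(0, digits.length() - 1); // drop the terminator if present
        }
        boolean hex = digits.startsWith("x") || digits.startsWith("X");
        int codePoint = hex
            ? Integer.parseInt(digits.substring(1), 16)
            : Integer.parseInt(digits, 10);
        // Assumption: lowercase "x" and lowercase hex digits in the output.
        return hex
            ? "&#x" + Integer.toHexString(codePoint) + ";"
            : "&#" + codePoint + ";";
    }

    public static void main(String[] args) {
        System.out.println(normalize("&#62"));   // prints &#62;
        System.out.println(normalize("&#X3E"));  // prints &#x3e;
    }
}
```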
getCharacterReferenceString
public static String getCharacterReferenceString(int codePoint)
Returns the numeric character reference encoded form of the specified unicode code point.
This method returns the character reference in decimal format, and is exactly the same as calling getDecimalCharacterReferenceString(int codePoint).
To get either the character entity reference or numeric character reference, use the CharacterReference.getCharacterReferenceString(int codePoint) method instead.
To get the character reference in hexadecimal format, use the getHexadecimalCharacterReferenceString(int codePoint) method instead.
Examples:
NumericCharacterReference.getCharacterReferenceString(62) returns "&#62;"
NumericCharacterReference.getCharacterReferenceString('>') returns "&#62;"
- getCharacterReferenceString in class CharacterReference
Returns: the numeric character reference encoded form of the specified unicode code point.
See Also: CharacterReference.getCharacterReferenceString(int codePoint)
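For orientation, the two numeric formats for a single code point can be produced with plain string concatenation. This is a standalone sketch with hypothetical method names, not the library's implementation; the lowercase hexadecimal output is an assumption.

```java
final class CodePointReferenceSketch {
    // Decimal character reference, e.g. 62 -> "&#62;"
    static String decimalReference(int codePoint) {
        return "&#" + codePoint + ";";
    }

    // Hexadecimal character reference, e.g. 62 -> "&#x3e;"
    // Assumption: lowercase hex digits.
    static String hexadecimalReference(int codePoint) {
        return "&#x" + Integer.toHexString(codePoint) + ";";
    }

    public static void main(String[] args) {
        System.out.println(decimalReference(62));      // prints &#62;
        System.out.println(decimalReference('>'));     // prints &#62; (the char widens to int 62)
        System.out.println(hexadecimalReference(62));  // prints &#x3e;
    }
}
```

Because the parameter is an int code point rather than a char, the same approach covers supplementary characters outside the Basic Multilingual Plane.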
getDebugInfo
public String getDebugInfo()
Returns a string representation of this object useful for debugging purposes.
- getDebugInfo in class Segment
Returns: a string representation of this object useful for debugging purposes.
isDecimal
public boolean isDecimal()
Indicates whether this numeric character reference specifies the unicode code point in decimal format.
A numeric character reference in decimal format is referred to in this library as a decimal character reference.
Returns: true if this numeric character reference specifies the unicode code point in decimal format, otherwise false.
isHexadecimal
public boolean isHexadecimal()
Indicates whether this numeric character reference specifies the unicode code point in hexadecimal format.
A numeric character reference in hexadecimal format is referred to in this library as a hexadecimal character reference.
Returns: true if this numeric character reference specifies the unicode code point in hexadecimal format, otherwise false.
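The decimal/hexadecimal distinction can be read from the prefix of the reference text: "&#x" (or "&#X") introduces hexadecimal digits, while "&#" followed directly by digits introduces decimal ones. The sketch below is a hypothetical standalone check on a string and only inspects the prefix; it does not validate the digits and is not how the library itself determines the result.

```java
final class RadixCheckSketch {
    // True if the text starts with "&#x" or "&#X" (case-insensitive comparison).
    static boolean isHexadecimal(String reference) {
        return reference.regionMatches(true, 0, "&#x", 0, 3);
    }

    // True if the text starts with "&#" but is not a hexadecimal reference.
    static boolean isDecimal(String reference) {
        return reference.startsWith("&#") && !isHexadecimal(reference);
    }

    public static void main(String[] args) {
        System.out.println(isDecimal("&#62;"));      // prints true
        System.out.println(isHexadecimal("&#x3e;")); // prints true
        System.out.println(isDecimal("&gt;"));       // prints false
    }
}
```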