FSM_ASCII
public static final int FSM_ASCII
States for ISO 2022. A document in an ISO-2022 based encoding uses ESC sequences called "designators" to switch
character sets. The designators defined and used in ISO-2022-JP are:
"ESC" + "(" + ? for ISO646 variants;
"ESC" + "$" + ? and "ESC" + "$" + "(" + ? for multibyte character sets.
State ASCII.
FSM_ESC
public static final int FSM_ESC
State ESC.
FSM_ESCD
public static final int FSM_ESCD
State ESCD.
FSM_ESCDP
public static final int FSM_ESCDP
State ESCDP.
FSM_ESCP
public static final int FSM_ESCP
State ESCP.
FSM_NONASCII
public static final int FSM_NONASCII
State NONASCII.
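The designator handling described under FSM_ASCII can be sketched as a small state machine. This is only an illustration, not the library's implementation: the class `Iso2022StateSketch`, its `State` enum, and the methods `next` and `run` are hypothetical names. The reading of ESCD as "ESC $ seen", ESCDP as "ESC $ ( seen", and ESCP as "ESC ( seen" is an assumption inferred from the constant names.

```java
class Iso2022StateSketch {
    // Hypothetical mirror of the FSM_* constants; their real int values are not shown in this reference.
    enum State { ASCII, ESC, ESCD, ESCDP, ESCP, NONASCII }

    static final int ESC_BYTE = 0x1B;

    /** Returns the state reached after consuming one byte (0..255) in the given state. */
    static State next(State state, int b) {
        if (b == ESC_BYTE) {
            return State.ESC; // an ESC byte always starts a new designator
        }
        switch (state) {
            case ESC:
                if (b == '$') return State.ESCD;  // "ESC $" seen
                if (b == '(') return State.ESCP;  // "ESC (" seen
                return State.ASCII;               // not a designator this sketch handles
            case ESCD:
                // "ESC $ (" needs one more final byte; otherwise b was the final byte
                return b == '(' ? State.ESCDP : State.NONASCII;
            case ESCDP:
                return State.NONASCII;            // final byte of "ESC $ ( ?"
            case ESCP:
                return State.ASCII;               // final byte of "ESC ( ?" (assumed: ISO646 variant)
            default:
                return state;                     // ASCII and NONASCII persist until the next ESC
        }
    }

    /** Runs the machine over a byte sequence, starting in ASCII. */
    static State run(byte[] input) {
        State s = State.ASCII;
        for (byte b : input) {
            s = next(s, b & 0xFF);
        }
        return s;
    }
}
```

For example, feeding the bytes ESC, "$", "B" leaves the sketch in NONASCII, while ESC, "(", "B" returns it to ASCII.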
HIGH_UTF16_SURROGATE
public static final int HIGH_UTF16_SURROGATE
UTF-16 high surrogate.
LOW_UTF16_SURROGATE
public static final int LOW_UTF16_SURROGATE
UTF-16 low surrogate.
MAX_UTF16_FROM_UCS4
public static final int MAX_UTF16_FROM_UCS4
Max UTF-16 value.
MAX_UTF8_FROM_UCS4
public static final int MAX_UTF8_FROM_UCS4
Max UTF-8 valid char value.
UNICODE_BOM
public static final int UNICODE_BOM
The default (big-endian) Unicode BOM.
UNICODE_BOM_BE
public static final int UNICODE_BOM_BE
The big-endian (default) Unicode BOM.
UNICODE_BOM_LE
public static final int UNICODE_BOM_LE
The little-endian Unicode BOM.
UNICODE_BOM_UTF8
public static final int UNICODE_BOM_UTF8
The UTF-8 Unicode BOM.
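As a rough illustration of how byte order marks like these are typically used, the sketch below checks the first bytes of a stream. It relies on the standard BOM byte sequences (FE FF for big-endian UTF-16, FF FE for little-endian UTF-16, EF BB BF for UTF-8) rather than on the values of the UNICODE_BOM* constants, which are not shown in this reference. The class `BomSketch` and the method `detect` are hypothetical names.

```java
class BomSketch {
    /**
     * Returns "UTF-8", "UTF-16BE" or "UTF-16LE" if the data starts with the
     * corresponding byte order mark, or null if no BOM is recognized.
     */
    static String detect(byte[] data) {
        // The UTF-8 BOM is three bytes long: EF BB BF.
        if (data.length >= 3
                && (data[0] & 0xFF) == 0xEF
                && (data[1] & 0xFF) == 0xBB
                && (data[2] & 0xFF) == 0xBF) {
            return "UTF-8";
        }
        if (data.length >= 2) {
            // Read the first two bytes as one big-endian 16-bit unit.
            int unit = ((data[0] & 0xFF) << 8) | (data[1] & 0xFF);
            if (unit == 0xFEFF) {
                return "UTF-16BE"; // bytes FE FF
            }
            if (unit == 0xFFFE) {
                return "UTF-16LE"; // bytes FF FE: U+FEFF stored low byte first
            }
        }
        return null;
    }
}
```

The UTF-8 test comes first only for readability; its leading byte EF cannot be confused with either two-byte mark.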
UTF16_HIGH_SURROGATE_BEGIN
public static final int UTF16_HIGH_SURROGATE_BEGIN
UTF-16 surrogate pair areas: high surrogates begin.
UTF16_HIGH_SURROGATE_END
public static final int UTF16_HIGH_SURROGATE_END
UTF-16 surrogate pair areas: high surrogates end.
UTF16_LOW_SURROGATE_BEGIN
public static final int UTF16_LOW_SURROGATE_BEGIN
UTF-16 surrogate pair areas: low surrogates begin.
UTF16_LOW_SURROGATE_END
public static final int UTF16_LOW_SURROGATE_END
UTF-16 surrogate pair areas: low surrogates end.
UTF16_SURROGATES_BEGIN
public static final int UTF16_SURROGATES_BEGIN
UTF-16 surrogates begin.
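The surrogate ranges above delimit the code units that UTF-16 uses to encode characters beyond U+FFFF. The following sketch shows the standard arithmetic for combining and splitting a surrogate pair. It uses the ranges from the Unicode standard (D800..DBFF for the first unit of a pair, DC00..DFFF for the second) as local constants; whether the constants in this class carry the same values and the same high/low naming is an assumption, since their values are not shown here. `SurrogateSketch` and its members are hypothetical names.

```java
class SurrogateSketch {
    // Ranges from the Unicode standard, declared locally for illustration.
    static final int HIGH_BEGIN = 0xD800, HIGH_END = 0xDBFF; // first unit of a pair
    static final int LOW_BEGIN = 0xDC00, LOW_END = 0xDFFF;   // second unit of a pair
    static final int SUPPLEMENTARY_BEGIN = 0x10000;          // first code point that needs a pair
    static final int MAX_CODE_POINT = 0x10FFFF;

    static boolean isHigh(int unit) {
        return unit >= HIGH_BEGIN && unit <= HIGH_END;
    }

    static boolean isLow(int unit) {
        return unit >= LOW_BEGIN && unit <= LOW_END;
    }

    /** Combines a valid surrogate pair into a single code point. */
    static int combine(int high, int low) {
        if (!isHigh(high) || !isLow(low)) {
            throw new IllegalArgumentException("not a valid surrogate pair");
        }
        // Each unit contributes 10 bits; the result is offset by 0x10000.
        return ((high - HIGH_BEGIN) << 10) + (low - LOW_BEGIN) + SUPPLEMENTARY_BEGIN;
    }

    /** Splits a supplementary code point into {high, low}. */
    static int[] split(int codePoint) {
        if (codePoint < SUPPLEMENTARY_BEGIN || codePoint > MAX_CODE_POINT) {
            throw new IllegalArgumentException("code point does not need a surrogate pair");
        }
        int offset = codePoint - SUPPLEMENTARY_BEGIN;
        return new int[] { HIGH_BEGIN + (offset >> 10), LOW_BEGIN + (offset & 0x3FF) };
    }
}
```

For example, `combine(0xD83D, 0xDE00)` yields 0x1F600, which matches what the JDK's `Character.toCodePoint` returns for the same pair.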