Class UnicodeEncoding
java.lang.Object
org.graalvm.shadowed.org.jcodings.Encoding
org.graalvm.shadowed.org.jcodings.MultiByteEncoding
org.graalvm.shadowed.org.jcodings.unicode.UnicodeEncoding
- All Implemented Interfaces:
Cloneable
- Direct Known Subclasses:
CESU8Encoding,FixedWidthUnicodeEncoding,NonStrictUTF8Encoding,UTF16BEEncoding,UTF16LEEncoding,UTF8Encoding
-
Field Summary
-
Constructor Summary
ConstructorsModifierConstructorDescriptionprotectedUnicodeEncoding(String name, int minLength, int maxLength, int[] EncLen) protectedUnicodeEncoding(String name, int minLength, int maxLength, int[] EncLen, int[][] Trans) -
Method Summary
Modifier and TypeMethodDescriptionvoidapplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) onigenc_ascii_apply_all_case_fold / used also by multibyte encodingsprotected final voidasciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) protected final CaseFoldCodeItem[]asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end) protected final intasciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) caseFoldCodesByString(int flag, byte[] bytes, int p, int end) onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodingsfinal intOniguruma equivalent:case_mapprotected final int[]ctypeCodeRange(int ctype) The name of the equivalent Java Charset for this encoding.booleanisCodeCType(int code, int ctype) Perform a check whether given code is of given character type (e.g.protected final booleanisCodeCTypeInternal(int code, int ctype) ONIGENC_IS_XXXXXX_CODE_CTYPEstatic booleanisInCodeRange(UnicodeCodeRange range, int code) booleanisNewLine(byte[] bytes, int p, int end) onigenc_is_mbc_newline_0x0a / used also by multibyte encodingsintmbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] fold) onigenc_ascii_mbc_case_foldintpropertyNameToCType(byte[] name, int p, int end) onigenc_minimum_property_name_to_ctype notably overridden by unicode encodingsMethods inherited from class org.graalvm.shadowed.org.jcodings.MultiByteEncoding
isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthMethods inherited from class org.graalvm.shadowed.org.jcodings.Encoding
asciiToLower, asciiToUpper, codeToMbc, codeToMbcLength, ctypeCodeRange, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isReverseMatchAllowed, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, leftAdjustCharHead, length, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, mbcToCode, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Constructor Details
-
UnicodeEncoding
-
UnicodeEncoding
-
-
Method Details
-
getCharsetName
Description copied from class:EncodingThe name of the equivalent Java Charset for this encoding. Defaults to the name of the encoding. Subclasses can override this to provide a different name.- Overrides:
getCharsetNamein classEncoding- Returns:
- the name of the equivalent Java Charset for this encoding
-
isCodeCType
public boolean isCodeCType(int code, int ctype) Description copied from class:EncodingPerform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCTypein classEncoding- Parameters:
code- a code point of a characterctype- a character type to check against Oniguruma equivalent:is_code_ctype
-
isInCodeRange
-
ctypeCodeRange
protected final int[] ctypeCodeRange(int ctype) -
propertyNameToCType
public int propertyNameToCType(byte[] name, int p, int end) onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings -
mbcCaseFold
onigenc_ascii_mbc_case_fold- Parameters:
flag- case fold flagpp- anIntHolderthat points at character headfold- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
applyAllCaseFold
onigenc_ascii_apply_all_case_fold / used also by multibyte encodings- Parameters:
flag- case fold flagfun- case folding functor (look at:ApplyCaseFold)arg- case folding functor argument (look at:ApplyCaseFoldArg) Oniguruma equivalent:apply_all_case_fold
-
caseFoldCodesByString
onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings -
caseMap
public final int caseMap(IntHolder flagP, byte[] bytes, IntHolder pp, int end, byte[] to, int toP, int toEnd) Description copied from class:EncodingOniguruma equivalent:case_map- Overrides:
caseMapin classMultiByteEncoding
-
isCodeCTypeInternal
protected final boolean isCodeCTypeInternal(int code, int ctype) ONIGENC_IS_XXXXXX_CODE_CTYPE -
isNewLine
public boolean isNewLine(byte[] bytes, int p, int end) onigenc_is_mbc_newline_0x0a / used also by multibyte encodings -
asciiMbcCaseFold
-
asciiApplyAllCaseFold
-
asciiCaseFoldCodesByString
protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)
-