Class EUCJPEncoding
java.lang.Object
org.graalvm.shadowed.org.jcodings.Encoding
org.graalvm.shadowed.org.jcodings.MultiByteEncoding
org.graalvm.shadowed.org.jcodings.EucEncoding
org.graalvm.shadowed.org.jcodings.specific.EUCJPEncoding
- All Implemented Interfaces:
Cloneable
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidapplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) onigenc_ascii_apply_all_case_fold / used also by multibyte encodingsprotected final voidasciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) protected final CaseFoldCodeItem[]asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end) protected final intasciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) caseFoldCodesByString(int flag, byte[] bytes, int p, int end) onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodingsintcodeToMbc(int code, byte[] bytes, int p) Extracts code point into it's multibyte representationintcodeToMbcLength(int code) Returns character length given a code point Oniguruma equivalent:code_to_mbclenint[]ctypeCodeRange(int ctype, IntHolder sbOut) Returns code range for a given character type Oniguruma equivalent:get_ctype_code_rangeThe name of the equivalent Java Charset for this encoding.booleanisCodeCType(int code, int ctype) Perform a check whether given code is of given character type (e.g.protected final booleanisCodeCTypeInternal(int code, int ctype) ONIGENC_IS_XXXXXX_CODE_CTYPEprotected booleanisLead(int c) booleanisNewLine(byte[] bytes, int p, int end) onigenc_is_mbc_newline_0x0a / used also by multibyte encodingsbooleanisReverseMatchAllowed(byte[] bytes, int p, int end) Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_matchintlength(byte[] bytes, int p, int end) Returns character length given stream, character position and stream end returns1for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwiseintmbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) onigenc_ascii_mbc_case_foldintmbcToCode(byte[] bytes, int p, int end) Returns code point for a character Oniguruma equivalent:mbc_to_codeintpropertyNameToCType(byte[] bytes, int p, int end) onigenc_minimum_property_name_to_ctype notably overridden by unicode encodingsMethods inherited from class org.graalvm.shadowed.org.jcodings.EucEncoding
leftAdjustCharHeadMethods inherited from class org.graalvm.shadowed.org.jcodings.MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthMethods inherited from class org.graalvm.shadowed.org.jcodings.Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Details
-
INSTANCE
-
-
Constructor Details
-
EUCJPEncoding
protected EUCJPEncoding()
-
-
Method Details
-
length
public int length(byte[] bytes, int p, int end) Description copied from class:EncodingReturns character length given stream, character position and stream end returns1for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwise -
getCharsetName
Description copied from class:EncodingThe name of the equivalent Java Charset for this encoding. Defaults to the name of the encoding. Subclasses can override this to provide a different name.- Overrides:
getCharsetNamein classEncoding- Returns:
- the name of the equivalent Java Charset for this encoding
-
mbcToCode
public int mbcToCode(byte[] bytes, int p, int end) Description copied from class:EncodingReturns code point for a character Oniguruma equivalent:mbc_to_code -
codeToMbcLength
public int codeToMbcLength(int code) Description copied from class:EncodingReturns character length given a code point Oniguruma equivalent:code_to_mbclen- Specified by:
codeToMbcLengthin classEncoding
-
codeToMbc
public int codeToMbc(int code, byte[] bytes, int p) Description copied from class:EncodingExtracts code point into it's multibyte representation -
mbcCaseFold
onigenc_ascii_mbc_case_fold- Parameters:
flag- case fold flagpp- anIntHolderthat points at character headlower- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
isLead
protected boolean isLead(int c) - Specified by:
isLeadin classEucEncoding
-
isReverseMatchAllowed
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end) Description copied from class:EncodingReturns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match- Specified by:
isReverseMatchAllowedin classEncoding
-
propertyNameToCType
public int propertyNameToCType(byte[] bytes, int p, int end) onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings -
isCodeCType
public boolean isCodeCType(int code, int ctype) Description copied from class:EncodingPerform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCTypein classEncoding- Parameters:
code- a code point of a characterctype- a character type to check against Oniguruma equivalent:is_code_ctype
-
ctypeCodeRange
Description copied from class:EncodingReturns code range for a given character type Oniguruma equivalent:get_ctype_code_range- Specified by:
ctypeCodeRangein classEncoding
-
isCodeCTypeInternal
protected final boolean isCodeCTypeInternal(int code, int ctype) ONIGENC_IS_XXXXXX_CODE_CTYPE -
isNewLine
public boolean isNewLine(byte[] bytes, int p, int end) onigenc_is_mbc_newline_0x0a / used also by multibyte encodings -
asciiMbcCaseFold
-
asciiApplyAllCaseFold
-
applyAllCaseFold
onigenc_ascii_apply_all_case_fold / used also by multibyte encodings- Specified by:
applyAllCaseFoldin classEncoding- Parameters:
flag- case fold flagfun- case folding functor (look at:ApplyCaseFold)arg- case folding functor argument (look at:ApplyCaseFoldArg) Oniguruma equivalent:apply_all_case_fold
-
asciiCaseFoldCodesByString
protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end) -
caseFoldCodesByString
onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings- Specified by:
caseFoldCodesByStringin classEncoding
-