UNIHAN_FIELD_NOT_INSTALLED |
The field have not been installed. |
UNIHAN_INVALID_FIELD |
End of an Unihan Field array or indicate invalid field. |
UNIHAN_FIELD_CODE |
Unicode code point in integer. |
UNIHAN_FIELD_kACCOUNTINGNUMERIC |
Character when used in the writing of accounting numerals. |
UNIHAN_FIELD_kBIGFIVE |
Big5 Encoding. |
UNIHAN_FIELD_kCANGJIE |
Cangjie input code. |
UNIHAN_FIELD_kCANTONESE |
Cantonese pronunciation(s) using the jyutping romanization. |
UNIHAN_FIELD_kCCCII |
Chinese Character Code for Information Interchange. |
UNIHAN_FIELD_kCHEUNGBAUER |
Data regarding the Cheung and Bauer, The Representation of Cantonese with Chinese Characters. |
UNIHAN_FIELD_kCHEUNGBAUERINDEX |
The position of the character in Cheung and Bauer, "The Representation of Cantonese with Chinese Characters". |
UNIHAN_FIELD_kCIHAIT |
The position of this character in the Cihai (辭海) dictionary. |
UNIHAN_FIELD_kCNS1986 |
CNS 11643-1986. |
UNIHAN_FIELD_kCNS1992 |
CNS 11643-1992. |
UNIHAN_FIELD_kCOMPATIBILITYVARIANT |
The compatibility decomposition for this ideograph. |
UNIHAN_FIELD_kCOWLES |
in Cowles, "A Pocket Dictionary of Cantonese". |
UNIHAN_FIELD_kDAEJAWEON |
in the Dae Jaweon (Korean) dictionary. |
UNIHAN_FIELD_kDEFINITION |
An English definition for this character. |
UNIHAN_FIELD_kEACC |
EACC mapping for this character in hex. |
UNIHAN_FIELD_kFENN |
from Fenn's Chinese-English Pocket Dictionary. |
UNIHAN_FIELD_kFENNINDEX |
The position in Fenn's Chinese-English Pocket Dictionary by Courtenay. |
UNIHAN_FIELD_kFOURCORNERCODE |
The four-corner code(s) for the character. |
UNIHAN_FIELD_kFREQUENCY |
A rough frequency measurement for the character based on analysis of traditional Chinese USENET postings. |
UNIHAN_FIELD_kGB0 |
GB 2312-80. |
UNIHAN_FIELD_kGB1 |
GB 12345-90. |
UNIHAN_FIELD_kGB3 |
GB 7589-87. |
UNIHAN_FIELD_kGB5 |
GB 7590-87. |
UNIHAN_FIELD_kGB7 |
General Purpose Hanzi List for Modern Chinese Language, and General List of Simplified Hanzi. |
UNIHAN_FIELD_kGB8 |
GB 8565-89. |
UNIHAN_FIELD_kGRADELEVEL |
The primary grade in the Hong Kong school system by which a student is expected to know the character. |
UNIHAN_FIELD_kGSR |
in Bernhard Karlgren's Grammata Serica Recensa. |
UNIHAN_FIELD_kHANGUL |
modern Korean pronunciation(s) in Hangul. |
UNIHAN_FIELD_kHANYU |
in Hanyu Da Zidian (HDZ) Chinese character dictionary. |
UNIHAN_FIELD_kHANYUPINLU |
in Xiandai Hanyu Pinlu Cidian [Modern Standard Beijing Chinese Frequency Dictionary]. |
UNIHAN_FIELD_kHDZRADBREAK |
Indicates that 《漢語大字典》 Hanyu Da Zidian has a radical break beginning at this character's position. |
UNIHAN_FIELD_kHKGLYPH |
The index of the character in 常用字字形表 (二零零零年修訂本),香港: 香港教育學院. |
UNIHAN_FIELD_kHKSCS |
Big5 extended code points for the HK Supplementary Character Set. |
UNIHAN_FIELD_kIBMJAPAN |
IBM Japanese mapping for this character in hexadecimal. |
UNIHAN_FIELD_kIICORE |
IICore, the IRG-produced minimal set of required ideographs for East Asian use. |
UNIHAN_FIELD_kIRGDAEJAWEON |
in Dae Jaweon (Korean) dictionary used in the four-dictionary sorting algorithm. |
UNIHAN_FIELD_kIRGDAIKANWAZITEN |
in Dai Kanwa Ziten, aka Morohashi dictionary (Japanese) used in the four-dictionary sorting algorithm. |
UNIHAN_FIELD_kIRGHANYUDAZIDIAN |
in Hanyu Da Zidian (PRC) used in the four-dictionary sorting algorithm. |
UNIHAN_FIELD_kIRGKANGXI |
in KangXi dictionary. |
UNIHAN_FIELD_kIRG_GSOURCE |
PRC/Singapore sources, including mapping information. |
UNIHAN_FIELD_kIRG_HSOURCE |
Hong Kong sources , including mapping information. |
UNIHAN_FIELD_kIRG_JSOURCE |
Japanese sources, including mapping information. |
UNIHAN_FIELD_kIRG_KPSOURCE |
North Korean sources, including mapping information. |
UNIHAN_FIELD_kIRG_KSOURCE |
South Korean sources, including mapping information. |
UNIHAN_FIELD_kIRG_TSOURCE |
Taiwan sources, including mapping information. |
UNIHAN_FIELD_kIRG_USOURCE |
Unicode/USA sources, including mapping information. |
UNIHAN_FIELD_kIRG_VSOURCE |
Vietname sources, including mapping information. |
UNIHAN_FIELD_kJAPANESEKUN |
Japanese pronunciation(s). |
UNIHAN_FIELD_kJAPANESEON |
Sino-Japanese pronunciation(s) of this character. |
UNIHAN_FIELD_kJIS0213 |
JIS X 0213-2000. |
UNIHAN_FIELD_kJIS0 |
JIS X 0208-1990. |
UNIHAN_FIELD_kJIS1 |
JIS X 0212-1990. |
UNIHAN_FIELD_kKANGXI |
in KangXi dictionary used in the four-dictionary sorting algorithm. |
UNIHAN_FIELD_kKARLGREN |
in Analytic Dictionary of Chinese and Sino-Japanese by Bernhard Karlgren. |
UNIHAN_FIELD_kKOREAN |
The Korean pronunciation(s) of this character. |
UNIHAN_FIELD_kKPS0 |
KPS 9566-97. |
UNIHAN_FIELD_kKPS1 |
KPS 10721-2000. |
UNIHAN_FIELD_kKSC0 |
KS X 1001:1992 (KS C 5601-1989). |
UNIHAN_FIELD_kKSC1 |
KS X 1002:1991 (KS C 5657-1991). |
UNIHAN_FIELD_kLAU |
A Practical Cantonese-English Dictionary by Sidney Lau. |
UNIHAN_FIELD_kMAINLANDTELEGRAPH |
PRC telegraph code for this character. |
UNIHAN_FIELD_kMANDARIN |
Mandarin pronunciation(s) for this character in pinyin. |
UNIHAN_FIELD_kMATTHEWS |
in Mathews' Chinese-English Dictionary. |
UNIHAN_FIELD_kMEYERWEMPE |
Student's Cantonese-English Dictionary. |
UNIHAN_FIELD_kMOROHASHI |
Dae Kanwa Ziten, aka Morohashi dictionary (Japanese). |
UNIHAN_FIELD_kNELSON |
The Modern Reader's Japanese-English Character Dictionary. |
UNIHAN_FIELD_kOTHERNUMERIC |
The numeric value for the character in certain unusual, specialized contexts. |
UNIHAN_FIELD_kPHONETIC |
The phonetic index for the character from Ten Thousand Characters: An Analytic Dictionary. |
UNIHAN_FIELD_kPRIMARYNUMERIC |
The value of the character when used in the writing of numbers in the standard fashion. |
UNIHAN_FIELD_kPSEUDOGB1 |
A "GB 12345-90" code point assigned this character for the purposes of including it within Unihan. |
UNIHAN_FIELD_kRSADOBE_JAPAN1_6 |
Information on the glyphs in Adobe-Japan1-6 as contributed by Adobe. |
UNIHAN_FIELD_kRSJAPANESE |
A Japanese radical/stroke count. |
UNIHAN_FIELD_kRSKANGXI |
The KangXi radical/stroke count. |
UNIHAN_FIELD_kRSKANWA |
A Morohashi radical/stroke count. |
UNIHAN_FIELD_kRSKOREAN |
A Korean radical/stroke count. |
UNIHAN_FIELD_kRSUNICODE |
A standard radical/stroke count. |
UNIHAN_FIELD_kSBGY |
Song Ben Guang Yun (SBGY) 《宋本廣韻》 Medieval Chinese character dictionary. |
UNIHAN_FIELD_kSEMANTICVARIANT |
Semantic variants for this character, including dictionaries that refer it. |
UNIHAN_FIELD_kSIMPLIFIEDVARIANT |
Simplified variant for this character (if any). |
UNIHAN_FIELD_kSPECIALIZEDSEMANTICVARIANT |
Specialized semantic variant for this character, including dictionaries that refer it. |
UNIHAN_FIELD_kTAIWANTELEGRAPH |
Taiwanese telegraph code for this character. |
UNIHAN_FIELD_kTANG |
T'ang Poetic Vocabulary. |
UNIHAN_FIELD_kTOTALSTROKES |
The total number of strokes in the character (including the radical). |
UNIHAN_FIELD_kTRADITIONALVARIANT |
Traditional Chinese variant(s) for this character. |
UNIHAN_FIELD_kVIETNAMESE |
character's pronunciation(s) in in Quốc ngữ. |
UNIHAN_FIELD_kXEROX |
The Xerox code for this character. |
UNIHAN_FIELD_kXHC1983 |
One or more Hanyu pinyin reading as given in Xiandai Hanyu Ciden. |
UNIHAN_FIELD_kZVARIANT |
Z-variants of this character, including the source that refers it. |
UNIHAN_FIELD_DICT_VOLUME |
Volume number in dictionary. |
UNIHAN_FIELD_DICT_PAGE |
Page number in dictionary. |
UNIHAN_FIELD_DICT_POSITION |
The character number in the page. |
UNIHAN_FIELD_DICT_VIRTUAL |
Virtual position of the character in dictionary. 0 if the character is in the dictionary, greater than 0 for a character assigned a "virtual" position in the dictionary. |
UNIHAN_FIELD_DICT_VARIANT_SERIAL |
Serial number of variant. 0 for a main entry and greater than 0 for a parenthesized variant. |
UNIHAN_FIELD_DICT_UNENCODED |
Unencoded character in the dictionary which is replaced by one or more encoded variants. Currently used only by kXHC1983. |
UNIHAN_FIELD_IRG_GSOURCE |
The abbreviated G source name such as "G0" or "G4K". |
UNIHAN_FIELD_IRG_HSOURCE |
The abbreviated H source name "H". |
UNIHAN_FIELD_IRG_JSOURCE |
The abbreviated J source name such as "J0" or "J1". |
UNIHAN_FIELD_IRG_KPSOURCE |
The abbreviated KP source name such as "KP0" or "KP1". |
UNIHAN_FIELD_IRG_KSOURCE |
The abbreviated K source name such as "K0" or "K1". |
UNIHAN_FIELD_IRG_TSOURCE |
The abbreviated T source name such as "T1" or "T2". |
UNIHAN_FIELD_IRG_USOURCE |
The abbreviated U source name "U". |
UNIHAN_FIELD_IRG_VSOURCE |
The abbreviated V source name such as "V0" or "V1". |
UNIHAN_FIELD_IRG_SOURCE_MAPPING |
The index (code) in hex as in the corresponding IRG source. |
UNIHAN_FIELD_PINYIN |
Pinyin with tone. Always in lower case. |
UNIHAN_FIELD_PINYIN_BASE |
Pinyin without tone. |
UNIHAN_FIELD_PINYIN_TONE |
Tone of Pinyin. |
UNIHAN_FIELD_PINYIN_FREQ |
Frequency appears in Xiandai Hanyu Pinlu Cidian (現代漢語頻率詞典). |
UNIHAN_FIELD_ZHUYIN |
Zhuyin field. |
UNIHAN_FIELD_ADOBE_CID_CV |
C or V. "C" indicates that the Unicode code point maps directly to the Adobe-Japan1-6 CID that appears after it, and "V" indicates that it is considered a variant form, and thus not directly encoded. |
UNIHAN_FIELD_ADOBE_CID |
The Adobe-Japan1-6 CID. |
UNIHAN_FIELD_ADOBE_CID_RADICAL_STROKE_COUNT |
Stroke counts of KangXi radical. |
UNIHAN_FIELD_RADICAL_INDEX |
Index of KangXi radical. |
UNIHAN_FIELD_ADDITIONAL_STROKE_COUNT |
Number of strokes of character without radical. As in paper dictionary. |
UNIHAN_FIELD_RADICAL_IS_SIMPLIFIED |
1 if radical is simplified, 0 for normal radical. |
UNIHAN_FIELD_SEMANTICVARIANT |
Semantic Variant in UCS4, without dictionary information. |
UNIHAN_FIELD_SPECIALIZEDSEMANTICVARIANT |
Specialized Semantic Variant in UCS4, without dictionary information. |
UNIHAN_FIELD_FROM_DICT |
The dictionary that define the semantic relation. |
UNIHAN_FIELD_SEMANTIC_T |
"Tong" (同,synonym). The character and variant one are interchangeable. |
UNIHAN_FIELD_SEMANTIC_B |
"Bu" (不,incompatible). The character and variant one are not interchangeable. |
UNIHAN_FIELD_SEMANTIC_Z |
"Zheng" (正,preferred). The variant character is preferred. |
UNIHAN_FIELD_ZVARIANT |
Z Variant in UCS4, without source information. |
UNIHAN_FIELD_ZVARIANT_SOURCE |
The "Source" of Z variants, such as "kHKGlyph". |
UNIHAN_FIELD_FREQ_RANK |
The rank of the frequency, 1 stands for most frequent, 2 for less frequent and so on. |
UNIHAN_FIELD_SCALAR_VALUE |
Scalar representation (U+XXXXX) of the character. |
UNIHAN_FIELD_SERIAL |
Hold an artificial sequence number for sorting. |
UNIHAN_FIELD_SERIAL_NO_JOIN |
Similar with UNIHAN_FIELD_SERIAL , but this field will not be used in automatic join. |
UNIHAN_FIELD_UTF8 |
UTF8 representation of the character. |
UNIHAN_FIELD_3RD_PARTY |
3RD party fields. |