proposed utc c2 non submission - unicodethis documents lists the cjk ideographs which have been...
TRANSCRIPT
-
This documents lists the CJK ideographs which have been rquested of the UTC (through me) which are not currently encoded and which I do not recommend we include with our C2 submission. Rather, I recommend they be represented using variation sequences per UTS 37.
Proposed UTC C2 Non-submissionJohn H. Jenkins
Apple Computer, Inc.
UTC00001
Cowles, Roy T. (1986) A Pocket Dictionary of Cantonese. Hong Kong: University Press. #4762, CHIN2, defined to mean “A coin. Cash. Money. S.” ≈U+9322; ultrafanti writing of [U+233b4]
Source
4.2RSIndex
UTC00006
Karlgren, Bernhard (1991) Analytic Dictionary of Chinese and Sino-Japanese. New York: Dover Publications, Inc. #570. lou/lau, “narrow, confined; retired”. Also: Mathews #4150, lou4, “to retire into obscurity”; Variant of [U+3537]
Source
1.5RSIndex
UTC00007
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
9.7RSIndex
UTC00013
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
211.8RSIndex
11/1/2006 1
[email protected] BoxL2/06-365
-
UTC00024
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
159.9RSIndex
UTC00029
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
120.5RSIndex
UTC00030
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
149.7RSIndex
UTC00038
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
164.6RSIndex
UTC00042
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
== CB 407.04 CHAU1 bunch, bundle
Source
64.5RSIndex
11/1/2006 2
-
UTC00048
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
140.7RSIndex
UTC00052
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
167.5RSIndex
UTC00061
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
196.10RSIndex
UTC00062
ABC2 (DeFrancis), etc.; See:http://linguistics.berkeley.edu/~rscook/bishop/Wenlin-PUA.pdf
Source
15.10RSIndex
UTC00111
Mathews #5434, sui4, “deep; abstruse; in the rear, far-off”
Source
116.13RSIndex
11/1/2006 3
-
UTC00117
ABC; Simp. of [U+9d4f]Source
196.7RSIndex
UTC00122
DYC:058.330 (=[U+5448];==[U+2f83e]? HDZ: 10589.110)
Cihai 258.501, a Z-variant of U+5448 (see Wenlin)
Source
30.4RSIndex
UTC00142
DYC:456.212 (HDZ: 20961.020)==[U+2f896]?Source
58.5RSIndex
UTC00154
DYC:675.410Source
142.16RSIndex
UTC00156
DYC:678.150 (phonetic is [U+66f0], not [U+65e5]);Cf. DYC 413.410 [U+6b25] [U+e1ee] (HDZ 32135.090)
Source
182.4RSIndex
11/1/2006 4
-
UTC00161
WL [U+e1e2]=[U+8fa5](!=[U+5dad]) SBGY:496.30Source
160.9RSIndex
UTC00169
WL [U+e00b]=[U+20982] SBGY:156.35Source
24.4RSIndex
UTC00185
WL [U+e02a]=[U+2a21c] SBGY:264.16Source
120.22RSIndex
UTC00197
WL [U+e039]=[U+27b0c] SBGY:230.18 ([U+5d52]ne[U+55a6]) cf. HDZ form 64018.150. (Note: IRG unified [U+27B0C]; HDZ has [U+55a6], while the Unicode glyph has [U+5d52]). The SBGY glyph (UTC#197) has [U+5d52] ... )
Source
149.12RSIndex
UTC00204
WL [U+e045]=[U+27e29] SBGY:394.50Source
154.14RSIndex
11/1/2006 5
-
UTC00212
WL [U+e053]=[U+26b83] SBGY:094.45 (Bottom has[U+571f] on left.)
Source
140.6RSIndex
UTC00213
WL [U+e054]=[U+29b81] SBGY:488.13 SBGY:491.14 (SBGY component is [U+5200] not [U+529b]; cf. HDZ 74529.080)
Source
190.9RSIndex
UTC00216
WL [U+e057]=[U+23037] SBGY:268.25Source
65.19RSIndex
UTC00217
WL [U+e058]=[U+4d66] SBGY:308.23Source
203.6RSIndex
UTC00231
WL [U+e073]=[U+3d94] SBGY:386.15U+3D94 VARIANT
Source
85.18RSIndex
11/1/2006 6
-
UTC00235
WL [U+e07a]=[U+269b7][U+2574b] SBGY:360.16Source
114.21RSIndex
UTC00237
WL [U+e07c]=[U+4c27] SBGY:083.24Source
194.8RSIndex
UTC00242
WL [U+e084]=[U+6722] SBGY:426.31U+6722 VARIANT
Source
96.10RSIndex
UTC00247
WL [U+e08c]=[U+27d6a] SBGY:530.08Source
154.6RSIndex
UTC00264
WL [U+e0a8]=[U+2755d] SBGY:197.09Source
142.18RSIndex
11/1/2006 7
-
UTC00275
WL [U+e0b5]=[U+403d] SBGY:411.04
Are we sure this is encoded? It doesn't look like either U+251A8 or U+043D to me. JHJ 9/1/06
Source
109.7RSIndex
UTC00298
LDS #18 (8151)Source
61.4RSIndex
UTC00317
WL [U+efbc]([U+27d2a]); cf. [U+efa6]Source
154.3RSIndex
UTC00320
WL [U+e1e7]=[U+2676e] DYC:054.221the radical is a variant of [U+8089]
Source
130.8RSIndex
UTC00324
WL [U+e1ec]=[U+86d3] DYC: 665.230Source
142.7RSIndex
11/1/2006 8
-
UTC00328
Cowles #15 == MW 11 == CB 383.06, (NG)AAI, to quarrel (equivalent to U+9698)
Source
30.12RSIndex
UTC00329
Lau 186a ≈ U+6491Source
64.14RSIndex
UTC00333
Lau 599b Fenn 497.06 ≈ U+64CB, U+6529, DANG3/DONG2, to obstruct
Source
64.10RSIndex
UTC00334
Lau 1074a ≈ U+7BA1, GUN2, pipe, tube, ductSource
87.8RSIndex
UTC00335
Yet another turtle!!! Lau 1028a, Cowles 2059, MW 1368, Fenn 272.04
Source
5.9RSIndex
11/1/2006 9
-
UTC00339
Lau 1371a, 1372a == Fenn 22.11 ≈ U+9019, JE5, this
Source
162.3RSIndex
UTC00340
Lau #1560a ≈ U+846C, JONG3, to burySource
140.9RSIndex
UTC00341
Lau 1956a, 1957a ≈ U+87BA, LO4, conchSource
142.13RSIndex
UTC00342
Lau 2136 ≈ U+8993 MIK6 to hunt for, seekSource
154.4RSIndex
UTC00343
Lau #2923 ≈ U+7B97, SYUN3, to calculate, count, regard as
Cowles 3708 == MW 2859 ≈ U+7B97 SYUN3 to calculate
Source
118.8RSIndex
11/1/2006 10
-
UTC00344
Lau 2962a ≈ U+69AB SEUN3 tenon, mortiseSource
24.12RSIndex
UTC00347
Lau 206f ≈ U+7CBB JEUNG1, fine white rice, provisions
Source
119.11RSIndex
UTC00349
Lau 321b, CHIU4, a sea animal that sings at night; could this be a variant of U+232CB? [no; it's a variantof U+9f02 (RSC)]
See also SBGY 147.13
Source
205.4RSIndex
UTC00353
Lau 516a ≈ U+874C FO1 tadpoleSource
30.9RSIndex
UTC00356
Lau 740a ≈ U+568E HOU1, to howl, roar; this is probably the same as CB 376.06 (and what about U+5651?)
Source
30.12RSIndex
11/1/2006 11
-
UTC00357
Lau 766 == Cowles 1139 ≈ U+58F3, U+6BBC, HOK3,husks; a dipper
Source
79.6RSIndex
UTC00358
MW 772 == Cowles 1148 == CB 410.03 ≈ U+627BHAM2 to knock, strike
Source
64.7RSIndex
UTC00369
MW 1212 ≈ U+69EA KOI3, the whole, altogetherSource
75.11RSIndex
UTC00370
MW 1414d ≈ U+6451, GWOK3, to slap, strike
== CB 412.04 GWAAK3
Source
64.8RSIndex
UTC00371
MW 1425, LA2, to take up with the fingers, clutch; I rather suspect this is a variant of U+650B; N.B., structurally, this is U+2FF0 U+624C U+983C, but Wenlin won't let me do it that way
Source
64.16RSIndex
11/1/2006 12
-
UTC00375
MW 1677 ≈ U+5C62 LEUI5 often, repeatedlySource
63.11RSIndex
UTC00380
Lau 2339/2340 == MW 2044 == Cowles 3033 == CB380.02 383.05 ≈ U+6271 NGAP6 to blink, nod, beckon; JAP1 a pinch of something
Source
30.12RSIndex
UTC00386
MW 2177 ≈ U+8334 WUI4 caraway, coriander
Here's a case crying out for variant selectors if ever one did
Source
140.5RSIndex
UTC00389
MW 2237 ≈ U+6500 PAAN1 to pull, drag; to climb; toimplicate
Source
64.15RSIndex
UTC00393
MW 2321 == Cowles 3266 ≈ U+568A, U+21020 BEI6or; emphatic particle
Source
30.10RSIndex
11/1/2006 13
-
UTC00394
MW 2339 == Cowles 3476 ≈ U+5288, U+64D7 PEK3to split, rive, break open
Source
64.13RSIndex
UTC00395
MW 2376a ≈ U+4F75 BING6 even; together; reduceSource
9.8RSIndex
UTC00398
MW 2403 == Cowles 3326 == CB 438.02 ≈ U+7172BOU1 to boil, cook; a pot
Source
86.15RSIndex
UTC00399
MW 2431 == Cowles 3523 ≈ U+6CE1 POU5 a bubble,froth
Source
85.8RSIndex
UTC00400
MW 2431 == Cowles 3523.5 ≈ U+6CE1 POU5 a bubble, froth
Source
30.8RSIndex
11/1/2006 14
-
UTC00410
Cowles 3815 == MW 2613 ≈ U+4F7F SAI2 to use, send, employ
Source
30.8RSIndex
UTC00414
MW 2941a ≈ U+9193 TAAM5 brine of pickled minceSource
130.9RSIndex
UTC00420
Cowles 4114 == MW 3029 == CB 375.08 ≈ U+7AC7DAU3 a cave
Source
30.11RSIndex
UTC00425
MW 3338f ≈ U+6AA3 CHEUNG4 a boom, sparSource
72.13RSIndex
UTC00427
Cowles 4582 == MW 3397 ≈ U+564D JIU6 to chew,masticate
Source
30.15RSIndex
11/1/2006 15
-
UTC00429
Cowles 4795 == CB 375.05 == MW 3438 ≈ U+554BCHOI1 interjection; fie
Source
30.11RSIndex
UTC00431
Cowles 4279 == MW 3602 ≈ U+6D1E DUNG6 to see through; a cave
Source
30.9RSIndex
UTC00432
MW 3708b ≈ U+9370 WAAN4 a metal ringSource
167.11RSIndex
UTC00438
MW 3854d ≈ U+60A0 YAU4 distant, far-reachingSource
61.10RSIndex
UTC00446
Cowles 515 == MW 253 CHI1 glutinous, sticky ≈ U+9ED0
Source
30.11RSIndex
11/1/2006 16
-
UTC00447
Fenn 21.13 ≈ U+8517 ZHE1 sugar-caneSource
140.14RSIndex
UTC00450
Cowles 1915 == MW 1026 ≈ U+8804 KAM4 a toad/spider
Source
142.13RSIndex
UTC00452
Cowles 2149 ≈ U+56B9 U+21148 LA3 a particle implying urgency or completion
Source
30.49RSIndex
UTC00453
Cowles 2220 ≈ U+7C15 LAK6 thorn, brambleSource
140.11RSIndex
UTC00454
Cowles 2560 MA4 sesamum; this is clearly a variant of U+8534
Source
140.13RSIndex
11/1/2006 17
-
UTC00455
Cowles 2632 == MW 1787 == CB 377.09 ≈ U+64DDMANG3 to pull, pluck
Source
30.11RSIndex
UTC00465
Cowles 3820 ≈ U+52E2 SAI3 power, authority, forceSource
19.11RSIndex
UTC00469
Cowles 4883 ≈ U+8438 YU4 sumacSource
140.6RSIndex
UTC00472
Lau 1293 ≈ U+8CFA JAAN6 to earn moneySource
154.16RSIndex
UTC00473
Fenn 67.06 (preferred form!) == Lau 1465, a clear Z-variant of U+76F4
Source
109.3RSIndex
11/1/2006 18
-
UTC00474
Lau 1467, a clear Z-variant of U+690DSource
75.8RSIndex
UTC00475
Lau 1580 ≈ U+947D JYUN to drillSource
167.15RSIndex
UTC00476
Lau 1581 ≈ U+947D JYUN to drillSource
167.17RSIndex
UTC00477
Lau 1653 (index form) KAM2 to cover
If it weren't for the definition and pronunciation, I'd say this was just a Z-variant of U+519A
Source
14.8RSIndex
UTC00479
Fenn 64.06 QIAN2 cash, a variant of UTC 1 and U+9322
Source
4.2RSIndex
11/1/2006 19
-
UTC00480
Lau 2317 == CB 346.03 ≈ U+8D17 NGAAN6 counterfeit, false
Source
154.12RSIndex
UTC00486
From the cover of my Chu Nom dictionary, used as the "chu" part of "chu nom" and hence a synonym forU+5B57
Source
40.8RSIndex
UTC00487
MW 3370 CHIN2 cash, yet another variant of UTC 1 and U+9322
Source
4.2RSIndex
UTC00493
MW 282d ≈ U+7FB6 JIN1 frowsy, rank
I'm not at all sure about the RS value here; the shapeis definitely what MW has, however
Source
24.13RSIndex
UTC00494
MW 866a ≈ U+8ABC YI6 related, proper
I'm not entirely sure of the glyph here, since neither form is in the RS index, but I believe this is correct
Source
149.12RSIndex
11/1/2006 20
-
UTC00495
MW 1827f ≈ U+982B MIN5 to have an audience
This form isn't in the RS index under 154.6, so I maybe misidentifying the glyph, but the glyph definitely looks like this
Source
154.6RSIndex
UTC00503
*sheesh* This one looks too familiar to be believed. I'm going to kick myself when I remember it. This looks like a variant of U+617C, based on appearance,Stimson's RS index (61.11, not 61.10), definition, andpronunciation, but the glyph is significantly different
Source
61.10RSIndex
UTC00508
Fenn 92.08 CHOU3 to look at ≈ U+77C1Source
109.10RSIndex
UTC00520
Shangwu 123.04 ≈ U+9762Source
30.5RSIndex
UTC00523
CB 358.05 358.06 JO2 aspect marker, completed action
This is just a variant of U+5528
Source
30.7RSIndex
11/1/2006 21
-
UTC00531
Fenn 136.08 ≈ U+98A8 FENG1 windSource
16.3RSIndex
UTC00536
Fenn 139.03 ≈ U+4F5B FO2 BuddhaSource
9.17RSIndex
UTC00539
Fenn 148.07 167P HAN4 Chinese
I can't believe we don't already have this one
Source
140.8RSIndex
UTC00547
Stimson 7.101 ≈ U+4FDC peng lonelySource
9.9RSIndex
UTC00553
CB 376.06 ≈ UTC 356Source
30.11RSIndex
11/1/2006 22
-
UTC00555
Fenn deserves to be shot for this bizarre variant of U+4F1A (209.04)
Source
9.4RSIndex
UTC00567
Fenn 241.02 KAN4 to look at ≈ U+770B
Yet another bizarre y-variant from Fenn
Source
109.5RSIndex
UTC00572
CB 386.04 BE6 final particle; this is probably just a variant of U+568A
Source
30.14RSIndex
UTC00580
Fenn 286.11 ≈ U+89BD LAN3 to look atSource
147.8RSIndex
UTC00582
Fenn 299.05 ≈ U+6B77 LI4 take place, past, historySource
61.12RSIndex
11/1/2006 23
-
UTC00592
Fenn 358.04 NEN4 tender ≈ U+5AE9, U+5AF0Source
38.9RSIndex
UTC00593
Fenn 359.01 ≈ U+80FD NENG2 ability, talentSource
130.7RSIndex
UTC00595
Fenn 430.07 ≈ U+5AC2 SAO3 elder brother's wifeSource
38.10RSIndex
UTC00608
Fenn 507.05 ≈ U+7B49 DENG3 to waitSource
140.6RSIndex
UTC00616
Fenn 559.12 ≈ U+8EA5 CUAN1 to leap, jump, pranceSource
187.18RSIndex
11/1/2006 24
-
UTC00619
CB 399.08 ME1 to carry on the back (y-variant of U+5B6D)
Source
39.9RSIndex
UTC00620
CB 399.07 ME1 to carry on the back (z-variant of U+2197C, y-variant of U+5B6D)
Source
39.9RSIndex
UTC00626
CB 401.09 YAN6 sharp; this is clearly a variant of U+5C7B, although both occur independently in the same source
Source
46.4RSIndex
UTC00627
CB 401.08 YAN6 sharp; the glyph here is less than certain, because it looks in CB like a standard Z-variant of U+5C7B. The character intended is either ay-variant of U+5C7B (as with the glyph here) or a z-variant
Source
46.3RSIndex
UTC00641
Fenn 644.07 645.06 ≈ U+8207 與 YU2 particle of query, exclamation
Source
55.2RSIndex
11/1/2006 25
-
UTC00658
Fenn 32.10 421K ≈ U+8640, U+9F4F, U+2718C, U+27171 ZHI1 a leek
I should re-evaluate this one, as it may be U+27171or U+2718C, but the glyphs are subtly different
Source
140.23RSIndex
UTC00663
Fenn 153.07≈ U+8861 HENG2 balance, measure, weigh
Source
144.11RSIndex
UTC00668
Xiandai Hanyu Cidian 832.08, SC form of U+5D19
Email from [email protected], 4 June 2004
Source
46.4RSIndex
UTC00669
Xiandai Hanyu Cidian 1282.05, SC form of U+7A68
Email from [email protected], 4 June 2004
Source
115.11RSIndex
UTC00674
XHC 1282.02 TUI2 SC form of U+96A4Source
170.9RSIndex
11/1/2006 26
-
UTC00675
XHC 1346.18 XI1 SC form of U+8B46Source
149.12RSIndex
UTC00676
XHC 1653.06 ZHUAN4 SC form of U+8B54Source
149.12RSIndex
UTC00677
XHC 391.04 FU3 SC form of U+982BSource
181.6RSIndex
UTC00678
XHC 532.08 HU2 SC form of U+992CSource
184.9RSIndex
UTC00679
XHC 1673.02 ZONG1 SC form of U+9A23Source
187.9RSIndex
11/1/2006 27
-
UTC00680
XHC 1363.05 SC form of U+9DB1Source
196.10RSIndex
UTC00687
HYDZD 20920.050, which IMHO is not a z-variant of U+5B9C
Source
40.4RSIndex
UTC00728
CB 451.07 PA4 chin (y-variant of U+26629)Source
130.5RSIndex
UTC00733
CB 453.09 CHEUN1 egg, ovum, a y-variant of U+81A5
Source
130.11RSIndex
UTC00777
HYDZD 31707.010 xuán
This is currently unified with U+6ECB, but the HYDZDtreats it as distinct in pronunciation and meaning (viz., as a y-variant of U+7386), so it should be separated
Source
85.10RSIndex
11/1/2006 28
-
UTC00789
CB 457.06 goeng2 koeng2 roots (variant of U+8503)Source
140.12RSIndex
UTC00792
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00793
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00794
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00795
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 29
-
UTC00796
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00797
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00798
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00799
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00800
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 30
-
UTC00802
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00803
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00804
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00805
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00806
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 31
-
UTC00807
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00808
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00809
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00812
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00814
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 32
-
UTC00815
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00818
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00819
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00820
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00821
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 33
-
UTC00825
Source
RSIndex
UTC00826
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00827
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00828
Wenlin variant form used for glyph compositionSource
RSIndex
UTC00831
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 34
-
UTC00833
Wenlin variant form used for glyph compositionSource
RSIndex
11/1/2006 35