[10] MULE used several private-use final bytes. >>16
[19] chinese-sisheng ("SiSheng (PinYin/ZhuYin)", Emacs 20+; Mule 2.3: sisheng_cwnn >>65; XEmacs: sisheng, ltr) - 94-set 3/0 ★
[69] lc-thai (TIS 620, Mule 2.3 (obsolete)) - 94-set 3/1
[26] lao ("Lao" U+0E81 - U+0EDF, Emacs 20+, XEmacs UTF-2000) - 94-set 3/1 ★
[23] arabic-digit ("Arabic digit", Mule 2.3: MuleArabic-0 >>65 = lc-arb0, ltr) - 94-set 3/2
[24] arabic-1-column ("Arabic 1-column", Mule 2.3: MuleArabic-1 >>65 = lc-arb1, rtl) - 94-set 3/3
[25] arabic-2-column ("Arabic 2-column", Mule 2.3: MuleArabic-2 >>65 = lc-arb2, rtl) - 94-set 3/4
[27] indian-is13194 ("Indian IS 13194 (DEV)", Emacs 20+) - 94-set 3/5 ★
[20] ipa ("IPA", Mule 2.3: MuleIPA >>65; Emacs 20+, XEmacs, ltr) - 96-set 3/0 ★
[21] vietnamese-viscii-lower ("VISCII lower-case", Mule 2.3: VISCII1.1 >>65; Emacs 20+, XEmacs) - 96-set 3/1 ★
[22] vietnamese-viscii-upper ("VISCII upper-case", Mule 2.3: VISCII1.1 >>65; Emacs 20+; XEmacs) - 96-set 3/2 ★
[99] mule-ucs-unicode-multichar - 96-set 3/14
[17] chinese-big5-1 ("Big5 (Level-1) A141-C67F", Mule 2.3: lc-big5-1, Emacs 20+; XEmacs) - 94² set 3/0 ★
[18] chinese-big5-2 ("Big5 (Level-2) C940-FEFE", Mule 2.3: lc-big5-2, Emacs 20+; XEmacs) - 94² set 3/1 ★
[71] lc-ethio (Mule 2.3, obsolete) - 94² set 3/2
[36] ethiopic ("Ethiopic characters", Emacs 20+; XEmacs) - 94² set 3/3 ★
[30] indian-2-column ("Indian 2 Column", Emacs 20+) - 94² set 3/5
[29] indian-1-column ("Indian 1 Column", Emacs 20+) - 94² set 3/6
[31] tibetan ("Tibetan 2 column", Emacs 20+) - 94² set 3/7
[32] tibetan-1-column ("Tibetan 1 column", Emacs 20+) - 94² set 3/8
[72] mojikyo-2022-1 (XEmacs UTF-2000, obsolete) - 94³ set 3/10
[81] mojikyo-2022-2 (XEmacs UTF-2000, obsolete) - 94³ set 3/11
[82] Probably only reserved, then retired without ever being implemented.
[73] thai-xtis (XEmacs) - 94² set 3/15
[74] bitmap (BITMAP-MULE) - 96² set 3/0
[35] mule-unicode-0100-24ff ("Unicode subset (U+0100..U+24FF)", Mule-UCS; Emacs 21+) - 96² set 3/1
[33] mule-unicode-2500-33ff ("Unicode subset (U+2500..U+33FF)", Mule-UCS; Emacs 21+) - 96² set 3/2
[34] mule-unicode-e000-ffff ("Unicode subset (U+E000..U+FFFF)", Mule-UCS; Emacs 21+) - 96² set 3/3
[75] lc-arb3 (Mule 2.3) - 96² set 3/3
[76] lc-arb4 (Mule 2.3) - 96² set 3/4
[77] cgreek (cgreek) - 96² set 3/4
[28] indian-glyph ("Indian glyph") - 96² set 3/4
[43] Control functions that represent character composition - 3/0 - 3/4
private-use control functions
[100] Entries marked with ★ have conversion tables to and from Unicode in Mule-UCS.
[63]
Of these, the two Big5 sets are also implemented in X's ctext under the names BIG5-E0 and BIG5-E1, reportedly for compatibility with Emacs.
>>16
[64] lao is implemented in X's ctext as the extended segment mulelao-1.
It does not use a final byte.
extended segment
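[70]
lc-thai and lao reuse the same final byte.
lc-thai is described as TIS 620, and its Fp appears to have been retired once an official final byte was assigned.
Switching from Thai script to Lao script looks like an incompatible change, but the two are closely related:
lao is TIS 620 with the Thai letters replaced by Lao letters, so the change is not entirely incompatible.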
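[79]
Mule (GNU Emacs, XEmacs) lets character sets and their final bytes be defined in elisp.
Mule itself described the coded graphic character sets with private-use final bytes as examples of this use >>65.
Some add-on packages outside Mule proper did define their own character sets,
and there may also have been private uses beyond the published ones.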
[80]
Because there was no fully centralized management (and because several implementation lines existed: the old Mule, GNU Emacs, and XEmacs), private-use final bytes collide in some cases.
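The collisions mentioned in [80] can be found mechanically from list [10]. The following is a minimal sketch, assuming the list is read as (set type, final byte, name) triples; the variable and function names are illustrative and only the 94-set and 96²-set entries are copied.

```python
from collections import defaultdict

# (set type, private final byte, charset name), read off list [10].
# Only the 94-set and 96^2-set entries are copied to keep the sketch short.
MULE_PRIVATE_SETS = [
    ("94", "3/0", "chinese-sisheng"),
    ("94", "3/1", "lc-thai"),
    ("94", "3/1", "lao"),
    ("94", "3/2", "arabic-digit"),
    ("94", "3/3", "arabic-1-column"),
    ("94", "3/4", "arabic-2-column"),
    ("94", "3/5", "indian-is13194"),
    ("96^2", "3/0", "bitmap"),
    ("96^2", "3/1", "mule-unicode-0100-24ff"),
    ("96^2", "3/2", "mule-unicode-2500-33ff"),
    ("96^2", "3/3", "mule-unicode-e000-ffff"),
    ("96^2", "3/3", "lc-arb3"),
    ("96^2", "3/4", "lc-arb4"),
    ("96^2", "3/4", "cgreek"),
    ("96^2", "3/4", "indian-glyph"),
]


def find_collisions(entries):
    """Group names by (set type, final byte); keep groups with two or more names."""
    groups = defaultdict(list)
    for set_type, final, name in entries:
        groups[(set_type, final)].append(name)
    return {key: names for key, names in groups.items() if len(names) > 1}


collisions = find_collisions(MULE_PRIVATE_SETS)
for (set_type, final), names in collisions.items():
    print(f"{set_type}-set {final}: {', '.join(names)}")
```

With this data the shared slots are 94-set 3/1 (lc-thai and lao, the case discussed in [70]), 96²-set 3/3, and 96²-set 3/4.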
[37] See also Mule internal code.
[83] Mapping tables between Mule's own graphic character sets and Unicode are included in Mule-UCS.
[7] ISO-2022-JP-MS assigns ESC $ ( ? to EUDC.
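To make the column/row notation concrete, here is a small sketch that turns a final byte such as 3/15 into the bytes of a designation sequence. It assumes the standard ISO/IEC 2022 intermediate bytes for each G-slot; `colrow`, `is_private_final`, and `designate` are hypothetical helper names, not part of any library.

```python
ESC = 0x1B

# First intermediate byte per G-slot (standard ISO/IEC 2022 assignments).
INTERMEDIATE_94 = {0: 0x28, 1: 0x29, 2: 0x2A, 3: 0x2B}  # ( ) * +
INTERMEDIATE_96 = {1: 0x2D, 2: 0x2E, 3: 0x2F}           # - . /  (no G0 form)


def colrow(notation: str) -> int:
    """Convert column/row notation such as '3/15' to a byte value (0x3F)."""
    col, row = notation.split("/")
    return (int(col) << 4) | int(row)


def is_private_final(byte: int) -> bool:
    """Private-use final bytes (Fp) occupy column 3: 3/0 to 3/15."""
    return 0x30 <= byte <= 0x3F


def designate(final: str, *, size: int = 94, multibyte: bool = False, g: int = 0) -> bytes:
    """Build ESC [$] I F for a 94/96 (or 94^n/96^n) set designated to G0..G3."""
    table = INTERMEDIATE_94 if size == 94 else INTERMEDIATE_96
    seq = [ESC]
    if multibyte:
        seq.append(0x24)  # '$' marks a multiple-byte set
    seq.append(table[g])
    seq.append(colrow(final))
    return bytes(seq)


# The EUDC designation of ISO-2022-JP-MS: a 94^2 set with final 3/15 in G0.
eudc = designate("3/15", multibyte=True)
print(eudc)
```

Passing `size=96` with `g=0` raises `KeyError` in this sketch, because the table above defines no intermediate byte for designating a 96-set to G0.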
[46]
VT uses the 94-set final bytes
3/0 - 3/9,
3/12 - 3/14,
2/2 3/1,
2/5 3/5,
2/5 3/6, and
2/5 3/13.
VT
[95]
On VT,
ESC 2/8 3/1,
ESC 2/9 3/1, and
ESC 2/9 4/2
were sometimes used with a meaning other than designation.
VT, designation sequences
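[98]
>>95 At the same time,
ESC 2/8 3/1 is said to designate ASCII to G0, and
ESC 2/9 4/2 is said to designate ASCII to G1. >>97
4/2 is indeed ASCII, but 3/1 should be a VT-specific set.
Moreover, ESC 2/9 3/1 is not described as a designation.
Was it really implemented this way? It remains a mystery.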
[40]
VT uses the private-use final bytes 3/6 and 3/7 in sequences equivalent to announcer sequences.
VT
[42]
VT uses the type 3F private-use control functions
3/3 - 3/9 and 3/15.
VT
[44]
ctext uses the intermediate byte of a type 3F private-use control function together with
3/0 - 3/1.
ctext
[45]
VT uses DOCS
3/0, 3/4, and 3/8.
Two different uses of 3/8 are known.
VT
[54]
The Tektronix 4014 used
ESC 3/8 - 3/11.
VT, private-use control functions
[55]
Digital Ansi-Compliant Print Protocol Lev 2 Program. Ref. Man. - PPLV2PMB.PDF , 1995-08-31T14:32:02.000Z , 2022-05-02T13:36:24.457Z http://sup.xenya.si/sup/info/digital/MDS/jun99/Cd3/PRINTER/PPLV2PMB.PDF#page=162
94-set:
3/0 DEC Special Graphics
3/4 DEC Dutch
3/5 DEC Finnish
3/6 DEC Norwegian/Danish
3/7 DEC Swedish
3/9 DEC French-Canadian
3/12 User Preference Supplemental (for compatibility, the User Preference Supplemental character set is set to the DEC Supplemental character set)
3/13 DEC Swiss
3/14 DEC Technical
2/2 3/4 DEC Hebrew Supplemental
2/5 3/0 DEC 8-Bit Turkish Supplemental
2/5 3/2 DEC 7-Bit Turkish
2/5 3/4 Legal
2/5 3/5 DEC Supplemental
2/5 3/6 DEC Portuguese
2/5 3/13 DEC 7-Bit Hebrew
2/2 3/15 DEC Greek Supplemental
96-set:
3/12 User Preference Supplemental (for compatibility, the User Preference Supplemental character set is set to the DEC Supplemental character set (94-set))
See also VT, DEC character codes.
[89] SCS—Select Character Set , 2022-12-11T09:42:45.000Z https://vt100.net/docs/vt510-rm/SCS.html
[90] 94-set:
0 DEC Special Graphic
5 Finnish NRCS
6 Norwegian/Danish NRCS
7 Swedish NRCS
9 French Canadian NRCS
< User-preferred Supplemental
= Swiss NRCS
> DEC Technical Character Set
" 4 DEC Hebrew
" > Greek NRCS
" ? DEC Greek
% 0 DEC Turkish
% 2 Turkish NRCS
% 3 SCS NRCS
% 5 DEC Supplemental
% 6 Portuguese NRCS
% = Hebrew NRCS
& 4 DEC Cyrillic
& 5 Russian NRCS
[91] 96-set:
< User-preferred Supplemental
[93] contra/memo.txt at master · akinomyoga/contra · GitHub , 2022-12-11T10:03:01.000Z https://github.com/akinomyoga/contra/blob/master/memo.txt#L1313
RLogin implements DEC_TCS as "1" of the private 94 charsets,
whereas according to https://vt100.net/docs/vt510-rm/SCS.html DEC_TCS is "<".
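[57] For the following, even final bytes registered in ISO-IR are said to "fall back" to private-use character sets.
>>55
(The designated bit combinations are not necessarily compatible.)
4/3 → DEC Finnish
5/1 → DEC French Canadian
4/5 → DEC Norwegian/Danish
4/8 → DEC Swedish
[92] In >>89, in addition to the four in >>57, 6/0 is also Norwegian/Danish NRCS.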
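The fallback in [57] and [92] amounts to treating certain registered final bytes as aliases of DEC's private ones. A minimal sketch, pairing each registered byte with the private final byte that list [55] gives for the same DEC set; `DEC_FALLBACK` and `normalize_final` are hypothetical names, and a real terminal may resolve these differently.

```python
# Registered final byte -> DEC private final byte for the same national set.
# Targets are taken from list [55]; the last entry appears only in >>89.
DEC_FALLBACK = {
    "C": "5",  # 4/3 -> DEC Finnish (3/5)
    "Q": "9",  # 5/1 -> DEC French Canadian (3/9)
    "E": "6",  # 4/5 -> DEC Norwegian/Danish (3/6)
    "H": "7",  # 4/8 -> DEC Swedish (3/7)
    "`": "6",  # 6/0 -> Norwegian/Danish NRCS
}


def normalize_final(final: str) -> str:
    """Map a registered final byte to its DEC private equivalent, if any."""
    return DEC_FALLBACK.get(final, final)


print(normalize_final("C"), normalize_final("B"))
```

Final bytes with no fallback entry, such as `B` (ASCII), pass through unchanged.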
[96]
South Korean DOS >>94
ESC $ ) 1 = Hangul input mode
ESC ( 2 = English input mode
[56] Related to DEC Kanji:
[61] libmoe: DOCS
[84] ecma35lib: see also DOCS, IRR.
[86] Fp designation sequences, and Fp IRR + Fp designation sequences, in ecma35lib
Type | IRR (IRR Fp) | F (designation sequence Fp) | Set (description >>88)
94-set | - | 2/3 (#) 3/0 (0) | KS X 1003
94-set | 3/0 (0) | 2/3 (#) 3/0 (0) | KS X 1003 with tilde
94-set | - | 2/3 (#) 3/1 (1) | ETS 300 706 Latin G0 for France
94-set | - | 2/3 (#) 3/2 (2) | ETS 300 706 Latin G0 for Spain and Portugal
94-set | - | 2/3 (#) 3/3 (3) | ETS 300 706 Latin G0 for Estonia
94-set | - | 2/3 (#) 3/4 (4) | ETS 300 706 Latin G0 for Latvia and Lithuania
94-set | - | 2/3 (#) 3/5 (5) | ETS 300 706 Latin G0 for Serbia, Bosnia, Croatia and Slovenia
94-set | 3/0 (0) | 2/3 (#) 3/5 (5) | ETS 300 706 Latin G0 for Serbia, Slovenia et al. with the Dollar sign
94-set | - | 2/3 (#) 3/6 (6) | ETS 300 706 Latin G0 for Czech and Slovak
94-set | - | 2/3 (#) 3/7 (7) | ETS 300 706 Latin G0 for Poland
94-set | - | 2/3 (#) 3/8 (8) | ETS 300 706 Latin G0 for Romania
94-set | - | 2/3 (#) 3/9 (9) | ETS 300 706 Latin G0 for Turkey
94-set | - | 2/3 (#) 3/10 (:) | SoftBank 2G (single-byte) Emoji page E
94-set | - | 2/3 (#) 3/11 (;) | SoftBank 2G (single-byte) Emoji page F
94-set | - | 2/3 (#) 3/12 (<) | SoftBank 2G (single-byte) Emoji page G
94-set | - | 2/3 (#) 3/13 (=) | SoftBank 2G (single-byte) Emoji page O
94-set | - | 2/3 (#) 3/14 (>) | SoftBank 2G (single-byte) Emoji page P
94-set | - | 2/3 (#) 3/15 (?) | SoftBank 2G (single-byte) Emoji page Q
94-set | - | 2/4 ($) 3/1 (1) | DEC NRCS for Switzerland (corresponding to DEC's (not ARIB's) G*D4 4)
94-set | - | 2/4 ($) 3/2 (2) | DEC NRCS for the Netherlands (corresponding to DEC's G*D4 =)
94-set | - | 2/4 ($) 3/3 (3) | Marlett encoding
94-set | - | 2/4 ($) 3/4 (4) | Zapf Dingbats, GL range
94-set | - | 2/4 ($) 3/5 (5) | Zapf Dingbats, GR range
94-set | - | 2/4 ($) 3/6 (6) | Symbol font encoding, GL range
94-set | - | 2/4 ($) 3/7 (7) | Symbol font encoding, GR range (no euro)
94-set | - | 2/4 ($) 3/8 (8) | 7-bit Maltese
94-set | - | 2/4 ($) 3/9 (9) | 7-bit Icelandic
94-set | - | 2/4 ($) 3/10 (:) | 7-bit Polish
94-set | - | 2/4 ($) 3/11 (;) | ISO 11822:1996 Arabic supplementary set
94-set | - | 2/4 ($) 3/12 (<) | ISO 10586:1996 Georgian
94-set | - | 2/4 ($) 3/13 (=) | ISO 10585:1996 Armenian
96-set | - | 2/1 (!) 3/0 (0) | RFC 1345's so-called ISO-IR-111/ECMA-Cyrillic (incompatible with ISO-IR-111 itself).
96-set | 3/0 (0) | 2/4 ($) 3/7 (7) | Symbol font encoding, GR range (with figure space)
96-set | 3/15 (?) | 2/4 ($) 3/7 (7) | Symbol font encoding, GR range (with euro)
94² set | - | 2/1 (!) 3/0 (0) | GB/T 12052 (Korean in Mainland China)
94² set | 3/0 (0) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (ICU EUC-2014 version)
94² set | 3/1 (1) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Microsoft version)
94² set | 3/2 (2) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Apple version)
94² set | 3/3 (3) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (GOV-TW version)
94² set | 3/4 (4) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (old ICU version)
94² set | 3/5 (5) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (IBM version)
94² set | 3/6 (6) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Yasuoka version)
94² set | 3/7 (7) | 2/1 (!) 3/1 (1) | Planes 2 and up of CNS 11643 as a 94^3 set, as included by IBM EUC-TW as its G2 set (ICU EUC-2014 version)
94² set | 3/5 (5) (perhaps 3/8?) | 2/1 (!) 3/1 (1) | Planes 2 and up of CNS 11643 as a 94^3 set, as included by IBM EUC-TW as its G2 set (IBM version)
94² set | 3/15 (?) | 2/1 (!) 3/1 (1) | All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (recommended version)
94² set | 3/0 (0) | 2/1 (!) 3/2 (2) | IBM Big-5 ETEN-based in-plane extensions (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 3/1 (1) | 2/1 (!) 3/2 (2) | Big5-ETEN with the subset of GCCS encoded with lead bytes following, not preceding, the standard Big-5 assignments (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 3/15 (?) | 2/1 (!) 3/2 (2) | MS-950 Big-5 extensions (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/0 (@) | 2/1 (!) 3/2 (2) | Big5-2003 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/1 (A) | 2/1 (!) 3/2 (2) | Big5-ETEN extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/2 (B) | 2/1 (!) 3/2 (2) | Hong Kong GCCS extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/3 (C) | 2/1 (!) 3/2 (2) | Hong Kong Supplementary Character Set 1999 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/4 (D) | 2/1 (!) 3/2 (2) | Hong Kong Supplementary Character Set 2001 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/5 (E) | 2/1 (!) 3/2 (2) | Hong Kong Supplementary Character Set 2004 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/6 (F) | 2/1 (!) 3/2 (2) | Hong Kong Supplementary Character Set full (GCCS + 2008) extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 3/15 (?) | 2/1 (!) 3/3 (3) | Non-ETEN Big5 kana and Cyrillic (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)
94² set | 4/0 (@) | 2/1 (!) 3/3 (3) | Non-ETEN Big5 kana and Cyrillic (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere) combined with Microsoft non-EUDC extensions, as in Python's built-in (used when not on Windows or when MS-950 is not the ACP) version of "cp950".
94² set | 3/0 (0) | 2/1 (!) 3/4 (4) | IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); old mappings for use with 78JIS
94² set | 3/15 (?) | 2/1 (!) 3/4 (4) | IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); excluding UDC
94² set | 4/0 (@) | 2/1 (!) 3/4 (4) | IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); including UDC
94² set | - | 2/1 (!) 3/5 (5) | DoCoMo Emoji extensions for Shift_JIS (as above)
94² set | 3/0 (0) | 2/1 (!) 3/6 (6) | KDDI Emoji extensions for Shift_JIS (as above), symbolic zodiac variant
94² set | 3/15 (?) | 2/1 (!) 3/6 (6) | KDDI Emoji extensions for Shift_JIS (as above), pictorial zodiac variant
94² set | - | 2/1 (!) 3/7 (7) | SoftBank Emoji extensions for Shift_JIS (as above)
94² set | 3/0 (0) | 2/1 (!) 3/8 (8) | GB 7589 (supplementary simplified). This is generated from the GB 13131 mappings, and may be minorly inaccurate in places.
94² set | 3/15 (?) | 2/1 (!) 3/8 (8) | GB 13131 (supplementary traditional).
94² set | 3/0 (0) | 2/1 (!) 3/9 (9) | GB 7590 (further supplementary simplified). This is generated from the GB 13132 mappings, and may be minorly inaccurate in places.
94² set | 3/15 (?) | 2/1 (!) 3/9 (9) | GB 13132 (further supplementary traditional). This is based on Unihan source data, and has several gaps.
94² set | 3/0 (0) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), updated mappings (recommended; default)
94² set | 3/1 (1) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), marginally newer (but not that up-to-date) Apple mappings given by Apple in mapping file comments
94² set | 3/2 (2) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), old Apple mappings
94² set | 3/3 (3) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), Adobe CID mappings (very partial)
94² set | 3/4 (4) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), mappings taking advantage of the PUA of the Nishiki-teki font where possible
94² set | 3/15 (?) | 2/1 (!) 3/10 (:) | HangulTalk second plane (accepted by HangulTalk filter in G3 slot), Apple mappings
94² set | - | 2/1 (!) 3/11 (;) | Non-syllable part of KPS 9566-2011 outside the main plane (accepted by UHC filter in G3 slot)
94² set | - | 2/1 (!) 3/12 (<) | Big5-E extensions (for Big-5 filter's G3 slot)
94² set | - | 2/1 (!) 3/13 (=) | KS X 1002 (South Korean first supplementary plane)
94² set | - | 2/1 (!) 3/14 (>) | KS X 1027-1 (South Korean second supplementary plane)
94² set | - | 2/1 (!) 3/15 (?) | KS X 1027-2 (South Korean third supplementary plane)
94² set | 3/0 (0) | 2/2 (") 3/0 (0) | Big5 AtOn/ChinaSea extensions (for Big-5 filter's G3 slot), alternate version
94² set | 3/15 (?) | 2/2 (") 3/0 (0) | Big5 AtOn/ChinaSea extensions (for Big-5 filter's G3 slot)
94² set | 3/0 (0) | 2/2 (") 3/1 (1) | IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though. IBM mappings including corporate PUA.
94² set | 3/15 (?) | 2/2 (") 3/1 (1) | IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though. Reconstructed original version.
94² set | 4/0 (@) | 2/2 (") 3/1 (1) | IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though.
94² set | 3/0 (0) | 2/2 (") 3/2 (2) | "General Purpose Hanzi" (a small supplement of fewer than 200 characters for use alongside GBs 2312, 7589 and 7590 for Simplified Chinese).
94² set | 3/15 (?) | 2/2 (") 3/2 (2) | GB 16500 (yet another supplementary set, numbered the seventh in its title though Unihan calls it "GE").
94² set | 4/0 (@) | 2/2 (") 3/2 (2) | GB 16500 combined with "General Purpose Hanzi" (do not overlap, both are plane "7").
94² set | - | 2/2 (") 3/3 (3) | Big5 DynaLab extensions (for Big-5 filter's G3 slot)
94² set | - | 2/2 (") 3/4 (4) | Big5 Monotype extensions (for Big-5 filter's G3 slot)
94² set | - | 2/2 (") 3/5 (5) | Big5-Plus in-plane extensions (for Big-5 filter's G3 slot)
94² set | - | 2/2 (") 3/6 (6) | Big5-Plus out-of-plane extensions (not currently usable as such)
94² set | - | 2/2 (") 3/7 (7) | IBM Big5 non-ETEN out-of-plane extensions (not currently usable as such)
96² set | 3/0 (0) | 2/1 (!) 3/0 (0) | GBK extras, WHATWG/HTML5 variant
96² set | 3/1 (1) | 2/1 (!) 3/0 (0) | GBK extras, mapping all characters with defined glyphs to non-PUA
96² set | 3/15 (?) | 2/1 (!) 3/0 (0) | GBK extras (GB 18030, level 5 with associated UDC zone and non-URO part of level 4; accepted by GBK filter in G3 slot)
96² set | 3/0 (0) | 2/1 (!) 3/1 (1) | EACC / CCCII, Koha Taiwan version
96² set | 3/1 (1) | 2/1 (!) 3/1 (1) | EACC / CCCII, Hong Kong Innovative Users Group / Hong Kong University version
96² set | 3/2 (2) | 2/1 (!) 3/1 (1) | EACC / CCCII, aggregate version with Taiwan layout of row 2, favouring Unihan kCCCII for kanji mappings (default)
96² set | 3/3 (3) | 2/1 (!) 3/1 (1) | EACC / CCCII, aggregate version with Hong Kong layout of rows 0–2, favouring Library of Congress for kanji mappings
96² set | 3/15 (?) | 2/1 (!) 3/1 (1) | EACC / CCCII, Library of Congress version
[85] ISO/IEC 2022 does not clearly specify Fp for IRR, but it is used in some places.
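As an illustration of how an IRR Fp and an Fp designation from table [86] might combine on the wire, the sketch below emits IRR (ESC 2/6 F) followed by the designation sequence. It assumes the bytes in the F column follow the usual set-type and G-slot intermediates, as second intermediate bytes do in ISO/IEC 2022; this has not been checked against ecma35lib itself, and `irr` and `designate_with_irr` are hypothetical names.

```python
from typing import Optional

ESC = b"\x1b"


def irr(final: str) -> bytes:
    """IRR (identify revised registration): ESC 2/6 F, i.e. ESC & F."""
    return ESC + b"&" + final.encode("ascii")


def designate_with_irr(slot: bytes, second: str, final: str,
                       irr_final: Optional[str] = None) -> bytes:
    """Build [ESC & Firr] ESC <slot> <second intermediate> <final>.

    slot is the set-type/G-slot part, e.g. b"(" for a 94-set in G0
    or b"$+" for a 94^2 set in G3 (assumed layout, see lead-in).
    """
    prefix = irr(irr_final) if irr_final is not None else b""
    return prefix + ESC + slot + second.encode("ascii") + final.encode("ascii")


# KS X 1003 with tilde: IRR 3/0, F 2/3 3/0, designated to G0 as a 94-set.
ksx1003_tilde = designate_with_irr(b"(", "#", "0", irr_final="0")
# MS-950 Big-5 extensions: IRR 3/15, F 2/1 3/2, designated to G3 as a 94^2 set.
ms950_ext = designate_with_irr(b"$+", "!", "2", irr_final="?")
print(ksx1003_tilde, ms950_ext)
```

Rows of the table with no IRR entry correspond to calling the helper without `irr_final`, which yields the bare designation sequence.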
Fp
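[78]
The Waseda internationalized multilingual processing environment was apparently based on ISO 2022
and is said to have supported a variety of characters not in ISO-IR,
so it may have used Fp, but there is almost no information and the details are unknown.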
[102]
The skf documentation says:
vi) Partly because of mobile-phone emoji, several private-use code sets unrelated to IANA are defined and can be invoked.
>>101 The details are not spelled out, but according to the source file in_code_table.c in skf_2.00.17.tar.xz, the following are defined:
/* ESC-$-? : shorten multibyte sequence */
/* following set is not iso-2022 compliant, but can't distinguish */
{'G',1,128,vodafone_p1_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page1",NULL},
{'E',1,128,vodafone_p2_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page2",NULL},
{'F',1,128,vodafone_p3_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page3",NULL},
{'O',1,128,vodafone_p4_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page4",NULL},
{'P',1,128,vodafone_p5_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page5",NULL},
{'Q',1,128,vodafone_p6_uni_byte,COD_MB,NULL,L_JP,NULL,
  "Vodafone picture page6",NULL},
{'b',2,ISOMB_CODE_END,cp932_uni_byte,COD_MB,NULL,M_JP,NULL,
  NULL,"MS cp932"}, /* cp932 as documented */
{'i',2,ISOMB_CODE_END,cp943_uni_byte,COD_MB,NULL,M_JP,NULL,
  NULL,"IBM cp943"}, /* cp943: OS/2 and AS/400 */
{'j',2,ISOMB_CODE_END,cp932w_uni_byte,COD_MB,NULL,M_JP,NULL,
  NULL,"MS cp932w"}, /* cp932 + wchar compatibility */
{'B',2,ISOMB_CODE_END,cp932_uni_byte,COD_MB,NULL,M_JP,NULL,
  NULL,"MS cp932-DMY"}, /* cp932-JIS dummy entry */
/* ESC-$-F : private sequence (F < '@') */
{0x38,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  L_NU,NULL,NULL,NULL}, /* arib graphic1 */
{0x39,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  L_NU,NULL,NULL,NULL}, /* arib graphic2 */
{0x3a,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  L_NU,NULL,NULL,NULL}, /* arib kanjiadd */
{0x3f,2,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  L_NU,NULL,NULL,NULL}, /* iso-2022-jp-ms */