私用図形文字集合

[12] 私用終端バイト Fp は、私用のための終端バイトです。

ISO/IEC 2022 の一般規則

[1] Fp は終端バイトのうち、 03/00〜03/15 を表します。 JIS X 0202:1998 13.2

[2] どのエスケープ・シーケンスでも、 Fp は常に私用に予約されています。私用のエスケープ・シーケンスは ISO 2375 による登録の範囲外とされています。 JIS X 0202:1998 13.3.3

[3] ESC 02/04 02/08 03/00 (私用の多バイト図形文字集合を G0 に指示) において、 03/00 が Fp。

[4] ISO/IEC 2022 はどのエスケープ・シーケンスでもと書いていますが、本当にどのエスケープ・シーケンスでもよいのかはよくわかりません。

[51] 7F 型エスケープ・シーケンスは ISO/IEC 2022:1994 では使われていません (将来の標準化のために予約されています) が、それに属する、例えば ESC 02/07 03/00 を情報交換の当事者間の合意で適当な意味に割当ててしまってもよいのかははっきりしません。

[52] 使用されていない中間バイトを用いた ESC 02/08 02/07 02/06 02/05 03/08 を使っても良いのか、良いとしたらどんな意味に使っても良いのかはよくわかりません。

[53] 0F 型エスケープシーケンス (告知シーケンス) は Fp が使えるような規定には読めません。実際には私用した事例もあるようですが。

規格による割当

[13] ISO/IEC 2022 にとっての「私用」は、他の応用規格がその目的の範囲内で定義して利用させることと矛盾しません。実際いくつかの事例がありました。

[5] CNS 11643 は私用終端バイト 3/0 - 3/15, 2/1 3/0 - 2/4 3/15 の80個の94²文字集合として用いられると定められています。 CNS 11643

[9] KS C 5601-1987 は94²文字集合の私用終端バイト 3/1 を用いていました。 KS C 5601-1987

[11] ARIB STD-B5, ARIB STD-B24 は独自の図形文字集合に 3/0-3/11 を割り当てています。 3/0 - 3/8 は94集合、 3/9 - 3/11 は94²集合です。 >>15

[14] Videotex は独自の図形文字集合に私用終端バイトを用いていました。 94集合の 3/3 と 96集合の 3/8 と 3/9 がありました。 T.101 の1個目の PDF、111頁

[39] ctext は 3F の中間バイト1バイト + 3/0 or 3/1 を版付けに使っています。 ctext

[38] ctext は非標準復帰 DOCS の 3/0 - 3/4 を規定しています。 3/5 - 3/15 も将来の拡張で利用する可能性を想定した実装が要請されています。 ctext

[41] MARC-8 は 3/1, 3/2, 3/3, 3/4 を定義していました。 3/1 は94³集合 (EACC), その他は 94集合でした。 (複数バイト集合 G0 指示は旧仕様 ESC 2/4 I) MARC-8, EACC なお 4/1 と説明するものもあります。 CCCII

[8] GB 12054 は [ 3/0, 3/7 ] を中華人民共和国の国家登記簿による割当、 [ 3/8, 3/15 ] を私用としていました。 (これに基づく登録が実在するのかは不明。)

[6] JIS X 0208 は外字利用時に私用終端バイトを用いることを求めています。 (具体的な指定はなし。)

[15] Data Coding and Transmission Specification for Digital Broadcasting ARIB STANDARD - 6-STD-B24v6_4-1p3-E1.pdf, 2021-09-29T18:13:18.000Z, 2022-04-30T04:58:39.850Z http://www.arib.or.jp/english/html/overview/doc/6-STD-B24v6_4-1p3-E1.pdf#page=56

実装による割当

[10] MULE はいくつかの私用終端バイトを使っていました。 >>16

[19] chinese-sisheng ("SiSheng (PinYin/ZhuYin)", Emacs 20+; Mule 2.3: sisheng_cwnn >>65; XEmacs: sisheng, ltr) - 94集合 3/0 ★
[69] lc-thai (TIS 620, Mule 2.3 (廃止)) - 94集合 3/1
[26] lao ("Lao" U+0E81 - U+0EDF, Emacs 20+, XEmacs UTF-2000) - 94集合 3/1 ★
[23] arabic-digit ("Arabic digit", Mule 2.3: MuleArabic-0 >>65 = lc-arb0, ltr) - 94集合 3/2
[24] arabic-1-column ("Arabic 1-column", Mule 2.3: MuleArabic-1 >>65 = lc-arb1, rtl) - 94集合 3/3
[25] arabic-2-column ("Arabic 2-column", Mule 2.3: MuleArabic-2 >>65 = lc-arb2, rtl) - 94集合 3/4
[27] indian-is13194 ("Indian IS 13194 (DEV)", Emacs 20+) - 94集合 3/5 ★
[20] ipa ("IPA", Mule 2.3: MuleIPA >>65; Emacs 20+, XEmacs, ltr) - 96集合 3/0 ★
[21] vietnamese-viscii-lower ("VISCII lower-case", Mule 2.3: VISCII1.1 >>65; Emacs 20+, XEmacs) - 96集合 3/1 ★
[22] vietnamese-viscii-upper ("VISCII upper-case", Mule 2.3: VISCII1.1 >>65; Emacs 20+; XEmacs) - 96集合 3/2 ★
- [68] Mule 2.3 ドキュメントが終端バイト 3/1 とする >>65 のは誤植
[99] mule-ucs-unicode-multichar - 96集合 3/14
[17] chinese-big5-1 ("Big5 (Level-1) A141-C67F", Mule 2.3: lc-big5-1, Emacs 20+; XEmacs) - 94²集合 3/0 ★
[18] chinese-big5-2 ("Big5 (Level-2) C940-FEFE", Mule 2.3: lc-big5-2, Emacs 20+; XEmacs) - 94²集合 3/1 ★
[71] lc-ethio (Mule 2.3, 廃止) - 94²集合 3/2
[36] ethiopic ("Ethiopic characters", Emacs 20+; XEmacs) - 94²集合 3/3 ★
[30] indian-2-column ("Indian 2 Column", Emacs 20+) - 94²集合 3/5
[29] indian-1-column ("Indian 1 Column", Emacs 20+) - 94²集合 3/6
[31] tibetan ("Tibetan 2 column", Emacs 20+) - 94²集合 3/7
[32] tibetan-1-column ("Tibetan 1 column", Emacs 20+) - 94²集合 3/8
[72] mojikyo-2022-1 (XEmacs UTF-2000, 廃止) - 94³集合 3/10
[81] mojikyo-2022-2 (XEmacs UTF-2000, 廃止) - 94³集合 3/11
- [82] おそらく予約されただけで未実装のまま廃止
[73] thai-xtis (XEmacs) - 94²集合 3/15
[74] bitmap (BITMAP-MULE) - 96²集合 3/0
[35] mule-unicode-0100-24ff ("Unicode subset (U+0100..U+24FF)", Mule-UCS; Emacs 21+) - 96²集合 3/1
[33] mule-unicode-2500-33ff ("Unicode subset (U+2500..U+33FF)", Mule-UCS; Emacs 21+) - 96²集合 3/2
[34] mule-unicode-e000-ffff ("Unicode subset (U+E000+FFFF)", Mule-UCS; Emacs 21+) - 96²集合 3/3
[75] lc-arb3 (Mule 2.3) - 96²集合 3/3
[76] lc-arb4 (Mule 2.3) - 96²集合 3/4
[77] cgreek (cgreek) - 96²集合 3/4
[28] indian-glyph ("Indian glyph") - 96²集合 3/4
[43] 文字合成を表現する制御機能 3/0 - 3/4 私用制御機能

[100] ★ 印のあるものは Mule-UCS に Unicode との変換表があります。

[63] このうち Big5 の2つは X の ctext でも Emacs との互換性のためとして BIG5-E0, BIG5-E1 の名前で実装されています。 >>16

[64] lao は X の ctext で extended segment の mulelao-1 として実装されています。終端バイトは使っていません。 extended segment

[70] lc-thai と lao は同じ終端バイトを使いまわしています。 lc-thai は TIS 620 であるとされ、公式な終端バイトが割り当てられたために Fp は廃止されたようです。タイ文字とラオス文字の違いがある非互換変更のように見えますが、両者は親類関係にあり lao は TIS 620 のタイ文字をラオス文字に置き換えたものなので、まったくの非互換でもありません。

[79] Mule (GNU Emacs, XEmacs) は elisp で文字集合やその終端バイトを定義できます。 Mule 本体は私用終端バイトの符号化図形集合をその利用例のように説明していました >>65。 Mule 本体以外の追加で利用するパッケージで文字集合を定義したものが実際にありましたし、公開されていたもの以外にも私的な利用例はあったかもしれません。

[80] そのように完全な中央管理ではなかったため (および旧 Mule, GNU Emacs, XEmacs と複数系統の実装が存在したため) に一部で私用終端バイトの衝突が起きています。

[67] Mule 2.3 には 94集合「Right-to-Left ASCII」があって、終端バイト 4/2 になっていますが >>65、これは Fp ではなく標準の ASCII と同じものです。すべて rtl の文字なのが標準の ASCII と違います。これが実装されていた当時の Mule では標準の ASCII はすべて ltr の文字として扱われていたようです。その後の版の emacsen では標準の ASCII が ltr でも rtl でも使われるようになっているみたいです。 (書字方向モデルや書字方向依存グリフの問題の当時としての1解法だったのでしょう。)

[37] Mule内部コードも参照。

[83] なお Mule 独自の図形文字集合と Unicode の対照表は Mule-UCS に入っています。

[7] ISO-2022-JP-MS は ESC $ ( ? を EUDC に割り当てています。

[46] VT では94集合 3/0 - 3/9, 3/12 - 3/14, 2/2 3/1, 2/5 3/5, 2/5 3/6, 2/5 3/13 が使われています。 VT

[95] VT では、 ESC 2/8 3/1, ESC 2/9 3/1, ESC 2/9 4/2 が指示とは違う意味で使われることがありました。 VT, 指示シーケンス

[98] >>95 それと同時に ESC 2/8 3/1 が G0 に、 ESC 2/9 4/2 が G1 に ASCII を指示するとされます。 >>97 4/2 は ASCII ですが、 3/1 は VT では独自の集合のはずです。しかも ESC 2/9 3/1 は指示とはされていません。本当にこの通りの実装だったのでしょうか。謎です。

[97] DDD1, DDD2, DDD3, 2022-12-12T09:57:40.000Z https://www.vt100.net/docs/vt510-rm/DD1

[40] VT では告知シーケンス相当の私用終端バイト 3/6, 3/7 が使われています。 VT

[42] VT では 3F 型の私用制御機能 3/3 - 3/9, 3/15 が使われています。 VT

[44] ctext では 3F 型の私用制御機能の中間バイトと 3/0 - 3/1 が使われています。 ctext

[45] VT では DOCS 3/0, 3/4, 3/8 が使われています。 3/8 は異なる2種が知られています。 VT

[54] Tektronix 4014 では ESC 3/8 - 3/11 が使われていました。 VT, 私用制御機能

[55] Digital Ansi-Compliant Print Protocol Lev 2 Program. Ref. Man. - PPLV2PMB.PDF, 1995-08-31T14:32:02.000Z, 2022-05-02T13:36:24.457Z http://sup.xenya.si/sup/info/digital/MDS/jun99/Cd3/PRINTER/PPLV2PMB.PDF#page=162

94集合
- 3/0 DEC Special Graphics
- 3/4 DEC Dutch
- 3/5 DEC Finnish
- 3/6 DEC Norwegian/Danish
- 3/7 DEC Swedish
- 3/9 DEC French-Canadian
- 3/12 User Preference Supplemental
  - User Preference Supplemental character set は互換性のため DEC Supplemental character set に設定されている。
- 3/13 DEC Swiss
- 3/14 DEC Technical
- 2/2 3/4 DEC Hebrew Supplemental
- 2/5 3/0 DEC 8-Bit Turkish Supplemental
- 2/5 3/2 DEC 7-Bit Turkish
- 2/5 3/4 Legal
- 2/5 3/5 DEC Supplemental
- 2/5 3/6 DEC Portuguese
- 2/5 3/13 DEC 7-Bit Hebrew
- 2/2 3/15 DEC Greek Supplemental
96集合
- 3/12 User Preference Supplemental
  - User Preference Supplemental character set は互換性のため DEC Supplemental character set (94集合) に設定されている。

VT, DECの文字コードも参照

[89] SCS—Select Character Set, 2022-12-11T09:42:45.000Z https://vt100.net/docs/vt510-rm/SCS.html

[90] 94集合
- 0 DEC Special Graphic
- 5 Finnish NRCS
- 6 Norwegian/Danish NRCS
- 7 Swedish NRCS
- 9 French Canadian NRCS
- < User-preferred Supplemental
- = Swiss NRCS
- > DEC Technical Character Set
- " 4 DEC Hebrew
- " > Greek NRCS
- " ? DEC Greek
- % 0 DEC Turkish
- % 2 Turkish NRCS
- % 3 SCS NRCS
- % 5 DEC Supplemental
- % 6 Portuguese NRCS
- % = Hebrew NRCS
- & 4 DEC Cyrillic
- & 5 Russian NRCS
[91] 96集合
- < User-preferred Supplemental

[93] contra/memo.txt at master · akinomyoga/contra · GitHub, 2022-12-11T10:03:01.000Z https://github.com/akinomyoga/contra/blob/master/memo.txt#L1313

RLogin は private 94 charsets の "1" として DEC_TCS を実装しているが、然し一方で https://vt100.net/docs/vt510-rm/SCS.html によると、DEC_TCS は "<" である。

[57] 次のものは、 ISO-IR に登録された終端バイトも私用の文字集合に「fall back」するとされます。 >>55 (指示されたビット組合せは必ずしも互換ではない)

4/3 → DEC Finnish
5/1 → DEC French Canadian
4/5 → DEC Norwegian/Danish
4/8 → DEC Swedish

[58] 互換端末もこの仕様を踏襲しているようです。 VT

[92] >>89 では >>57 の4種に加えて 6/0 も Norwegian/Danish NRCS。

[96] 大韓民国版 DOS >>94

ESC $ ) 1 = ハングル入力モード
ESC ( 2 = 英語入力モード

[94] ANSI.SYS - Wikipedia, 2022-12-07T23:08:47.000Z, 2022-12-11T14:38:53.494Z https://en.wikipedia.org/wiki/ANSI.SYS

[56] DEC漢字関係:

94集合用終端バイト (94²集合) 3/0 「DEC漢字指示セット 1978年版(旧コマンド)」
94²集合 3/0 UDC
94²集合 3/1 「DEC漢字指示セット 1978年版(新コマンド)」
94²集合 3/3 「DEC漢字指示セット 1983年版」

[61] libmoe: DOCS

[84] ecma35lib: DOCS, IRR も参照。

[86] ecma35lib の Fp 指示列および Fp IRR + Fp 指示列

type: 種別
IRR: IRR Fp
F: 指示列 Fp
set: 説明 >>88

type: 94集合
F: 2/3 (#) 3/0 (0)
set: KS X 1003

type: 94集合
IRR: 3/0 (0)
F: 2/3 (#) 3/0 (0)
set: KS X 1003 with tilde

type: 94集合
F: 2/3 (#) 3/1 (1)
set: ETS 300 706 Latin G0 for France

type: 94集合
F: 2/3 (#) 3/2 (2)
set: ETS 300 706 Latin G0 for Spain and Portugal

type: 94集合
F: 2/3 (#) 3/3 (3)
set: ETS 300 706 Latin G0 for Estonia

type: 94集合
F: 2/3 (#) 3/4 (4)
set: ETS 300 706 Latin G0 for Latvia and Lithuania

type: 94集合
F: 2/3 (#) 3/5 (5)
set: ETS 300 706 Latin G0 for Serbia, Bosnia, Croatia and Slovenia

type: 94集合
IRR: 3/0 (0)
F: 2/3 (#) 3/5 (5)
set: ETS 300 706 Latin G0 for Serbia, Slovenia et al. with the Dollar sign

type: 94集合
F: 2/3 (#) 3/6 (6)
set: ETS 300 706 Latin G0 for Czech and Slovak

type: 94集合
F: 2/3 (#) 3/7 (7)
set: ETS 300 706 Latin G0 for Poland

type: 94集合
F: 2/3 (#) 3/8 (8)
set: ETS 300 706 Latin G0 for Romania

type: 94集合
F: 2/3 (#) 3/9 (9)
set: ETS 300 706 Latin G0 for Turkey

type: 94集合
F: 2/3 (#) 3/10 (:)
set: SoftBank 2G (single-byte) Emoji page E

type: 94集合
F: 2/3 (#) 3/11 (;)
set: SoftBank 2G (single-byte) Emoji page F

type: 94集合
F: 2/3 (#) 3/12 (<)
set: SoftBank 2G (single-byte) Emoji page G

type: 94集合
F: 2/3 (#) 3/13 (=)
set: SoftBank 2G (single-byte) Emoji page O

type: 94集合
F: 2/3 (#) 3/14 (>)
set: SoftBank 2G (single-byte) Emoji page P

type: 94集合
F: 2/3 (#) 3/15 (?)
set: SoftBank 2G (single-byte) Emoji page Q

type: 94集合
F: 2/4 ($) 3/1 (1)
set: DEC NRCS for Switzerland (corresponding to DEC's (not ARIB's) G*D4 4)

type: 94集合
F: 2/4 ($) 3/2 (2)
set: DEC NRCS for the Netherlands (corresponding to DEC's G*D4 =)

type: 94集合
F: 2/4 ($) 3/3 (3)
set: Marlett encoding

type: 94集合
F: 2/4 ($) 3/4 (4)
set: Zapf Dingbats, GL range

type: 94集合
F: 2/4 ($) 3/5 (5)
set: Zapf Dingbats, GR range

type: 94集合
F: 2/4 ($) 3/6 (6)
set: Symbol font encoding, GL range

type: 94集合
F: 2/4 ($) 3/7 (7)
set: Symbol font encoding, GR range (no euro)

type: 94集合
F: 2/4 ($) 3/8 (8)
set: 7-bit Maltese

type: 94集合
F: 2/4 ($) 3/9 (9)
set: 7-bit Icelandic

type: 94集合
F: 2/4 ($) 3/10 (:)
set: 7-bit Polish

type: 94集合
F: 2/4 ($) 3/11 (;)
set: ISO 11822:1996 Arabic supplementary set

type: 94集合
F: 2/4 ($) 3/12 (<)
set: ISO 10586:1996 Georgian

type: 94集合
F: 2/4 ($) 3/13 (=)
set: ISO 10585:1996 Armenian

type: 96集合
F: 2/1 (!) 3/0 (0)
set: RFC 1345's so-called ISO-IR-111/ECMA-Cyrillic (incompatible with ISO-IR-111 itself).

type: 96集合
IRR: 3/0 (0)
F: 2/4 ($) 3/7 (7)
set: Symbol font encoding, GR range (with figure space)

type: 96集合
IRR: 3/15 (?)
F: 2/4 ($) 3/7 (7)
set: Symbol font encoding, GR range (with euro)

type: 94²集合
F: 2/1 (!) 3/0 (0)
set: GB/T 12052 (Korean in Mainland China)

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (ICU EUC-2014 version)

type: 94²集合
IRR: 3/1 (1)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Microsoft version)

type: 94²集合
IRR: 3/2 (2)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Apple version)

type: 94²集合
IRR: 3/3 (3)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (GOV-TW version)

type: 94²集合
IRR: 3/4 (4)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (old ICU version)

type: 94²集合
IRR: 3/5 (5)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (IBM version)

type: 94²集合
IRR: 3/6 (6)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (Yasuoka version)

type: 94²集合
IRR: 3/7 (7)
F: 2/1 (!) 3/1 (1)
set: Planes 2 and up of CNS 11643 as a 94^3 set, as included by IBM EUC-TW as its G2 set (ICU EUC-2014 version)

type: 94²集合
IRR: 3/5 (5)3/8 カ?
F: 2/1 (!) 3/1 (1)
set: Planes 2 and up of CNS 11643 as a 94^3 set, as included by IBM EUC-TW as its G2 set (IBM version)

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/1 (1)
set: All planes of CNS 11643 as a 94^3 set, as included by EUC-TW as its G2 set (recommended version)

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/2 (2)
set: IBM Big-5 ETEN-based in-plane extensions (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 3/1 (1)
F: 2/1 (!) 3/2 (2)
set: Big5-ETEN with the subset of GCCS encoded with lead bytes following, not preceeding, the standard Big-5 assignments (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/2 (2)
set: MS-950 Big-5 extensions (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/0 (@)
F: 2/1 (!) 3/2 (2)
set: Big5-2003 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/1 (A)
F: 2/1 (!) 3/2 (2)
set: Big5-ETEN extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/2 (B)
F: 2/1 (!) 3/2 (2)
set: Hong Kong GCCS extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/3 (C)
F: 2/1 (!) 3/2 (2)
set: Hong Kong Supplementary Character Set 1999 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/4 (D)
F: 2/1 (!) 3/2 (2)
set: Hong Kong Supplementary Character Set 2001 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/5 (E)
F: 2/1 (!) 3/2 (2)
set: Hong Kong Supplementary Character Set 2004 extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/6 (F)
F: 2/1 (!) 3/2 (2)
set: Hong Kong Supplementary Character Set full (GCCS + 2008) extension set (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/3 (3)
set: Non-ETEN Big5 kana and Cyrillic (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere)

type: 94²集合
IRR: 4/0 (@)
F: 2/1 (!) 3/3 (3)
set: Non-ETEN Big5 kana and Cyrillic (accepted by Big-5 filter in G3 slot, not expected to be used elsewhere) combined with Microsoft non-EUDC extensions, as in Python's built-in (used when not on Windows or when MS-950 is not the ACP) version of "cp950".

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/4 (4)
set: IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); old mappings for use with 78JIS

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/4 (4)
set: IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); excluding UDC

type: 94²集合
IRR: 4/0 (@)
F: 2/1 (!) 3/4 (4)
set: IBM extensions for Shift_JIS (accepted by Shift_JIS filter in G3 slot, mapped to/from Shift_JIS by the same mapping scheme as JIS X 0213 plane 2); including UDC

type: 94²集合
F: 2/1 (!) 3/5 (5)
set: DoCoMo Emoji extensions for Shift_JIS (as above)

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/6 (6)
set: KDDI Emoji extensions for Shift_JIS (as above), symbolic zodiac variant

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/6 (6)
set: KDDI Emoji extensions for Shift_JIS (as above), pictorial zodiac variant

type: 94²集合
F: 2/1 (!) 3/7 (7)
set: SoftBank Emoji extensions for Shift_JIS (as above)

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/8 (8)
set: GB 7589 (supplementary simplified)—This is generated from the GB 13131 mappings, and may be minorly inaccurate in places.

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/8 (8)
set: GB 13131 (supplementary traditional).

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/9 (9)
set: GB 7590 (further supplementary simplified)—This is generated from the GB 13132 mappings, and may be minorly inaccurate in places.

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/9 (9)
set: GB 13132 (further supplementary traditional) This is based on Unihan source data, and has several gaps.

type: 94²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), updated mappings (recommended; default)

type: 94²集合
IRR: 3/1 (1)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), marginally newer (but not that up-to-date) Apple mappings given by Apple in mapping file comments

type: 94²集合
IRR: 3/2 (2)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), old Apple mappings

type: 94²集合
IRR: 3/3 (3)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), Adobe CID mappings (very partial)

type: 94²集合
IRR: 3/4 (4)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), mappings taking advantage of the PUA of the Nishiki-teki font where possible

type: 94²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/10 (:)
set: HangulTalk second plane (accepted by HangulTalk filter in G3 slot), Apple mappings

type: 94²集合
F: 2/1 (!) 3/11 (;)
set: Non-syllable part of KPS 9566-2011 outside the main plane (accepted by UHC filter in G3 slot)

type: 94²集合
F: 2/1 (!) 3/12 (<)
set: Big5-E extensions (for Big-5 filter's G3 slot)

type: 94²集合
F: 2/1 (!) 3/13 (=)
set: KS X 1002 (South Korean first supplementary plane)

type: 94²集合
F: 2/1 (!) 3/14 (>)
set: KS X 1027-1 (South Korean second supplementary plane)

type: 94²集合
F: 2/1 (!) 3/15 (?)
set: KS X 1027-2 (South Korean third supplementary plane)

type: 94²集合
IRR: 3/0 (0)
F: 2/2 (") 3/0 (0)
set: Big5 AtOn/ChinaSea extensions (for Big-5 filter's G3 slot), alternate version

type: 94²集合
IRR: 3/15 (?)
F: 2/2 (") 3/0 (0)
set: Big5 AtOn/ChinaSea extensions (for Big-5 filter's G3 slot)

type: 94²集合
IRR: 3/0 (0)
F: 2/2 (") 3/1 (1)
set: IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though. IBM mappings including corporate PUA.

type: 94²集合
IRR: 3/15 (?)
F: 2/2 (") 3/1 (1)
set: IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though. Reconstructed original version.

type: 94²集合
IRR: 4/0 (@)
F: 2/2 (") 3/1 (1)
set: IBM-926 (IBM-944)'s 94×94 plane (not KS X 1001 compatible for the most part). No DOCS filter exists for it yet though.

type: 94²集合
IRR: 3/0 (0)
F: 2/2 (") 3/2 (2)
set: "General Purpose Hanzi" (a small supplement of fewer than 200 characters for use alongside GBs 2312, 7589 and 7590 for Simplified Chinese).

type: 94²集合
IRR: 3/15 (?)
F: 2/2 (") 3/2 (2)
set: GB 16500 (yet another supplementary set, numbered the seventh in its title though Unihan calls it "GE").

type: 94²集合
IRR: 4/0 (@)
F: 2/2 (") 3/2 (2)
set: GB 16500 combined with "General Purpose Hanzi" (do not overlap, both are plane "7").

type: 94²集合
F: 2/2 (") 3/3 (3)
set: Big5 DynaLab extensions (for Big-5 filter's G3 slot)

type: 94²集合
F: 2/2 (") 3/4 (4)
set: Big5 Monotype extensions (for Big-5 filter's G3 slot)

type: 94²集合
F: 2/2 (") 3/5 (5)
set: Big5-Plus in-plane extensions (for Big-5 filter's G3 slot)

type: 94²集合
F: 2/2 (") 3/6 (6)
set: Big5-Plus out-of-plane extensions (not currently usable as such)

type: 94²集合
F: 2/2 (") 3/7 (7)
set: IBM Big5 non-ETEN out-of-plane extensions (not currently usable as such)

type: 96²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/0 (0)
set: GBK extras, WHATWG/HTML5 variant

type: 96²集合
IRR: 3/1 (1)
F: 2/1 (!) 3/0 (0)
set: GBK extras, mapping all characters with defined glyphs to non-PUA

type: 96²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/0 (0)
set: GBK extras (GB 18030, level 5 with associated UDC zone and non-URO part of level 4; accepted by GBK filter in G3 slot)

type: 96²集合
IRR: 3/0 (0)
F: 2/1 (!) 3/1 (1)
set: EACC / CCCII, Koha Taiwan version

type: 96²集合
IRR: 3/1 (1)
F: 2/1 (!) 3/1 (1)
set: EACC / CCCII, Hong Kong Innovative Users Group / Hong Kong University version

type: 96²集合
IRR: 3/2 (2)
F: 2/1 (!) 3/1 (1)
set: EACC / CCCII, aggregate version with Taiwan layout of row 2, favouring Unihan kCCCII for kanji mappings (default)

type: 96²集合
IRR: 3/3 (3)
F: 2/1 (!) 3/1 (1)
set: EACC / CCCII, aggregate version with Hong Kong layout of rows 0–2, favouring Library of Congress for kanji mappings

type: 96²集合
IRR: 3/15 (?)
F: 2/1 (!) 3/1 (1)
set: EACC / CCCII, Library of Congress version

[87] Fp IRR + 登録済終端バイトの指示列の事例は IRR を参照。

[88] GitHub - harjitmoe/ecma35lib: Aspiring to implement ECMA-35, in its fullness, for a Unicode era. This repo may occasionally lag behind the one on Gitlab., 2022-11-22T12:46:32.000Z https://github.com/harjitmoe/ecma35lib#additional-single-byte-g-sets-and-additional-selectors-for-particular-variants-of-single-byte-g-sets

[85] ISO/IEC 2022 は IRR の Fp を明確には規定していませんが、一部で使われています。 Fp

[78] 早稲田国際化多言語処理環境が ISO 2022 ベースだったらしいのですが、 ISO-IR にない文字いろいろに対応していたというのでもしかすると Fp を使っていたのかもしれませんが、ほとんど情報がなく不明です。

[102] skf はドキュメントで

vi) 携帯電話絵文字などの関係で、IANA と無関係の私用コードセットが幾つか定義されてお
り、呼び出し可能である。

としています。 >>101 具体的には明記されていませんが、 skf_2.00.17.tar.xz のソースコード in_code_table.c によると、

/* ESC-$-? : shorten multibyte sequence				   */

/* following set is not iso-2022 compliant, but can't distinguish  */
  {'G',1,128,vodafone_p1_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page1",NULL},
  {'E',1,128,vodafone_p2_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page2",NULL},
  {'F',1,128,vodafone_p3_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page3",NULL},
  {'O',1,128,vodafone_p4_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page4",NULL},
  {'P',1,128,vodafone_p5_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page5",NULL},
  {'Q',1,128,vodafone_p6_uni_byte,COD_MB,NULL,L_JP,NULL,
	"Vodafone picture page6",NULL},
  {'b',2,ISOMB_CODE_END,cp932_uni_byte,COD_MB,NULL,M_JP,NULL,
	NULL,"MS cp932"},	/* cp932 as documented		   */
  {'i',2,ISOMB_CODE_END,cp943_uni_byte,COD_MB,NULL,M_JP,NULL,
	NULL,"IBM cp943"},	/* cp943: OS/2 and AS/400	   */
  {'j',2,ISOMB_CODE_END,cp932w_uni_byte,COD_MB,NULL,M_JP,NULL,
	NULL,"MS cp932w"},	/* cp932 + wchar compatibility	   */
  {'B',2,ISOMB_CODE_END,cp932_uni_byte,COD_MB,NULL,M_JP,NULL,
	NULL,"MS cp932-DMY"},	/* cp932-JIS dummy entry	   */

/* ESC-$-F : private sequence (F < '@')				   */

  {0x38,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  	L_NU,NULL,NULL,NULL}, /* arib graphic1 */
  {0x39,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  	L_NU,NULL,NULL,NULL}, /* arib graphic2 */
  {0x3a,4,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  	L_NU,NULL,NULL,NULL}, /* arib kanjiadd */
  {0x3f,2,0,NULL,(COD_PRIV | COD_MB | COD_MB_4),NULL,
  	L_NU,NULL,NULL,NULL}, /* iso-2022-jp-ms */

が定義されています。