<html xmlns="http://www.w3.org/1999/xhtml"><head></head><body><section><h1>文字コード</h1><p><sw-see xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:"> <anchor>クメール文字の文字コード</anchor> </sw-see></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="29" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[29]</anchor-end> 
<cite>Fonts | Khmer Fonts | Cambodian Fonts | Khmer Unicode - All Khmer Unicode Cambodian Information Center</cite>, <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">cambodia.org</anchor>, <time>2025-08-01T08:24:48.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="http://www.cambodia.org/fonts/">http://www.cambodia.org/fonts/</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="23" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[23]</anchor-end> 
<cite xml:lang="en-US">Help - Can i install Khmer unicode font (Cambodian font) on my Galaxy | Early Bird Club</cite>, <time>2025-08-01T07:08:19.000Z</time>, <time>2025-08-01T07:14:50.176Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://forum.earlybird.club/threads/can-i-install-khmer-unicode-font-cambodian-font-on-my-galaxy.268606/">https://forum.earlybird.club/threads/can-i-install-khmer-unicode-font-cambodian-font-on-my-galaxy.268606/</anchor-external></p><section><h1>Unicode</h1><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="1" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[1]</anchor-end> 
<cite>FAQ and Resources on Khmer in Unicode</cite>, <time>2004-08-06T05:41:16.000Z</time>, <time>2025-07-29T08:42:28.045Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.khmerfonts.info/bauhahn/Welcome.html">https://www.khmerfonts.info/bauhahn/Welcome.html</anchor-external></li></ul><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="26" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[26]</anchor-end> 
<cite>FAQ and Resources on Khmer in Unicode</cite>, <time>2025-08-01T07:50:12.000Z</time>, <time>2006-03-15T12:24:37.378Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://web.archive.org/web/20060315122255/http://www.bauhahnm.clara.net/Khmer/Welcome.html">https://web.archive.org/web/20060315122255/http://www.bauhahnm.clara.net/Khmer/Welcome.html</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="3" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[3]</anchor-end> 
<cite>How to make a khmer font | Khmer fonts — ​ពុម្ព​អក្សរ​ខ្មែរ — Polices khmères</cite>, <time>2025-07-08T12:26:24.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.khmerfonts.info/howto.php">https://www.khmerfonts.info/howto.php</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="4" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[4]</anchor-end> 
<cite>The state of khmer unicode | Khmer fonts — ​ពុម្ព​អក្សរ​ខ្មែរ — Polices khmères</cite>, <time>2025-07-08T13:04:34.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.khmerfonts.info/unicode.php">https://www.khmerfonts.info/unicode.php</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="5" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[5]</anchor-end> <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="3" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;3</anchor-internal> <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="4" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;4</anchor-internal> <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode</anchor> にない文字がまだいくつかある</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="30" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[30]</anchor-end> 
<cite>Character list | Khmer fonts — ​ពុម្ព​អក្សរ​ខ្មែរ — Polices khmères</cite>, <time>2025-08-01T09:25:26.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.khmerfonts.info/characterlist.php">https://www.khmerfonts.info/characterlist.php</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="25" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[25]</anchor-end> 
<cite>Menu Francais</cite>, <time>2025-08-01T07:48:41.000Z</time>, <time>2006-03-12T10:46:15.269Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://web.archive.org/web/20060312101357/http://www.cfcambodge.org/Anglais/KhmerFont.htm">https://web.archive.org/web/20060312101357/http://www.cfcambodge.org/Anglais/KhmerFont.htm</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="27" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[27]</anchor-end> 
<cite>Khmer OpenType specification</cite>, <time>2025-08-01T07:50:47.000Z</time>, <time>2005-11-22T13:30:03.820Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://web.archive.org/web/20051122132755/http://www.microsoft.com/typography/otfntdev/khmerot/default.htm">https://web.archive.org/web/20051122132755/http://www.microsoft.com/typography/otfntdev/khmerot/default.htm</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="2" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[2]</anchor-end> 
<cite xml:lang="ja">XユーザーのMichiさん: 「若年層がクメール文字を使わずラテンを多用することを懸念する記事。確かにSNSを見るとクメール文字よりも入力しやすいのか、流行りからか、ラテンばかり。記事によれば、かつて宗主国フランスはクメール文字をラテンに置き換えようとしていたと。知らなかった。https://t.co/pOK3g1jypv」 / X</cite>, <time>午前9:01 · 2020年9月5日<attrvalue xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:10:">2020-09-05T00:01:17.000Z</attrvalue></time>, <time>2024-04-18T09:31:53.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://twitter.com/kruphlau/status/1302034027495424005">https://twitter.com/kruphlau/status/1302034027495424005</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="6" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[6]</anchor-end> 半分はずっと内戦やってて工業化が遅れた自国の責任。もう半分は現地事情を無視して複雑な符号化を強引に押し付けてクメール文字の実装を引き伸ばした欧米人の責任。しかし欧米人は仕様だけ作ってあとは知らんふり、
文化がどうなろうと知ったことではないからなあ。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="7" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[7]</anchor-end> 
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode帝国主義</anchor>、経済的植民地主義と非難されても擁護のしようがない嫌な事件だったね。</p><section><h1>実装</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="19" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[19]</anchor-end> 
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">KhmerOS</anchor></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="20" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[20]</anchor-end> 
<cite xml:lang="en">Khmer Open Source - Browse Files at SourceForge.net</cite>, <time>2025-08-01T07:06:06.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://sourceforge.net/projects/khmer-open-source/files/">https://sourceforge.net/projects/khmer-open-source/files/</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="21" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[21]</anchor-end> <cite xml:lang="en">All Khmer Unicode Fonts : Free Download, Borrow, and Streaming : Internet Archive</cite>, <time>2025-08-01T07:06:30.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://archive.org/details/all-khmer-unicode-fonts">https://archive.org/details/all-khmer-unicode-fonts</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="22" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[22]</anchor-end> 
<cite xml:lang="en">Download All Khmer Unicode Fonts | Society for Better Books in Cambodia</cite>, <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Society for Better Books in Cambodia</anchor>, <time>2025-08-01T07:07:42.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://sbbic.org/2014/06/25/download-all-khmer-unicode-fonts">https://sbbic.org/2014/06/25/download-all-khmer-unicode-fonts</anchor-external></p></section><section><h1>文字順序問題</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="14" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[14]</anchor-end> 
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode</anchor> は<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">クメール文字</anchor>を極めて複雑な仕組みで符号化したにも関わらず、
その用法を十分厳密に定めることを怠り、
実装によって<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">結合文字</anchor>が異なる順序で用いられるなどの<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">相互運用性</anchor>の問題が長年放置されてきました。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="16" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[16]</anchor-end> 
基本的には<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">結合文字</anchor>の配置順の問題のようです。 <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode</anchor> には<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">正準再順序付け</anchor>のような仕組みがあり、
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">ラテン文字</anchor>などにはおおむねうまく機能している (あくまでおおむね。) ものの、
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">クメール文字</anchor>に対しては碌に定義されておらず、その方面の改善も期待できないようです。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="15" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[15]</anchor-end> 
また、複雑な組み合わせになると実装によって正しく表示できないという問題が起こっています。
少数民族言語などに皺寄せが行きがちです。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="17" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[17]</anchor-end> 
これについてはどのような<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode文字</anchor>の組み合わせでどのような<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">グリフ</anchor>が表示されることが期待されるのか、
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">フォント</anchor>はどのような<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">グリフ</anchor>を実装することが期待されるのか、
といったことを具体的に列挙するような規格が本質的な解決策になるのでしょうが、
そうしたものを整備しようという動きは無さそうで、
混乱は続きそうです。</p><comment-p xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:10:"><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="18" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[18]</anchor-end> 網羅的な規定を設けず、<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">文字素</anchor>として利用可能な<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">基底文字</anchor>と<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">結合文字</anchor>を用意すれば問題解決ということにしてしまうのは、
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">クメール文字</anchor>に限らず <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Unicode</anchor> の全体的な態度であり、<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">相互運用性</anchor>の問題を引き起こしがちです。</comment-p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="31" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[31]</anchor-end> <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="30" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;30</anchor-internal> がそれに近いのでしょうが、
<anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="11" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;11</anchor-internal>
を見るにこれでは十分ではなさそうですね。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="9" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[9]</anchor-end> 
<cite xml:lang="en-US">khmer encoding structure - Institute of Digital Research &amp; Innovation</cite>, <time>2025-07-29T09:49:30.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.idri.edu.kh/khmer-encoding-structure/">https://www.idri.edu.kh/khmer-encoding-structure/</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="24" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[24]</anchor-end> 
<cite xml:lang="en">GitHub - clo-cambodia/pheasakhmer.com</cite>, <time>2025-08-01T07:24:19.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/clo-cambodia/pheasakhmer.com/">https://github.com/clo-cambodia/pheasakhmer.com/</anchor-external></p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="10" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[10]</anchor-end> 
<cite xml:lang="ja">utn61-Khmer_Encoding_Structure_V2.pdf</cite>, <time>2025-02-06T17:57:30.000Z</time>, <time>2025-07-29T09:50:27.247Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.unicode.org/notes/tn61/utn61-Khmer_Encoding_Structure_V2.pdf">https://www.unicode.org/notes/tn61/utn61-Khmer_Encoding_Structure_V2.pdf</anchor-external></p><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="11" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[11]</anchor-end> <cite xml:lang="en">GitHub - sillsdev/khmer-character-specification: Khmer Character Specification</cite>, <time>2025-08-01T05:49:13.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/sillsdev/khmer-character-specification/">https://github.com/sillsdev/khmer-character-specification/</anchor-external></li><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="33" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[33]</anchor-end> 
<cite xml:lang="en">GitHub - sillsdev/khmer-normalizer: Normalize Khmer strings according to https://www.unicode.org/L2/L2022/22290-khmer-encoding.pdf</cite>, <time>2025-08-01T10:01:07.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/sillsdev/khmer-normalizer">https://github.com/sillsdev/khmer-normalizer</anchor-external></li><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="13" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[13]</anchor-end> 
<cite xml:lang="en-us">UTN #61: Khmer Encoding Structure</cite>, <time>2025-02-07T16:29:20.000Z</time>, <time>2025-08-01T05:50:24.460Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.unicode.org/notes/tn61/">https://www.unicode.org/notes/tn61/</anchor-external></li></ul></section><section><h1>shaping</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="28" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[28]</anchor-end> 
<cite>Shaping: Khmer OpenType specification</cite>, <time>2025-08-01T07:51:48.000Z</time>, <time>2005-09-08T06:29:20.032Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://web.archive.org/web/20050908062827/http://www.microsoft.com/typography/otfntdev/khmerot/shaping.htm">https://web.archive.org/web/20050908062827/http://www.microsoft.com/typography/otfntdev/khmerot/shaping.htm</anchor-external></p></section><section><h1>日付文字</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="8" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[8]</anchor-end> 
<cite xml:lang="ja">DATE: 2002-02-13 - 02101-KhmerLunarDates.pdf</cite>, <time>2002-02-17T07:53:06.000Z</time>, <time>2025-07-29T08:43:33.612Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://www.khmerfonts.info/bauhahn/02101-KhmerLunarDates.pdf">https://www.khmerfonts.info/bauhahn/02101-KhmerLunarDates.pdf</anchor-external></p></section><section><h1>入力正規化</h1><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="12" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[12]</anchor-end> <cite xml:lang="en">GitHub - seanghay/khmernormalizer: A missing toolkit for Khmer Natural Language Processing.</cite>, <time>2025-08-01T09:58:56.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/seanghay/khmernormalizer">https://github.com/seanghay/khmernormalizer</anchor-external></li><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="36" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[36]</anchor-end> <cite xml:lang="en">GitHub - seanghay/unicode2legacy-js: Khmer Unicode to Limon Converter</cite>, <time>2025-08-01T10:05:52.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/seanghay/unicode2legacy-js">https://github.com/seanghay/unicode2legacy-js</anchor-external><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="37" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[37]</anchor-end> <cite xml:lang="km">Khmer Unicode to Limon</cite>, <time>2025-07-29T09:50:56.000Z</time>, <time>2025-08-01T10:06:15.678Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://khmer.floo.app/">https://khmer.floo.app/</anchor-external></li></ul></li></ul><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="32" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[32]</anchor-end> <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="12" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;12</anchor-internal> <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">自然言語処理</anchor>のためのデータ作成のための<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">正規化</anchor>器。
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">クメール文字</anchor>の表現については <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="33" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;33</anchor-internal> と同等とみられる。
それ以外の基礎的な部分は<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">ベンガル文字</anchor>用の <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="35" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;35</anchor-internal> を流用している。</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="38" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[38]</anchor-end> <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="37" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;37</anchor-internal> の実装が <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="36" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;36</anchor-internal>。 <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="36" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;36</anchor-internal> は <anchor-internal xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="12" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">&gt;&gt;12</anchor-internal> および変換の実装を使っている。
変換の実装については<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">クメール文字の文字コード</anchor>を参照。</p><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="34" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[34]</anchor-end> 
<cite xml:lang="ja">Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation - 2020.emnlp-main.207.pdf</cite>, <time>2020-11-11T21:31:33.000Z</time>, <time>2025-08-01T10:01:56.732Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://aclanthology.org/2020.emnlp-main.207.pdf">https://aclanthology.org/2020.emnlp-main.207.pdf</anchor-external><ul><li><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="35" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[35]</anchor-end> 
<cite xml:lang="en">GitHub - csebuetnlp/normalizer: This python module is an easy-to-use port of the text normalization used in the paper &quot;Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation&quot;. It is intended to be used for normalizing / cleaning Bengali and English text.</cite>, <time>2025-08-01T10:02:25.000Z</time> <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://github.com/csebuetnlp/normalizer/tree/main">https://github.com/csebuetnlp/normalizer/tree/main</anchor-external></li></ul></li></ul></section></section></section><section><h1>メモ</h1></section></body></html>