<html xmlns="http://www.w3.org/1999/xhtml"><head></head><body><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="1" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[1]</anchor-end> 
The <dfn xml:lang="en"><cite>Segmented Name Strings</cite></dfn> specification
defines several syntaxes representing names and their 
segment boundaries.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="12" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[12]</anchor-end> This specification depends on the <cite><anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Infra Standard</anchor></cite>.
The terms
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">code point</anchor>,
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">ASCII lower alpha</anchor>,
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">list</anchor>,
and
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">is empty</anchor>
are defined by the <cite>Infra Standard</cite>.</p><section><h1>General definition</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="2" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[2]</anchor-end> A <dfn>segmented name string</dfn> is a space-separated list of 
one or more substrings, representing a name and implied boundaries
within it.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="5" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[5]</anchor-end> 
Formally, a segmented name string of type <var>T</var>
is a segment of type <var>T</var>,
followed by zero or more sequences of
combinations of a separator of type <var>T</var> and a segment of type <var>T</var>.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="3" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[3]</anchor-end> A segment of type <var>T</var> is a string of one or more non-space characters
that is not a punctuation of type <var>T</var>.
There might be <var>T</var>-dependent additional restrictions.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="4" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[4]</anchor-end> A punctuation of type <var>T</var> is one of the <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">list</anchor>,
referred to as the punctuation list for <var>T</var>.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="6" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[6]</anchor-end> A separator of type <var>T</var> is either a space, or
a space followed by a punctuation of type <var>T</var> followed by a space.
A separator that is a space represents an implied boundary,
e.g. hyphenation point.
A separator that contains a punctuation represents an explicit
boundary whose type is denoted by the punctuation.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="7" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[7]</anchor-end> A space is a <code>U+0020</code> <code class="charname" xml:lang="en">SPACE</code> character.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="8" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[8]</anchor-end> A character is a <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">code point</anchor>.</p></section><section><h1>East Asian names</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="9" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[9]</anchor-end> A <dfn>segmented East Asian name string</dfn> is a type of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented name string</anchor>.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="10" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[10]</anchor-end> A segment of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented East Asian name string</anchor> is a string
of one or more characters from:</p><ul><li><anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">CJK Unified Ideographs</anchor></li><li><anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">CJK Compatibility Ideographs</anchor></li><li><ed xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:10:">and more</ed></li></ul><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="11" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[11]</anchor-end> The punctuation list for <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented East Asian name string</anchor>
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">is empty</anchor>.</p></section><section><h1>Hiragana names</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="13" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[13]</anchor-end> A <dfn>segmented Hiragana name string</dfn> is a type of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented name string</anchor>.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="14" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[14]</anchor-end> A segment of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented Hiragana name string</anchor> is a string 
of one or more characters from:</p><ul><li><anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">Hiragana</anchor></li><li><code>ー</code></li></ul><ed xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:10:">Need more formal definition.</ed><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="15" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[15]</anchor-end> The punctuation list of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented Hiragana name string</anchor>
<anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">is empty</anchor>.</p></section><section><h1>Romaji names</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="16" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[16]</anchor-end> A <dfn>segmented Romaji name string</dfn> is a type of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented name string</anchor>.</p><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="17" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[17]</anchor-end> A segment of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented Romaji name string</anchor> is a string 
of one or more characters from:</p><ul><li><anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">ASCII lower alpha</anchor></li><li><code>ā</code>, <code>ī</code>, <code>ū</code>, <code>ē</code>, <code>ō</code></li></ul><ed xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:10:">Need more formal definition.</ed><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="18" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[18]</anchor-end> The punctuation list of <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">segmented Romaji name string</anchor>
is a <anchor xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:">list</anchor> of
<code>-</code> and <code>'</code>.</p></section><section><h1>License</h1><p><anchor-end xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:anchor="19" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:">[19]</anchor-end> 
Per CC0 <anchor-external xmlns="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resScheme="URI" xmlns:a0="urn:x-suika-fam-cx:markup:suikawiki:0:9:" a0:resParameter="https://creativecommons.org/publicdomain/zero/1.0/">https://creativecommons.org/publicdomain/zero/1.0/</anchor-external>, to the extent possible under law, the author of this specification has waived all copyright and related or neighboring rights to this specification.</p></section><section><h1>Notes</h1></section></body></html>