[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Re: Agenda Item for next UTC: Normalizing Case Mapping



At 12:31 AM 2/19/00 +0800, James Seng wrote:
>To say U+7535 U+90AE '.' U+53F0 U+6E7E != U+96FB U+90F5 '.' U+81FA U+7063 is
>as good as saying email.tw != EMAIL.TW.

Could you explain why? I'm looking at the code charts, and U+7535 looks 
nothing like U+96FB; U+90AE looks only somewhat like U+90F5 (but is 
completely distinguishable). If you are saying "people might be as likely 
to write U+7535 U+90AE as they are U+96FB U+90F5 for the same word", that's 
similar to saying "people might write Duerst for Dürst". What they see on a 
display (paper or computer) is not confusing here.

>But why should UC bother with Chinese 'case' folding? Afterall, this is a
>problem unique to DNS and we should let it be handled it in DNS aliasing via
>DNAME and CNAME, e.g
>
>U+96FB U+90F5 '.' U+81FA U+7063 IN DNAME U+7535 U+90AE '.' U+53F0 U+6E7E

That is a local decision by the administrator for the domain in question (I 
can't tell which one you are saying is the base here).

>I agreed this is not 'case folding' as one would normally associate with the
>meaning of 'case'. But the problems are as real as I = dotless i.

The example you give here sounds like a problem of language synonyms, not 
one of script properties, which is what case is. Have I misinterpreted the 
example you gave?

--Paul Hoffman, Director
--Internet Mail Consortium