
Re: [idn] Prohibit CDN code points



In a message dated 2002-01-22 1:56:56 Pacific Standard Time, 
tsenglm@cc.ncu.edu.tw writes:

> TC/SC character equivalence mapping is similar to the mapping of a UNICODE
> alphabet letter to its counterpart in the ASCII alphabet.

No, it isn't.  Stop saying that.

ASCII uppercase/lowercase mapping is straightforward and unambiguous, and can 
be done one character at a time with NO lexical analysis (at least for 99% of 
all languages that use it; Turkish and Azeri do have exceptions).

TC/SC is NOT one-to-one for all characters.  It is for many, but nowhere near 
99% or 95%.  If you implement any sort of TC/SC mapping you MUST figure out 
how to handle the many-to-one and one-to-many cases, and this is where we 
have all been balking.  Users will not understand or accept that "only some" 
of the TC and SC characters are mapped to each other.
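
To make the difference concrete, here is a rough sketch in Python. The
SC_TO_TC table is a tiny hand-picked sample that I am assuming purely for
illustration, not a real mapping source, and the helper names are
hypothetical. It only shows that a per-character lookup suffices for ASCII
case folding but not for SC-to-TC:

```python
import string

# ASCII case folding: each uppercase letter has exactly one lowercase partner.
ASCII_FOLD = {u: [u.lower()] for u in string.ascii_uppercase}

# Illustrative sample only (assumed, not an authoritative table):
# Simplified character -> possible Traditional characters.
SC_TO_TC = {
    "马": ["馬"],              # one-to-one
    "发": ["發", "髮"],        # "emit" vs. "hair"
    "干": ["乾", "幹", "干"],  # "dry", "do/trunk", or unchanged
}

def is_one_to_one(table):
    """True if every source character has exactly one target."""
    return all(len(targets) == 1 for targets in table.values())

def ambiguous(table):
    """Source characters that cannot be mapped without looking at context."""
    return sorted(src for src, targets in table.items() if len(targets) > 1)

print(is_one_to_one(ASCII_FOLD))  # per-character folding is enough
print(is_one_to_one(SC_TO_TC))    # it is not enough here
print(ambiguous(SC_TO_TC))        # these need lexical context to resolve
```

Any character that shows up in ambiguous() is one where the mapping has to
either guess, look at the surrounding word, or refuse to map at all.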

-Doug Ewell
 Fullerton, California