[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Prohibit CDN code points

To: idn@ops.ietf.org
Subject: Re: [idn] Prohibit CDN code points
From: DougEwell2@cs.com
Date: Tue, 22 Jan 2002 10:08:19 EST
Cc: tsenglm@cc.ncu.edu.tw, paf@cisco.com, seki@jp.fujitsu.com

In a message dated 2002-01-22 1:56:56 Pacific Standard Time, 
tsenglm@cc.ncu.edu.tw writes:

> TC/SC character equivalence mapping is similar to  the mapping of  UNICODE
> Alphabet  map  it to its counterpart of ASCII  alpnabet .

No, it isn't.  Stop saying that.

ASCII uppercase/lowercase mapping is straightforward and unambiguous, and can 
be done one character at a time with NO lexical analysis (at least for 99% of 
all languages that use it; Turkish and Azeri do have exceptions).

TC/SC is NOT one-to-one for all characters.  It is for many, but nowhere near 
99% or 95%.  If you implement any sort of TC/SC mapping you MUST figure out 
how to handle the many-to-one and one-to-many cases, and this is where we 
have all been balking.  Users will not understand or accept that "only some" 
of the TC and SC characters are mapped to each other.

-Doug Ewell
 Fullerton, California

Follow-Ups:
- RE: [idn] Prohibit CDN code points
  - From: "Kenny Huang" <huangk@alum.sinica.edu>
- Re: [idn] Prohibit CDN code points
  - From: =?utf-8?B?dHNlbmdsbUDoqIjntrLkuK3lv4Mu5Lit5aSnLnR3?= <tsenglm@cc.ncu.edu.tw>

Prev by Date: Re: [idn] Determining equivalence in Unicode DNS names
Next by Date: Re: [idn] Prohibit CDN code points
Previous by thread: RE: [idn] Prohibit CDN code points
Next by thread: Re: [idn] Prohibit CDN code points
Index(es):
- Date
- Thread