Re: [idn] IRIs ought to use internationalized host names

To: "IETF idn working group" <idn@ops.ietf.org>
Subject: Re: [idn] IRIs ought to use internationalized *host* names
From: "Soobok Lee" <lsb@postel.co.kr>
Date: Thu, 28 Mar 2002 00:29:54 +0900
References: <7FC3066C236FD511BC5900508BAC86FE9199C0@trestles.internal.realnames.com> <000f01c1d4ae$56a77e60$4d43738c@me.ncu.edu.tw> <008701c1d4d9$7e3ddde0$6501a8c0@EDMON15> <004901c1d4e2$23605de0$0701000a@jamescompaq> <20020327033059.GA18099@nicemice.net>


----- Original Message ----- 
From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>
> The Unicode character database classifies each character as belonging to
> exactly one of the following broad classes:
> 
> L: letter
> M: mark
> N: number
> P: punctuation
> S: symbol
> Z: separator
> C: other

May I add this?

  U: unassigned code points.

> 
> We can start by examining which of these classes of ASCII characters are
> allowed in ASCII host labels.
> 
> L: 52 exist, all are allowed
> M:  0 exist
> N: 10 exist, all are allowed
> P: 23 exist, only hyphen-minus is allowed
> S:  9 exist, none are allowed
> Z:  1 exists, it is not allowed
> C: 33 exist, none are allowed

  U: indefinite, all are allowed.
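The ASCII tallies quoted above can be reproduced with a short script. This is only a sketch: it assumes Python's standard `unicodedata` module, whose data follows whatever Unicode version ships with the interpreter, and the helper name `ascii_class_counts` is made up for illustration. The broad class is taken to be the first letter of the two-letter general category (for example "Lu" and "Ll" both count as L).

```python
import unicodedata
from collections import Counter


def ascii_class_counts():
    """Tally ASCII code points (U+0000..U+007F) by broad Unicode class.

    The broad class is the first letter of the general category,
    e.g. "Lu" -> "L", "Pd" -> "P", "Cc" -> "C".
    """
    return Counter(unicodedata.category(chr(cp))[0] for cp in range(128))


counts = ascii_class_counts()
for cls in "LMNPSZC":
    # Counter returns 0 for classes with no ASCII members (M).
    print(cls, counts[cls])
```

The database has no separate top-level class U; unassigned code points are reported under C, as general category "Cn".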


> 
> We can trivially extend these results to form a simple rule covering the
> entire Unicode repertoire, except that we have no precedent for class
> M.  Since characters in class M tend to be things like diacritics, they
> should be allowed.  So the proposed rule is:
> 
> All characters in classes L (letter), M (mark), and N (number) are
> allowed, and U+002D (hyphen-minus) is also allowed.  Everything else is
> forbidden.
 
 U should also be allowed, in addition to L, M, and N.
 In later versions of Unicode, however, U may be partitioned into L' through C' and a smaller U'.
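To make the two variants concrete, here is a rough sketch of the proposed rule with the U extension as an option. It is not part of nameprep or any draft. The function names and the `allow_unassigned` flag are hypothetical, and mapping U to general category "Cn" is an assumption (Cn also covers noncharacters such as U+FFFF). It relies on Python's standard `unicodedata` module, so which code points count as unassigned depends on the Unicode version bundled with the interpreter.

```python
import unicodedata


def is_allowed(ch, allow_unassigned=False):
    """Apply the proposed rule to a single character.

    Allowed: classes L (letter), M (mark), N (number), plus U+002D.
    With allow_unassigned=True, code points the bundled Unicode
    database reports as "Cn" are allowed too (assumed to stand for U).
    """
    if ch == "\u002d":  # hyphen-minus, the one allowed punctuation character
        return True
    cat = unicodedata.category(ch)
    if cat[0] in "LMN":
        return True
    if allow_unassigned and cat == "Cn":
        return True
    return False


def label_ok(label, allow_unassigned=False):
    """Return True if every character of a non-empty label passes the rule."""
    return bool(label) and all(is_allowed(ch, allow_unassigned) for ch in label)
```

Under this sketch `label_ok("abc-123")` passes and `label_ok("a_b")` fails either way; the two variants differ only for code points that the database in use still lists as unassigned, which is exactly the set that shrinks as later Unicode versions assign them.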

 Soobok Lee

