
Re: [idn] Newbie's questions implementing the [IDNA]



On Wed, Dec 11, 2002 at 05:05:22AM +0100,
 JFC (Jefsey) Morfin <jefsey@jefsey.com> wrote 
 a message of 48 lines which said:

> as "iesg--coca-cola.com" or "iesg--jonathan-cohen.net" or
> "iesg--vint-cerf.org".

The Unicode string which IDN-encodes into "iesg--coca-cola.com" is:

LATIN SMALL LETTER C (Basic Latin)
LATIN SMALL LETTER O (Basic Latin)
ORIYA DIGIT THREE (Oriya)
ORIYA DIGIT THREE (Oriya)
LATIN SMALL LETTER C (Basic Latin)
LATIN SMALL LETTER A (Basic Latin)
FULL STOP (Basic Latin)

LATIN SMALL LETTER C (Basic Latin)
LATIN SMALL LETTER O (Basic Latin)
LATIN SMALL LETTER M (Basic Latin)
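
For anyone who wants to check the list above, here is a minimal sketch in Python. It assumes "iesg--" is the ACE prefix (as in the examples quoted in this thread) and that the rest of the label is plain Punycode; it uses Python's built-in "punycode" codec and does not apply nameprep. The helper names decode_label and describe are only illustrative.

```python
import unicodedata

# Assumption: "iesg--" is the ACE prefix used in this thread's examples.
ACE_PREFIX = "iesg--"


def decode_label(label, prefix=ACE_PREFIX):
    """Return the Unicode form of one label; non-ACE labels pass through."""
    if not label.lower().startswith(prefix):
        return label
    # Python ships a Punycode codec; nameprep is deliberately skipped here.
    return label[len(prefix):].encode("ascii").decode("punycode")


def describe(domain):
    """List the Unicode character names of a decoded domain name."""
    names = []
    for index, label in enumerate(domain.split(".")):
        if index:
            # Re-insert the separator dropped by split().
            names.append(unicodedata.name("."))
        names.extend(unicodedata.name(ch) for ch in decode_label(label))
    return names


for name in describe("iesg--coca-cola.com"):
    print(name)
```

The same two functions can be pointed at the other quoted names to see what they decode to, if they decode at all.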

In what language does it make sense? It is certainly possible to find
funny ACE encodings which have a meaningful Unicode form, but finding
one is not obvious.