[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] Newbie's questions implementing the [IDNA]

To: IETF idn working group <idn@ops.ietf.org>
Subject: Re: [idn] Newbie's questions implementing the [IDNA]
From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>
Date: Wed, 11 Dec 2002 01:11:04 +0000
In-reply-to: <5.1.0.14.0.20021210125313.02c89ec0@mail.jefsey.com>
References: <20021210002946.GD32412@nicemice.net> <3DF3F626.4060709@netpia.com> <20021209024449.GA31524@postel.co.kr> <20021209231828.GA32412@nicemice.net> <20021210000347.GR31524@postel.co.kr> <20021210002946.GD32412@nicemice.net> <5.1.0.14.0.20021210125313.02c89ec0@mail.jefsey.com>
Reply-to: IETF idn working group <idn@ops.ietf.org>
User-agent: Mutt/1.4i

"JFC (Jefsey) Morfin" <jefsey@jefsey.com> wrote:

> if (ToASCII(nameprep(ToUnicode(ascii_text))) == ascii_text) babelname=true;
> if (babelname) "iesg--ascii_text" will display in ASCII mode on most
> of the systems while having been registered, and possibly TMed, as
> ToUnicode(ascii_text).

Your reference to "iesg--ascii_text" suggests that you expect ascii_text
not to include the ACE prefix.  But in that case, ToUnicode(ascii_text)
is simply ascii_text itself, because ToUnicode won't alter a string that
doesn't begin with the ACE prefix.

> I documented Adam with cases where babelname==true.  Among them
> "coca-cola", "ibm", "vint-cerf", "adam-costello".

babelname, as you've defined it, would be true for *every* lowercase
ASCII string (up to 63 characters).  I think the feature of those
example strings that you're interested in is that if you prepend the ACE
prefix to them, the result is an ACE (which is, by definition, something
that ToUnicode would alter).  So the test you're looking for is:

    ToUnicode(IESG--ascii_text) != IESG--ascii_text

which is roughly approximated by this test:

    Punyencode(Nameprep(Punydecode(ascii_text))) == ascii_text

which is similar in form to the test you proposed, but ToASCII is
very different from Punyencode, and ToUnicode is very different from
Punydecode.

> if (babelname) "iesg--ascii_text" will display in ASCII mode on most
> of the systems while having been registered, and possibly TMed, as
> ToUnicode(ascii_text).

IESG--ascii_text will be displayed in non-ASCII form by IDN-aware
applications capable of displaying the characters, and will be displayed
in ASCII form otherwise.

AMC

Follow-Ups:
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: "JFC (Jefsey) Morfin" <jefsey@jefsey.com>

References:
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>
- [idn] Newbie's questions implementing the [IDNA]
  - From: Seungho Lee <shlee@netpia.com>
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: Soobok Lee <lsb@postel.co.kr>
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord>
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: Soobok Lee <lsb@postel.co.kr>
- Re: [idn] Newbie's questions implementing the [IDNA]
  - From: "JFC (Jefsey) Morfin" <jefsey@jefsey.com>

Prev by Date: Re: [idn] Newbie's questions implementing the [IDNA]
Next by Date: [idn] Fw: Newswire: WALID Transfers Key Internet Patent to IDN Technologies
Previous by thread: Re: [idn] Newbie's questions implementing the [IDNA]
Next by thread: Re: [idn] Newbie's questions implementing the [IDNA]
Index(es):
- Date
- Thread