
[idn] Re: IDNA: is the specification proper, adequate, and complete?(was: Re: I-D ACTION:draft-ietf-idn-idna-08.txt)

To: vint cerf <vinton.g.cerf@wcom.com>
Subject: [idn] Re: IDNA: is the specification proper, adequate, and complete?(was: Re: I-D ACTION:draft-ietf-idn-idna-08.txt)
From: Simon Josefsson <simon+idn@josefsson.org>
Date: Mon, 17 Jun 2002 13:16:31 +0200
Cc: Soobok Lee <lsb@postel.co.kr>, idn@ops.ietf.org
In-reply-to: <5.0.2.1.2.20020617061922.04817210@shoe.reston.mci.net> (vintcerf's message of "Mon, 17 Jun 2002 06:22:14 -0400")
References: <5.1.0.14.2.20020615133616.0236f8c8@jay.songbird.com><5.1.0.14.2.20020616185947.022b0430@jay.songbird.com><5.0.2.1.2.20020617061922.04817210@shoe.reston.mci.net>
User-agent: Gnus/5.090007 (Oort Gnus v0.07) Emacs/21.3.50(i686-pc-linux-gnu)

vint cerf <vinton.g.cerf@wcom.com> writes:

> It seems to me that we err if we mix "finding" identifiers
> (with search engines, elaborate directories that offer multiple
> choices of IDNs based on imprecise search criteria) with
> resolving unambiguous identifiers into their respective IP addresses
> (speaking roughly since DNS also offers indirect resolutions such as
> MX, CNAME and so on).
>
> I think we do ourselves a disservice if we try to make DNS resolve
> ambiguous references - it is not designed for such applications;
> search engines and directory structures are more oriented towards
> that aspect of finding things "by name" on the Internet.

This seems to argue against the current design of IDNA.

IDNA resolves some ambiguities in identifiers by Unicode
normalization, and introduces further ambiguities by not handling
legacy charset transcoding issues at all.

Admittedly, "resolving ambiguities" in human-entered text strings is a
fuzzy area.  One extreme is to not resolve any ambiguity at all; the
other is to use as much intelligence in the software as possible to
figure out what the user meant.  Domain names in DNS have traditionally
been at the first extreme.  IDNA moves things slightly toward the other
extreme with normalization and legacy transcoding, but it is far from
approaching search engines or directories.

Now, one can argue that Unicode normalization is only used because
Unicode happens to have different ways of representing the same, or
non-visual, characters, but nevertheless this adds an
ambiguity-resolving mechanism to software.  That mechanism will also
have to be modified over time, since consensus on how to resolve
ambiguities will change.  I have trouble visualizing how this can be
implemented and work well for 2, 5, 10 years and more, when Unicode
and other charsets are moving targets.
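
To make the point about normalization concrete, here is a minimal
sketch in Python.  It assumes NFKC normalization followed by case
folding as a rough stand-in for what IDNA's string preparation does;
it is not the full preparation step, and the helper name
normalize_label is hypothetical.  It shows three different code point
sequences being treated as the same label.

```python
import unicodedata


def normalize_label(label: str) -> str:
    """Hypothetical helper: NFKC plus case folding.

    This is only an approximation of IDNA string preparation,
    used here to illustrate ambiguity resolution.
    """
    return unicodedata.normalize("NFKC", label).casefold()


# The same visible word entered in three different ways.
composed = "caf\u00e9"                   # e-acute as a single code point
decomposed = "cafe\u0301"                # "e" followed by a combining acute accent
fullwidth = "\uff43\uff41\uff46\u00e9"   # fullwidth c, a, f, then e-acute

# Before normalization the strings differ code point by code point.
assert composed != decomposed
assert composed != fullwidth

# After normalization all three compare equal.
assert normalize_label(composed) == normalize_label(decomposed)
assert normalize_label(composed) == normalize_label(fullwidth)

# The mapping tables come from whatever Unicode version the runtime
# bundles, so the result can depend on the software release in use.
print("Unicode data version:", unicodedata.unidata_version)
```

The last line is the part that relates to the moving-target concern:
the outcome is tied to the Unicode tables shipped with the software,
not to anything carried in the name itself.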
