Re: [idn] IDNs in IE and Google
----- Original Message -----
From: "Michel Suignard" <michelsu@windows.microsoft.com>
To: "Stephane Bortzmeyer" <bortzmeyer@nic.fr>; "Georg Ochsner" <georg@ochsner.de>
Cc: <idn@ops.ietf.org>
Sent: Friday, January 23, 2004 10:31 AM
Subject: RE: [idn] IDNs in IE and Google
> Concerning IRI, it is not a matter of 'preference'. If you present
> something like a URI containing a host name presented in non ASCII
> repertoire, you are in fact using an illegal URI per RFC2396 definition.
> At minimum you need to have a clear definition on how such 'extended'
> URI (in other words IRI) are mapped to legal URI. This is a big part of
> the IRI draft spec currently worked on. The draft is at
> http://www.ietf.org/internet-drafts/draft-duerst-iri-05.txt. The same
> goes for http, and any other URI schemes presented in browser user
> interface.
I understand the importance of the IRI effort.
BTW, MSIE and Mozilla already seem to support the IRI concept for the "file:" scheme.
file: URLs have long supported NetBIOS PC names and file/directory pathnames
in ***LOCAL CHARSET ENCODING***, not in UTF-8.
That works on Windows and even on Linux.
Moreover, most Asian HTML pages are published in local charset encodings
such as euc-kr, big5 and gb2312. UTF-8-encoded HTML pages are extremely *RARE*
in Asia.
The need for backward compatibility with this already deployed IRI-like usage, and for a
Unicode<->local charset conversion layer, may add further complexity to the IRI effort.
Even comparing two IRIs won't be a trivial task if they can be in two different encodings.
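
To illustrate the comparison problem, here is a rough Python sketch. It assumes
the same non-ASCII path segment was percent-encoded once from UTF-8 bytes and once
from euc-kr bytes. The helper names (to_uri_path, same_resource) are made up for
this example, and the charset of each form is assumed to be known out of band,
which is exactly the information a bare URI does not carry.

```python
from urllib.parse import quote, unquote_to_bytes


def to_uri_path(text: str, charset: str) -> str:
    """Percent-encode the bytes of `text` as produced by the given charset."""
    return quote(text.encode(charset), safe="/")


def same_resource(uri_a: str, charset_a: str, uri_b: str, charset_b: str) -> bool:
    """Compare two percent-encoded paths by decoding each back to Unicode.

    Hypothetical helper: it only works because the caller supplies the
    charset that each URI was built from.
    """
    a = unquote_to_bytes(uri_a).decode(charset_a)
    b = unquote_to_bytes(uri_b).decode(charset_b)
    return a == b


name = "한글"  # sample Hangul path segment
utf8_form = to_uri_path(name, "utf-8")
euckr_form = to_uri_path(name, "euc-kr")

# The two ASCII-only forms differ, so a plain string comparison fails.
print(utf8_form == euckr_form)
# With the charsets known, both decode to the same Unicode string.
print(same_resource(utf8_form, "utf-8", euckr_form, "euc-kr"))
```

Without the charset labels, the two percent-encoded strings simply look like
different names.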
IMHO, the IRI effort deserves a WG. I will resume tracking the progress of the IRI spec. :-)
Soobok Lee