[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] homograph attacks



Kane, Pat wrote:
VeriSign does prevent domains with the Russian language tag from commingling
A-Z with the Cyrillic characters. It does permit 0-9 and the dash to be
used. This filter also applies to other Cyrillic based languages such as
Belarusian, Ukrainian, Serbian, Macedonian and Bulgarian.


There are other languages that are listed within ISO 639-2 that today use a
combination of Latin and Cyrillic as they were originally Latin based (Tajik
was Arabic prior to being Latin based), migrated to Cyrillic during the
Soviet era and today are migrating back to Latin.

Thanks for the clarification. Is this information publically available somehow? On

http://www.verisign.com/static/002533.pdf

I can find the language code list (which shows that indeed TGK and RUS
might be treated differently); I wonder whether you somehow list the
constraints implemented for each tag. How did the applicant know that
he would have to use Tajik in order to get a cyrillic letter into an
otherwise latin label?

As for the Tajik writing system: why is it then necessary to allow
mixed scripts? Wouldn't the Tajik users be satisfied if you could
either register all-Latin or all-Cyrillic labels (perhaps allowing
all-Arabic as well)?

Regards,
Martin