[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[idn] Fw: Cyrillics - Latin
----- Original Message -----
From: "Sergey Charikov" <email@example.com>
To: <Elisabeth.Porteneuve@cetp.ipsl.fr>; <firstname.lastname@example.org>
Sent: Monday, February 04, 2002 4:34 PM
Subject: Cyrillics - Latin
> Do believe there's absolutely no any confusion of Latin vs Cyrillics
> Russian community.
> We can see many familiar words in another language characters
> especially if there are similar drawing characters.
> The matter is when Russians see COBET (printed) they see "Soviet" and
> And they print on a business card - COBET for website with cyrillics
> (for russian and cyrillics markets)
> but they take SOVIET usually for webpages on english.
> We should understand the cyrillics domains required for cyrillics
> A PY (Paraguay) case is a funny thing only. This says about rushing to
> the native language even in such a way.
> Best Regards,
> Serguei Charikov
> Chair of Russian Language WG of MINC
> > If I may add a note on Latin-Cyrillic confusion. Quoted
> > from an explanation I have been providing to another group.
> > An aside note - I learnt from Russian colleagues that some
> > Russian favor to register domain names under .PY (ccTLD for
> > rather that .RU (ccTLD for Russia). The reason is that "PY"
> > is the beginning of the word "Russia" in Cyrillic - PYCCU[R].
> > The last caracter is Cyrillic "ya", see below, any other is
> > printing in Latin and Cyrillic, different code points in Unicode,
> > identical code point in "LDH".
> > Best regards,
> > Elisabeth Porteneuve
> > --
> > Let have a glimpse on both end-user and intellectual property
> > perspectives with an example.
> > The word "COBET" reads as it is if one assumes it is Latin
> > alphabet, but spells "soviet" if one assumes it is Cyrillic.
> > The Unicode code point representation for Cyrillic "C", 0x0421,
> > is different from code point representation for Latin "C",
> > but they are identical on a printed paper, business cards
> > or a screen. Taking into account the above, a usage of Unicode
> > code points subsequently makes it impossible to communicate
> > with anybody without knowing which language is _printed_, or,
> > even worst, which letter or sign is printed in which language.
> > In the famous TOYS[R]US the R in brackets is a Cyrillic
> > code point 0x042f spelled "ya", which also happen to be the
> > letter R seen as in mirror, spelled "are". With the exception
> > of that letter [R], any other one in TOYS[R]US may be read
> > either as Latin or as Cyrillic code point, different spellings,
> > different code points, identical printing on paper or screen.
> > In an example of a word of 6 code points, with the same
> > printing but 2 different contents there is 2**6 = 64 possible
> > combinations It is the number of times a 6 letters word
> > should be registered to preserve its whole intellectual
> > property rights in 2 alphabets, Latin and Cyrillic.
> > It is also the maximal number of tries an end-user should
> > made to get to a website, if she or he got only a printed
> > information.
> > I have no competencies to expand this example to other
> > alphabets or code points. Hovever, as far as I understand,
> > the problem of Chinese code points have some similarity.
> > --