[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [idn] hangul question




I got the following:

UTF16:
\u0032\u0030\u0030\u0032\u0066\u0069\u0066\u0061\uC6D4\uB4DC\uCEF5\uCE74\uB4DC\u002E\u0063\u006F\u006D
RACE: bq--3aadeabqaayaamqamyagsadgabq4nvfu3thplttuwtoa.com 

Source: RACE 
CharacterEncoding: ISO-8859-1 > 
native: 2002fifa월드컵카드.com> 
Input: bq--3aadeabqaayaamqamyagsadgabq4nvfu3thplttuwtoa.com > 0xd000

check out
http://thor.ar.com/servlet/com.ar.idns.RACEServlet


> 
> There's no valid Korean word or phrase represented by this string; in
> fact this string includes several syllables that are not normally used
> in contemporary Korean language.
> 
> However, it seems to me that this is not even a valid UTF-8 encoding,
> because the string has a pattern:
> 
> 1. All characters have the 0xd0 MSB.
> 2. The string of LSBs is (without leading 0x prefixes):
> 
> 00 32 00 30 00 30 00 32 00 66 00 69 00 66 00 61 b6 d4 b4 d4 ce f5 ce 74
> 
> Grouping each pair of adjacent bytes from the beginning, we get:
> 
> 0032 0030 0030 0032 0066 0069 0066 0061 b6d4 b4d4 cef5 ce74
> 
> I don't have any idea about the last four ones, but when interpreted as
> UCS-16 string, the first 8 characters are: "2002FIFA". : )
> 
> Hope this helped,
> Eugene
> 
> -- 
> Eugene M. Kim <ab@astralblue.com>
> 
> "Is your music unpopular?  Make it popular; make music which people
> like, or make people who like your music."
> 
>