[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Fwd: Need for Normalization forms "KR" was: Re: [idn] case folding]



I am kind of tired of explaining day-in and day-out on this. 

so i am going to write an I-D on this, explaining CJK canonicalization. i am
half way thru now. give me another week or two. i put other things on hold for
this.

(yea, my todo list is getting long. sorry paul :P)

-James Seng

Mark Davis wrote:
> 
> I am certainly not against this, if it is possible (and I am not an expert in this area). However, I have heard from various sources that the mappings are not trivial, and require dictionary look-up for satisfactory results. If it can be done in an algorithmic fashion, and you have the data to implement it, then it is a different matter.
> 
> Mark
> 
> James Seng wrote:
> 
> > Mark Davis wrote:
> > > - Han duplicates. What are you thinking of here? Simplified vs. Traditional is not algorithmic -- are you thinking of radicals, or some other mapping? Do you have data for whatever mapping you are thinking of?
> >
> > Without comment on the rest, I beg to defer on this.
> >
> > I know that the common persection is that SC-TC is impossible to do without
> > considering the lexical and context, but for domain names, it *can* be done.
> >
> > -James Seng