[Kannada] (Fwd) Re: [indic] Re: Fw: allkeys.txt

Dr. U.B. Pavanaja pavanaja at vishvakannada.com
Tue Jun 1 22:38:03 PDT 2004


------- Forwarded message follows -------
Date sent:      	Tue, 1 Jun 2004 15:54:32 -0700 (PDT)
From:           	Kenneth Whistler <kenw at sybase.com>
Send reply to:  	Kenneth Whistler <kenw at sybase.com>
Subject:        	Re: [indic] Re: Fw: allkeys.txt
To:             	pavanaja at vishvakannada.com
Copies to:      	mark.davis at jtcsv.com, kenw at sybase.com, ake.persson at mimer.se

I have updated the source for the generating file (unidata-4.0.1.txt)
in accordance with your information about the misplacement of
KANNADA SIGN AVAGRAHA, pending any other corrections which will
need to be made (for other scripts) for an eventual allkeys-4.0.1.txt.

Regards,

--Ken Whistler

> Hi all,
> 
> There is only one mistake in the allkeys-4.0.0.txt with regards to Kannada 
collation order. It is the wrong 
> position of AVAGRAHA (0CBD). It is kept in between 0CB9 and 0CB3. It should be 
placed before 
> 0CBE. This is taken care in the proposed allkeys-4.0.1.txt. But while 
correcting this mistake a new 
> mistake is introduced! It is the wrong position of KANNADA LETTER RRA (0CB1). 
Its position in 
> allkeys-4.0.0.txt is correct. Don't move that.
> 
> ***** allkeys-4.0.0.txt
> 0CB0  ; [.173A.0020.0002.0CB0] # KANNADA LETTER RA
> 0CB1  ; [.173B.0020.0002.0CB1] # KANNADA LETTER RRA
> 0CB2  ; [.173C.0020.0002.0CB2] # KANNADA LETTER LA
> 0CB5  ; [.173D.0020.0002.0CB5] # KANNADA LETTER VA
> 0CB6  ; [.173E.0020.0002.0CB6] # KANNADA LETTER SHA
> 0CB7  ; [.173F.0020.0002.0CB7] # KANNADA LETTER SSA
> 0CB8  ; [.1740.0020.0002.0CB8] # KANNADA LETTER SA
> 0CB9  ; [.1741.0020.0002.0CB9] # KANNADA LETTER HA
> 0CBD  ; [.1742.0020.0002.0CBD] # KANNADA SIGN AVAGRAHA
> 0CB3  ; [.1743.0020.0002.0CB3] # KANNADA LETTER LLA
> 0CDE  ; [.1744.0020.0002.0CDE] # KANNADA LETTER FA
> 0CBE  ; [.1745.0020.0002.0CBE] # KANNADA VOWEL SIGN AA
> 
> ***** allkeys-4.0.1.txt - proposed  as per the mail from Mark Davis
> 0CB0  ; [.173A.0020.0002.0CB0] # KANNADA LETTER RA
> 0CB2  ; [.173B.0020.0002.0CB2] # KANNADA LETTER LA
> 0CB5  ; [.173C.0020.0002.0CB5] # KANNADA LETTER VA
> 0CB6  ; [.173D.0020.0002.0CB6] # KANNADA LETTER SHA
> 0CB7  ; [.173E.0020.0002.0CB7] # KANNADA LETTER SSA
> 0CB8  ; [.173F.0020.0002.0CB8] # KANNADA LETTER SA
> 0CB9  ; [.1740.0020.0002.0CB9] # KANNADA LETTER HA
> 0CB3  ; [.1741.0020.0002.0CB3] # KANNADA LETTER LLA
> 0CB1  ; [.1742.0020.0002.0CB1] # KANNADA LETTER RRA
> 0CDE  ; [.1743.0020.0002.0CDE] # KANNADA LETTER FA
> 0CBD  ; [.1744.0020.0002.0CBD] # KANNADA SIGN AVAGRAHA
> 0CBE  ; [.1745.0020.0002.0CBE] # KANNADA VOWEL SIGN AA
> *****
> 
> ***** allkeys-4.0.1.txt - correct order as per Kannada
> 0CB0  ; [.173A.0020.0002.0CB0] # KANNADA LETTER RA
> 0CB1  ; [.1742.0020.0002.0CB1] # KANNADA LETTER RRA
> 0CB2  ; [.173B.0020.0002.0CB2] # KANNADA LETTER LA
> 0CB5  ; [.173C.0020.0002.0CB5] # KANNADA LETTER VA
> 0CB6  ; [.173D.0020.0002.0CB6] # KANNADA LETTER SHA
> 0CB7  ; [.173E.0020.0002.0CB7] # KANNADA LETTER SSA
> 0CB8  ; [.173F.0020.0002.0CB8] # KANNADA LETTER SA
> 0CB9  ; [.1740.0020.0002.0CB9] # KANNADA LETTER HA
> 0CB3  ; [.1741.0020.0002.0CB3] # KANNADA LETTER LLA
> 0CDE  ; [.1743.0020.0002.0CDE] # KANNADA LETTER FA
> 0CBD  ; [.1744.0020.0002.0CBD] # KANNADA SIGN AVAGRAHA
> 0CBE  ; [.1745.0020.0002.0CBE] # KANNADA VOWEL SIGN AA
> *****
> 
> Thanks and regards,
> Pavanaja
> 
> > 
> >  
> > We have received the attached submission on Indic collation. There is a 
meeting of the UTC in a 
> > couple of weeks, so if anything needs to be done on collation for Indic in 
this meeting, it would be 
> > good to get a proposal from the experts.
> > 
> > Note: there are two places that collation can be affected:
> > 
> > A. One is in the DUCET table, which is the default ordering. 
> > - This ordering can be seen in chart form at 
http://www.unicode.org/charts/collation/.
> > - The data file is at http://www.unicode.org/Public/UCA/latest/allkeys.txt
> > - The technical standard, which describes the format,is at 
http://www.unicode.org/reports/tr10/
> > 
> > B. The other is in the CLDR. This is appropriate where there are multiple 
languages that use the 
> > same script.
> > - The main page is http://www.unicode.org/cldr/
> > - Charts at http://www.unicode.org/cldr/comparison_charts.html
> > - Format described at http://www.unicode.org/reports/tr35/#<collations>
> > 
> > Mark
> > __________________________________
> > http://www.macchiato.com
> > ► शिष्यादिच्छेत्पराजयम् 
â—„
> > ----- Original Message ----- From: Ã
ke Persson 
> > To: Mark Davis 
> > Sent: Fri, 2004 May 28 09:42
> > Subject: allkeys.txt
> > 
> > 
> > Hi Mark,
> > The DUCET default ordering for some Indic scripts are possibly (IMO) 
incorrect.Please,look it 
> > over. I have made an imaginary allkeys-4.0.1.txt file, and attached the 
difference. I have no Indic 
> > dictionaries, I'm just guessing.
> > 
> > Sources:
> > http://www.eki.ee/wgrs/
> > http://acharya.iitm.ac.in/multi_sys/unicode/debate.html
> > 
> > Kind regards,
> > Ã
ke
> > 
> 
> 
> 
--------------------------------------------------------------------------------
-------------
> Dr. U.B. Pavanaja
> CEO, Vishva Kannada Softech
> Think Globally, Act locally
> 
> 
> 



------- End of forwarded message -------
---------------------------------------------------------------------------------------------
Dr. U.B. Pavanaja
CEO, Vishva Kannada Softech
Think Globally, Act locally


More information about the Kannada mailing list