Corpora: Standard character set for India languages & etc.

From: Gann Ketty (gann_ketty@bah.com)
Date: Mon Mar 12 2001 - 23:04:57 MET

  • Next message: Evelyne Viegas: "Corpora: Calls: Semantic lexicons in NLP"

    ****Apologize for those who have seen the same posting on Linguist
    List****

    Dear netter,

    I'm working on mapping tables (character set to UNICODE) for several
    foreign languages. I'd like to know the national standard character set
    (other than UNICODE) for the following languages. For instance, standard

    character sets for Chinese are BIG5 & GB2312. If there is no standard
    character set for a specific language, I need to know the most popular
    character set being used and where I'm able to acquire the electronic
    data (prefer web site):
    (1) Azerbaijan
    (2) Bengali
    (3) Kannada
    (4) Lao
    (5) Punjabi
    (6) Tamil
    (7) Urdu

    Please reply me directly. Thank you in advance!

    Ketty Gann
    Language Technology Manager
    Booz.Allen & Hamilton Inc.



    This archive was generated by hypermail 2b29 : Tue Mar 13 2001 - 08:58:37 MET