Re: Corpora: seeking help

From: SHEREEN KHOJA (s.khoja@lancaster.ac.uk)
Date: Wed Apr 04 2001 - 13:06:41 MET DST

    Hi,

    You did not specify what kind of Arabic you were looking for
    (i.e. Classical Arabic, Modern Standard Arabic, or one of the dialects). I
    have a small corpus of MSA newspaper text, 1600 words of which are
    part-of-speech tagged. I could give you a copy of this, but I need to know
    what encoding you prefer (Unicode, cp-1256, ISO 8859-6).
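
    If the copy arrives in a different encoding from the one you want,
    converting is straightforward. Below is a minimal Python sketch, assuming
    the text is held as raw bytes; the helper name `transcode` and the sample
    word are only illustrative, and the codec names are Python's own labels
    for the three encodings mentioned above.

```python
# Illustrative sketch: convert Arabic text between the encodings mentioned
# above. "transcode" is a hypothetical helper name, not part of any corpus tool.

# Python codec names for Unicode (as UTF-8), cp-1256 and ISO 8859-6.
UTF8 = "utf-8"
CP1256 = "cp1256"
ISO_8859_6 = "iso8859_6"


def transcode(data: bytes, source: str, target: str) -> bytes:
    """Decode bytes from the source encoding and re-encode them in the target.

    Raises UnicodeDecodeError or UnicodeEncodeError if a byte or character
    has no mapping, rather than silently dropping it.
    """
    return data.decode(source).encode(target)


# Sample word ("book"), chosen only for illustration.
sample = "\u0643\u062a\u0627\u0628"

as_cp1256 = sample.encode(CP1256)
as_iso = transcode(as_cp1256, CP1256, ISO_8859_6)
as_utf8 = transcode(as_iso, ISO_8859_6, UTF8)

# The round trip should give back the original text.
assert as_utf8.decode(UTF8) == sample
```

    Note that cp-1256 and ISO 8859-6 are single-byte encodings, so characters
    outside their repertoires will raise an error in this sketch rather than
    convert.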

    Regarding parallel corpora, I don't know of any, but I think the BBC
    Arabic news website displays some articles in Arabic that are translations
    of the English articles and so could probably be aligned.

    Regards,
    Shereen Khoja
    Lancaster University
    s.khoja@lancaster.ac.uk

    On Tue, 3 Apr 2001, aladdin kharabsheh wrote:

    > Hi,
    >
    > I am a Ph.D. student doing my research in
    > translation in Manchester. My study is
    > corpus-based.
    >
    > I need parallel corpora in computer science, as I have
    > chosen technical translation. My parallel corpora
    > should be English-Arabic.
    >
    > Until now I have not found any source from which I
    > can construct my corpora or texts. Please, if you
    > know of any website, organization, etc.,
    > provide me with the necessary information to contact
    > them; I will be more than thankful.
    >
    > N.B. In case you do not have any ideas about parallel
    > corpora, I am also interested in building a
    > monolingual corpus in the same field.
    >
    > Thanks in advance for your help.
    >
    > My best regards



    This archive was generated by hypermail 2b29 : Wed Apr 04 2001 - 13:02:36 MET DST