Corpora: language boundaries + code switching

From: D C Souter (cs@scs.leeds.ac.uk)
Date: Tue Jul 04 2000 - 13:24:33 MET DST

    Dear all,

    I'm looking for details of projects on automatic boundary identification
    in bilingual/multilingual texts, and corpus material containing such texts.
    I would prefer it if one of the languages were English, and the texts were
    ASCII. I suppose one such source would be a corpus showing code switching.
    Anyone know of such material/projects?

    (I know we could create such material artificially, but I was hoping to
    find naturally occurring material.)

    Clive

    =========================================================================
    Clive Souter Tel: +44 113 233 5460
    Lecturer & Senior Admissions Tutor Fax: +44 113 233 5468
    School of Computer Studies
    University of Leeds
    Leeds LS2 9JT
    UK Email: cs@scs.leeds.ac.uk
    =========================================================================


