Corpora: Re: What is a Corpus

From: Adam Kilgarriff (ak28@itri.brighton.ac.uk)
Date: Mon Feb 07 2000 - 10:08:39 MET

    Re: What is a corpus?

            Enough of the essays - let's get quantitative!

    As Vladimir says,

    > what is a corpus, is it balanced or/ and
    > representative.
     ...

    > Is there a quasi-logical procedure of defining - is this
    > collection (dump) of textual data a representative corpus? This is the
    > starting point of all the following activity - is it scientific one or
    > paid hobby?

    We are planning a workshop at ACL 2000 (Hong Kong, October),

            Title: "Comparing corpora"

    with the objective of seeking quantitative responses to the issue.
    Watch this space.

    %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
    Adam Kilgarriff
    Senior Research Fellow                      tel:   (44) 1273 642919
    Information Technology Research Institute          (44) 1273 642900
    University of Brighton                      fax:   (44) 1273 642908
    Lewes Road
    Brighton BN2 4GJ                            email: Adam.Kilgarriff@itri.bton.ac.uk
    UK                                          http://www.itri.bton.ac.uk/~Adam.Kilgarriff
    %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


