[Corpora-List] rare words

From: N M Chipere (n.chipere@reading.ac.uk)
Date: Wed Jun 18 2003 - 11:44:13 MET DST

  • Next message: Magali Jeanmaire: "[Corpora-List] ELRA news - Call for Participation: ESTER campaign"

    Dear all,

    Is anyone familiar with the issues surrounding the definition and
    measurement of word rarity? My colleagues and I are currently treating
    the first two thousand most frequent words in English as common words and
    the rest as rare (excluding proper nouns, numerals, etc). Apart from the
    issue of where one puts the cut-off point, there is an obvious problem to
    do with homographs, for which we don't have a simple solution.

    I'd be grateful for any feedback.

    Ngoni

    *********************************************************************
    Dr Ngoni Chipere
    Institute of Education
    The University of Reading
    Reading
    Berkshire RG6 1HY

    tel: 0118 987 5123 x 4943
    **********************************************************************



    This archive was generated by hypermail 2b29 : Wed Jun 18 2003 - 12:01:50 MET DST