[Corpora-List] bnc word list

From: krausse (krausse@fh-nordhausen.de)
Date: Tue Jun 17 2003 - 14:37:33 MET DST


    Dear list members,

    I have been following the discussion on the size of a reference corpus,
    in which it was mentioned in passing where to get the BNC word list.
    This brings me to a problem that I have come across recently.

    To compare the word list of my corpus with the BNC, I would need a list
    containing just the words, without frequency information or tags. Being
    only as computer literate as the average languages person, I wonder
    whether such a plain list exists, or whether there is a way of removing
    the numbers and tags other than running the list through the
    search-and-replace function of my word processor. (I only want to find
    out which words in my corpus are not represented in the BNC.)
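
    One possible way to do this without a word processor is a short
    script. The sketch below is only an illustration under assumptions:
    it supposes that each line of the reference list holds
    whitespace-separated fields (for example frequency, word, tag) and
    that the word sits in a known column. The actual layout of the BNC
    list should be checked first. The function names, the column
    numbers, and the sample lines are all made up for the example.

```python
def read_words(lines, column=0):
    """Return the set of lowercased tokens found in `column` of each line.

    Assumes whitespace-separated fields; lines with too few fields are skipped.
    """
    words = set()
    for line in lines:
        fields = line.split()
        if len(fields) > column:
            words.add(fields[column].lower())
    return words


def words_missing_from_reference(corpus_lines, reference_lines,
                                 corpus_column=0, reference_column=0):
    """Return, in alphabetical order, the corpus words absent from the reference list."""
    corpus = read_words(corpus_lines, corpus_column)
    reference = read_words(reference_lines, reference_column)
    return sorted(corpus - reference)


# Invented sample data (NOT real BNC figures): "frequency word tag" per line,
# so the word is in column 1 of the reference list.
reference_sample = [
    "100 the at0",
    "90 of prf",
    "5 corpus nn1",
]
# Invented corpus word list: one word per line, so the word is in column 0.
corpus_sample = ["the", "corpus", "concordancer", "of"]

missing = words_missing_from_reference(
    corpus_sample, reference_sample, reference_column=1
)
print(missing)
```

    With real files, the two sample lists could be replaced by open file
    objects, since iterating over an open text file yields its lines.
    Lowercasing is a design choice here: it treats "The" and "the" as
    the same word, which may or may not suit a given comparison.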

    Many thanks in advance for any help/advice you might have.

    Sylvana Krausse




