Re: Corpora: Reference

From: Mitch Marcus (mitch@linc.cis.upenn.edu)
Date: Mon Feb 12 2001 - 18:09:19 MET


    Mari,

    Someone is confusing me with Mark Liberman, which isn't all that
    unusual. At an invited talk at some ACL annual meeting or other, Mark
    presented a list of new words in the AP newswire that first occurred
    after 100 million words of text. None of the words were that unusual.
    I don't think he published this anywhere.

     Mitch

    :
    :Can anyone provide a reference for a purported study, in which someone
    :analyzed the Wall Street Journal for new words, the number of which tailed
    :off to 20 words per (month? week?) after a certain point? Or is this an NLP
    :urban legend? A colleague recalls Mitch Marcus pointing out that the rate of
    :new word occurrences does not asymptote but rather continues at some small
    :but non-trivial rate, but not whether this is Marcus' own study, an
    :observation, or a reference to another work.
    :
    :Thanks,
    :
    :Mari Olsen
    :Microsoft-Natural Language Group
    :


