Re: Corpora: Reference

From: Mitch Marcus (mitch@linc.cis.upenn.edu)
Date: Mon Feb 12 2001 - 18:09:19 MET

  • Next message: Bruce Lambert: "Corpora: Mitch Marcus"

    Mari,

    Someone is confusing me with Mark Liberman, which isn't all that
    unusual. Mark at an invited talk at some ACL annual meeting or other
    presented a list of new words in the AP newswire that occured after
    100 million words of text. None of the words were that unusual. I
    don't think he published this anywhere.

     Mitch

    :
    :Can anyone provide a reference for a purported study, in which someone
    :analyzed the Wall Street Journal for new words, the number of which tailed
    :off to 20 words per (month? week?) after a certain point? Or is this an NLP
    :urban legend? A colleague recalls Mitch Marcus pointing out that the rate of
    :new word occurrences does not asymptote but rather continues at some small
    :but non-trivial rate, but not whether this is Marcus' own study, an
    :observation, or a reference to another work.
    :
    :Thanks,
    :
    :Mari Olsen
    :Microsoft-Natural Language Group
    :



    This archive was generated by hypermail 2b29 : Mon Feb 12 2001 - 22:14:17 MET