Re: Corpora: phrase (n-gram) frequency information

Tony Berber Sardinha (tony4@uol.com.br)
Mon, 28 Jun 1999 23:28:50 -0300

Dear David

There's a frequency list of n-grams from the Brown corpus on my web pages,
listed below.

cheers
tony
-------------------------------
Dr Tony Berber Sardinha
Catholic University of Sao Paulo, Brazil
tony4@uol.com.br
http://sites.uol.com.br/tony4/homepage.html
http://homepages.infoseek.com/~corpuslinguistics/homepage.html
-------------------------------

----------
> From: david sarokin <sarokin.david@epamail.epa.gov>
> To: corpora@hd.uib.no
> Subject: Corpora: phrase (n-gram) frequency information
> Date: 28 June 1999 10:14
>
> Hello the list! Does anyone have information to offer on the most
> common English phrases in use in a given body of text? That is, what
> 4-word, 5-word (10-word, whatever) phrases appear most frequently in the
> Bible, in Shakespeare, in Tom Clancy novels, in newspapers, in any known
> corpora? Any information on this would be greatly appreciated.
>
> Thanks.
>
> David Sarokin
> sarokin.david@epa.gov
> 202-260-6396
>
>
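
For readers who want to compute such lists for their own texts, here is a
minimal sketch of n-gram frequency counting. It is not the method used to
build the Brown corpus list mentioned above. The function name ngram_counts,
the crude regular-expression tokenizer, and the sample sentence are all
assumptions made for illustration.

```python
from collections import Counter
import re


def ngram_counts(text, n):
    """Count every run of n consecutive word tokens in text.

    Hypothetical helper: tokenization here is deliberately crude
    (lowercased runs of letters and apostrophes), which is an assumption,
    not a standard used by any particular corpus.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    # Slide a window of width n across the token list; the range is empty
    # when the text has fewer than n tokens.
    grams = (" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    return Counter(grams)


# Illustrative sample text, not drawn from any corpus.
sample = ("in the beginning was the word and the word was with god "
          "and the word was god")
counts = ngram_counts(sample, 3)

# Print the most frequent 3-word phrases with their counts.
for phrase, freq in counts.most_common(5):
    print(freq, phrase)
```

Changing the second argument to 4, 5 or 10 gives the longer phrases asked
about. For a real corpus, the tokenizer would need to handle punctuation,
sentence boundaries and case according to the conventions of that corpus.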