Re: newspaper corpora

Ron Zweig (ron@rambam.tau.ac.il)
Tue, 10 Dec 1996 15:59:04 +0200 (IST)

On Tue, 10 Dec 1996, Lou Burnard wrote:

> Does anyone know of any English language newspapers which could be used
> for corpus analysis, or web resources about newspaper corpora? I currently
> have The Times, Sunday Times, Guardian, Telegraph and Sheffield Electronic
> press.
> If anyone knows of any other national or local newpapers available over the
> web ideally, but also on CD, I be most grateful if they could let me know.

Will 18,000 pages (approx 100 million words) of the Palestine Post
(1939-1948) [or 30,000 pages, 1932-1950 - soon] do? The format is
searchable immages - you can do string searches and get the image of the
original page with the hit terms highlighted.

Lou: do u still have the demo CD?

Ron Zweig
Tel Aviv University

ron@rambam.tau.ac.il