Corpora: Expository text database

Yaari Yaakov (yyaari@macs.biu.ac.il)
Wed, 17 Dec 1997 19:19:07 +0200

I am looking for an online database of medium-sized expository texts. More
specifically, the texts should have the following attributes:

- Medium size: roughly 2000-3000 words.
- Content that is not too technical; popular science is OK.
- Text is segmented into sections and, if possible, into subsections. HTML,
SGML or LaTeX formats are OK. Tag-less text is OK too, as long as
section boundaries can be easily found.
- The ideal genre would be online (Web) articles in HTML.

Yaakov Yaari