Announcement INL 27 Million Words Dutch Newspaper Corpus 1995

ROB@rulxho.LeidenUniv.nl
Wed, 24 May 1995 10:41:45 +0100 (MET)

INSTITUUT VOOR NEDERLANDSE LEXICOLOGIE

On-line access to 27 million Words Dutch Newspaper Corpus for
non-commercial purposes.

The Institute for Dutch Lexicology INL offers you the possibility to
consult a text corpus of over 27 million words of Dutch newspaper text, by
the international computer network. In 1994, a 5 Million Words Corpus with
diversified composition has been made accessible in a similar way.

The retrieval system is essentially the same as that for the 5 Million
Words Corpus 1994. It allows you to search for single words or for word
patterns, including some predefined syntactic patterns that can be changed
by the user. Searches concern the levels of word form, part of speech
(POS), and head word, both separately and in combination by use of Boolean
operators and proximity searches. During the search, data concerning
frequency and distribution over the texts are provided at several levels.
The output most often is a list of items, or a series of concordances
(words in context) with a variable, user-defined textual context. Sorting
facilities may support your analysis of the output data. With some
limitations due to copyright, the output of your searches can be transfered
to your own computer by e-mail. It is not allowed to transfer complete
texts or substantial text parts.

Most of the data has not been corrected, neither on the level of the text,
nor on the level of POS and headword. POS and headword have automatically
been assigned to the word forms in the electronic text by lingware
developed at the INL.

The provider of the texts has given permission for use of the materials for
non-commercial, research purposes only.

Please note that for an optimal use of the retrieval system, the use of a
VT 220 (or higher) terminal, or an appropriate terminal-emulator (e.g.
Kermit) is recommended.

In order to get access to this corpus, an individual user agreement has to
be signed. An electronic user agreement form can be obtained from our
mailserver Mailserv@Rulxho.Leidenuniv.NL. Type in the body of your e-mail
message: SEND [27MLN95]AGREEMNT.USE. For access to the 5 Million Words
Corpus 1994, a separate user agreement is required, which can be obtained
from the same mailserver, by the message SEND [5MLN94]AGREEMNT.USE .

Please make a hard copy of the agreement form, sign it, keep a copy
yourself, and return a signed copy to: Institute for Dutch Lexicology INL,
P.O. Box 9515, 2300 RA Leiden. Fax: 31 71 27 2115.

After receipt of the signed user agreement, you will be informed about your
username and password.

If you need additional information, please send an e-mail message to
Helpdesk@Rulxho.Leidenuniv.NL, or send a fax to Mrs. dr. J.G. Kruyt.