Corpora: Summary: Corpus of Search Engine Queries

Brian Ulicny (Brian_Ulicny@inso.com)
Fri, 17 Apr 1998 10:45:47 -0400

Thanks to the two correspondents who addressed my query about finding or
compiling a corpus of search engine queries. The following two service
broadcast web search engine queries to an automatically updated page, so a
corpus could be compiled from them.

1. www.fireball.de (a search engine specialized on German webpages) has
a
feature they call 'live search' which I think is similar to what
Magellan used
to offer. If you hear of more engines that would have that feature, I'd
be
interested to hear about it.
2. Take a look at MetaSpy at http://www.metacrawler.com

If there are other such service, or if anyone has done any analyses of the
linguistic properties of search engine queries, I'd be interested to know.

Brian Ulicny, PhD
Senior Software Linguist
Inso Corporation
31 St James Ave
Boston MA 02116
www.inso.com