Re: Corpora: Search engine for webserver

Bruce L. Lambert (lambertb@uic.edu)
Fri, 14 May 1999 14:16:54 -0500

Take a look at Udi Manber's site.

http://donkey.cs.arizona.edu/udi.html

-bruce

At 01:45 PM 5/14/99 -0500, you wrote:
>I'm considering putting on the web portions of my corpus of historical
>Spanish prose (http://138.87.135.33/personal/espanol.htm). I've been
>looking forever (it seems) for a search engine that I can install on my
>machine that will provide access to the texts. Here are the requirements:
>
>1) Can run on NT Server 4.0
>
>2) Be able to do rather complex wildcard, proximity, and Boolean searches.
>In the ideal world, users could search for something like:
> quer* + [lo OR la OR los OR las] + *er
> (some forms of querer "to want" + some DO pronouns + some infinitives
>ending in -er)
> (giving, for example, queria lo fazer: OldSp "I wanted to do it")
>I've found search engines that have some degree of proximity searching and
>others that allow limited wildcard searches (usually for suffixes: e.g.
>play* = play, playing, plays, played, etc., but rarely the beginning of
>words: *ing, or the middle of words: int*net). But I have yet to find one
>that allows more robust wildcard searching, and especially one that allows
>it to be combined with proximity or Boolean searches.
>
>3) Provide hits from texts in context, but NOT ALLOW direct access to the
>entire texts themselves (i.e. to prevent users from just downloading all of
>the texts to their computer).
>
>4) Costs $5000 or less, and allows a demonstration copy (i.e. one that
>times out) to be loaded onto the webserver to see if it can do what I need.
>
>Finding such a web-based search engine for my server has been kind of a
>Holy Grail for me for the past 3-4 years. I've already looked extensively
>through sites like www.searchtools.com, but haven't found anything that fits
>the bill (at least from what I see there).
>
> Any suggestions? Thanks in advance,
>
>Mark Davies
>
>
>==================================================================
>Mark Davies, Associate Professor, Spanish Linguistics
>Dept. of Foreign Languages, Illinois State University
>Normal, IL 61790-4300
>
>Voice:309/438-7975 email:mdavies@ilstu.edu
>Fax:309/438-8038 http://138.87.135.33/personal/
>==================================================================
>
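
For illustration only, the query in requirement 2 (quer* + [lo OR la OR
los OR las] + *er) can be approximated with ordinary regular expressions.
The following is a minimal sketch in Python, not a feature of any product
mentioned in this thread. It assumes the three slots must be adjacent words
separated by whitespace, and the function names (wildcard_to_regex,
build_pattern, hits_in_context) are hypothetical. Returning only a short
window around each hit is one simple way to approach requirement 3.

```python
import re

def wildcard_to_regex(token):
    """Turn a token such as 'quer*', '*er' or 'int*net' into a one-word regex."""
    # Each '*' may stand for zero or more word characters.
    return r"\w*".join(re.escape(part) for part in token.split("*"))

def build_pattern(slots):
    """Each slot is a list of alternatives (Boolean OR); slots are adjacent words."""
    groups = [
        "(?:" + "|".join(wildcard_to_regex(t) for t in slot) + ")"
        for slot in slots
    ]
    return re.compile(r"\b" + r"\s+".join(groups) + r"\b", re.IGNORECASE)

def hits_in_context(text, pattern, width=30):
    """Return each hit with up to `width` characters on either side."""
    return [
        text[max(0, m.start() - width): m.end() + width]
        for m in pattern.finditer(text)
    ]

# The example query from the message: quer* + [lo OR la OR los OR las] + *er
query = [["quer*"], ["lo", "la", "los", "las"], ["*er"]]
pattern = build_pattern(query)

# Made-up sample sentence, used only to exercise the pattern.
sample = "E dixo que queria lo fazer luego, mas non quiso la ver."
matches = [m.group(0) for m in pattern.finditer(sample)]
contexts = hits_in_context(sample, pattern, width=10)
```

A true proximity search (slots within N words of each other rather than
strictly adjacent) would need a looser separator between the groups, and a
corpus of any size would need an index rather than a linear scan.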

Bruce Lambert, PhD
Department of Pharmacy Administration
University of Illinois at Chicago
833 S. Wood St. (M/C 871)
Chicago, IL 60612-7231

phone: 312-996-2411
fax: 312-996-0868