Corpora: On-line KWIC system in PHP

From: Antonio Ruiz Tinoco (a-ruiz@hoffman.cc.sophia.ac.jp)
Date: Fri Jan 05 2001 - 09:01:01 MET

  • Next message: Kai Noponen: "Corpora: Comprehensive corpus/dictionary/lexicon"

    Hello,

    May I introduce myself as a new member of this list?
    I'm an associate professor of Spanish Linguistics at Sophia University,
    Tokyo.

    I am interested in processing corpora (mainly in Spanish and Japanese) and
    now I am preparing some exercises for my students for the new course
    beginning next April. What I am trying to do is a Web KWIC system using only
    (or mainly) PHP.

    Is there anybody using PHP for this purpose? For big corpora I am developing
    a system with PHP and MySQL, and I think that its response time is quite
    fast compared with PERL even without a backend database. I am not using by
    now any type of indexation, just putting the text in ASCII code. It works
    with almost any language in plain text files, including Japanese.

    For test purposes, I am using the following web page:
     http://133.12.37.60/kwic/kwic.html
    This page is in Spanish but it is rather simple. Just put any string in the
    window, choose a corpus and press. Regular expressions can be tested in
     http://133.12.37.60/kwic/kwic-reg.html

    Of course, I will release the code when it is “viewable”. In a few months?
    I would be most grateful for any comments. Thanks.

    Antonio Ruiz Tinoco
    Sophia University, Tokyo
    a-ruiz@hoffman.cc.sophia.ac.jp



    This archive was generated by hypermail 2b29 : Fri Jan 05 2001 - 09:03:17 MET