Corpora: PWA - word alignment software available

From: Jörg Tiedemann (joerg@stp.ling.uu.se)
Date: Fri Jan 26 2001 - 18:17:22 MET

  • Next message: Andrew Kehoe: "Corpora: Analysis and Prediction of Innovation in the Lexicon"

    The PLUG Word Aligner - PWA

    http://stp.ling.uu.se/plug/pwa/
    pwa@stp.ling.u.se

    About PWA

    The PLUG Word Aligner (PWA) is a collection of tools for the automatic
    alignment of word correspondences in bilingual parallel texts. The system
    integrates a set of modules for knowledge-lite approaches to word
    alignment, with various possibilities to change configuration and to adapt
    the system to other language pairs and text types. The system requires
    sentence aligned bitexts as its input and produces a list of word and
    phrase correspondences in the text (link instances) and an additional
    bilingual lexicon from these instances (type links).

    PWA comprises 2 word alignment systems, the Linköping Word Aligner (LWA)
    and the Uppsala Word Aligner (UWA). Both system were developed within the
    the co-operative project on parallel text, PLUG, that was carried out
    between November, 1997 and March, 2000. The system was developed at the
    Department of Computer and Information Science at Linköping University,
    Linköping/Sweden and the Department of Linguistics at Uppsala University,
    Uppsala/Sweden. PWA integrates both systems in the modular corpus toolbox
    Uplug and includes additional tools for the automatic generation of
    monolingual word collocations (phrases) and for the automated evaluation
    of alignment results (the PLUG Scorer - PLS).

    Download

    PWA is available to the research community from this page free of charge
    after the signing of a licence agreement with the proprietors.
    PWA is available as binary distribution for the following operating
    systems

          Linux
          MS Windows (Win32)

    Restricted demo-versions of PWA are available now! The demo-versions
    are free for downloading and may be distributed freely without any
    charge. The demo-version is restricted to process the included text corpus
    only. The distribution contains a Swedish/English sentence aligned
    corpus consisting of the declarations of the Swedish government. All PWA
    modules are adjusted to run on this text only. It is strictly prohibited
    to change any part of the distribution in any way in order to run the
    system on other text material and for other purposes than testing the
    system.
    Free demo-versions are available under this license agreement as binary
    distributions:

          PWA demo for Linux - v0.92d October 05, 2000 (4.6 MB)
            http://stp.ling.uu.se/plug/pwa/pwa-linux-demo.tar.gz
          (tested on Red Hat Linux, release 6.2)

          PWA demo for MS Windows - v0.92d October 05, 2000 (6.3 MB)
            http://stp.ling.uu.se/plug/pwa/pwa-win-demo.exe
          (tested on Windows NT 4.0, Service Pack 5)

    ***********/\/\/\/\/\/\/\/\/\/\/\************************************
    ** Joerg Tiedemann joerg@stp.ling.uu.se **
    ** Department of Linguistics http://stp.ling.uu.se/~joerg/ **
    ** Uppsala University tel: (018) 471 7007 **
    ** S-751 20 Uppsala/SWEDEN fax: (018) 471 1416 **
    *************************************/\/\/\/\/\/\/\/\/\/\/\**********



    This archive was generated by hypermail 2b29 : Fri Jan 26 2001 - 18:13:48 MET