Re: [Corpora-List] Re. Concordancer for Chinese (Summary of reply)

From: Kiril Simov (kivs@bultreebank.org)
Date: Tue Oct 08 2002 - 13:14:53 MET DST

  • Next message: Sylvain Loiseau: "[Corpora-List] French philosophers corpus"

    Dear Linda,

    I am not a specialist in Chinese, but if your documents are Unicode based
    and can be represented as XML documents, you could try our system:
    CLaRK. It is an XML-based system for corpora development and it
    includes an Unicode XML Editor, XPath language for navigation in
    XML documents, XSLT engine for tranformation of XML documents,
    Cascaded Regular Grammars, Constraints over XML documents,
    Tokenizers, Concordance tool, Extract, Remove and other tools.
    The system is freely available at:

    http://www.bultreebank.org/clark/index.html

    With best regards,

    Kiril

    -----------------------------------------------------------------
    Kiril Simov
    BulTreeBank Project
    Linguistic Modelling Laboratory, CLPP,
    Bulgarian Academy of Sciences
    Acad. G.Bonchev St. 25A
    1113 Sofia, Bulgaria
    E-mail: kivs@bultreebank.org
    Web: http://www.bultreebank.org/
    -----------------------------------------------------------------
    ----- Original Message -----
    From: "Linda Lin" <eclindal@polyu.edu.hk>
    To: "Josephine Lo" <ENJOSELO@cityu.edu.hk>; <CORPORA@HD.UIB.NO>
    Cc: "john flowerdew" <ENJOHNF@cityu.edu.hk>
    Sent: Monday, October 07, 2002 12:15 PM
    Subject: [Corpora-List] Re. Concordancer for Chinese (Summary of reply)

    > Dear All
    >
    > Thanks for your information about the concordancers for Chinese language.
    I
    > have a question regarding the use of these concordancers. Do you think the
    > recommended concordancers such as MonoConc Pro can only recognize
    individual
    > characters, not actual "words" i.e. strings of characters, or they can in
    > fact process actual "words"?
    >
    >
    > Thanks.
    >
    > Linda
    >
    > ----- Original Message -----
    > From: Josephine Lo <ENJOSELO@cityu.edu.hk>
    > To: <CORPORA@HD.UIB.NO>
    > Sent: Wednesday, October 02, 2002 10:01 AM
    > Subject: [Corpora-List] Concordancer for Chinese (Summary of reply)
    >
    >
    > Some times ago I ask for recommendation on concordancers working on
    Chinese
    > characters and thanks for the responses from the following linguists:
    >
    > Michael Barlow:
    > MonoConc Pro should work if you are using Chinese Windows. You
    > would have to use the regex search option in advanced search due to
    > the lack of spaces. You should try the demo at athel.com
    >
    > Lou Burnard:
    > Any concordancer should be able to work with Chinese characters, but it
    > depends rather on how the characters are encoded.
    >
    > We are working on a version of Sara which is able to operate on Unicode,
    > and have been testing it against a Chinese file, which seems to work OK.
    >
    > Rafal L. Górski:
    > Try ConcApp http://vlc.polyu.edu.hk/PUB/concapp/. It is a freeware.
    >
    > Antoinette Renouf
    >
    > Simon G. J. Smith
    > The CKIP corpus, which you can link to from www.sinica.edu.tw , is
    > web-based and lets you do concordances. This is not, however, software
    that
    > can be used to process your own texts.
    >
    > Scott Piao:
    > I put a downloadable Java tool including multi-lingual concordancer on
    > webpage:
    > http://www.lancs.ac.uk/staff/piaosl/research/download/download.htm
    >
    > It has a Graphical interface, and easy to use. In order to run this tool,
    > you'll need to install Java Runtime Environment (JRE) first).
    >
    >
    >
    >
    >



    This archive was generated by hypermail 2b29 : Tue Oct 08 2002 - 13:15:32 MET DST