Corpora: THX: Wordsmith question

From: Van den Heuvel M Mev (MVDH@sun.ac.za)
Date: Thu Apr 04 2002 - 08:21:44 MET DST

  • Next message: David Grant: "Corpora: definition of anaphoric coreference"

    Thanks for all you very useful inputs. I am indeed using a Windoze system as
    you surmised, so the unix tools won't be that helpful to me. But I did take
    note of all the tips on diff and cygwin. Regarding the MS Word tip on
    comparing documents - that doesn't work too well in this case, since the
    comparison focuses on editing changes - so all the differences in
    capitalisation and punctuation that occur are shown, but not the missing
    items.

    I implemented a roundabout method in Wordsmith suggested by Oliver Mason and
    copied the two lists to a third document. The one list was copied twice and
    the other only once. I then calculated the word frequencies again in
    Wordlist, and the scores indicated what was missing where. A score of
    3=present in both lists; 2=present only in the list that was copied twice
    and 1=present only in the list that was copied once. Since I was in quite a
    hurry to get the work done, it seemed like a good idea at the time and it
    did save me a lot of time and numerous headaches.

    ***

    -----Original Message-----
    From: Rod de Lima-Lopes [mailto:limalopes@terra.com.br]
    Sent: 03 April 2002 19:05
    To: Van den Heuvel M Mev
    Cc: 'CORPORA@hd.uib.no'
    Subject: Re: Corpora: Wordsmith question

    Hi Maritza

    I think you should use detailed consistencie tool (in the wordlist
    programme), since it can tell you how quite acuratelly how many lexicon is
    shared by two wordlists

    go to wordlist > compararison > consistency (detailed

    I hope it helps

    []s Rod
    --------------------
    R.E. de Lima-Lopes
    http://rod.do.sapo.pt
    GNU/Linux User # 182240
    On Wed, 3 Apr 2002, Van den Heuvel M Mev wrote:

    > Hi everybody,
    >
    > I'm having a spot of trouble with the Wordlist tool in the Wordsmith suite
    > that I hope someone out there can help me with. I want to compare two
    almost
    > identical word lists containing the entries of a pronunciation lexicon.
    > There are some inconsistencies between the lists, i.e. items missing in
    the
    > one that should be in the other and vice versa. I need to identify the
    > missing words. I thought that I could use the "compare word lists"
    function
    > in Wordlist for this purpose by setting the minimum frequency to 1 word,
    but
    > it's not working. I'm obviously doing something wrong.
    >
    > If you don't have a quick answer to the Wordsmith problem, but know of
    > another tool that could help me do just this one little task with a few
    > button clicks, I would also appreciate your response!
    >
    > TIA.
    >
    > REgards
    > Maritza van den Heuvel
    >
    >
    > Research Unit for Experimental Phonology
    > STellenbosch University
    > South Africa
    >
    >



    This archive was generated by hypermail 2b29 : Fri Apr 05 2002 - 15:38:35 MET DST