Re: [Corpora-List] plain text extraction from ICE-GB...

From: Stefan Th. Gries (STGries@sitkom.sdu.dk)
Date: Mon Nov 22 2004 - 15:37:02 MET

  • Next message: Max Silberztein: "[Corpora-List] INTEX/NooJ WOrkshop 2005"

    Just use a grep command or some utility to extract all
    sequences of
    {*}
    at the end of a line.
    Best,
    STG

    Stefan Th. Gries
    ----------------------------------------
    IFKI, Southern Denmark University
    http://people.freenet.de/Stefan_Th_Gries
    ----------------------------------------

    Ute Römer wrote:
    > Dear all,
    >
    > In a pragmatics class I would like to use some real samples of classroom
    > interaction (for my students to analyse the move structure in terms of IRF
    > etc.). I thought I could simply take one or two of the transcripts included
    > in ICE-GB but now I don't seem to find a way of copying (cutting/pasting)
    > texts from ICECUP (in the browse text mode) to a Word or text file. My
    > not-so-ideal solution was to enlarge font size and use screenshots, but I am
    > not quite happy with that (some lines are cut off). When I go to the corpus
    > files proper, I get all the markup material which I don't want either. Does
    > anyone know of a way of extracting plain text from this corpus?
    >
    > Thanks and best wishes... Ute



    This archive was generated by hypermail 2b29 : Mon Nov 22 2004 - 15:45:25 MET