Re: [Corpora-List] plain text extraction from ICE-GB...

From: Stefan Th. Gries (STGries@sitkom.sdu.dk)
Date: Mon Nov 22 2004 - 15:37:02 MET

Next message: Max Silberztein: "[Corpora-List] INTEX/NooJ WOrkshop 2005"

Previous message: Ute Römer: "Re: [Corpora-List] plain text extraction from ICE-GB -- found a solution..."
In reply to: Ute Römer: "[Corpora-List] plain text extraction from ICE-GB..."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Just use a grep command or some utility to extract all
sequences of
{*}
at the end of a line.
Best,
STG

Stefan Th. Gries
----------------------------------------
IFKI, Southern Denmark University
http://people.freenet.de/Stefan_Th_Gries
----------------------------------------

Ute Römer wrote:
> Dear all,
>
> In a pragmatics class I would like to use some real samples of classroom
> interaction (for my students to analyse the move structure in terms of IRF
> etc.). I thought I could simply take one or two of the transcripts included
> in ICE-GB but now I don't seem to find a way of copying (cutting/pasting)
> texts from ICECUP (in the browse text mode) to a Word or text file. My
> not-so-ideal solution was to enlarge font size and use screenshots, but I am
> not quite happy with that (some lines are cut off). When I go to the corpus
> files proper, I get all the markup material which I don't want either. Does
> anyone know of a way of extracting plain text from this corpus?
>
> Thanks and best wishes... Ute

Next message: Max Silberztein: "[Corpora-List] INTEX/NooJ WOrkshop 2005"
Previous message: Ute Römer: "Re: [Corpora-List] plain text extraction from ICE-GB -- found a solution..."
In reply to: Ute Römer: "[Corpora-List] plain text extraction from ICE-GB..."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Mon Nov 22 2004 - 15:45:25 MET