Re: Corpora: Santa Barbara Corpus

From: Lou Burnard (lou.burnard@computing-services.oxford.ac.uk)
Date: Mon Aug 07 2000 - 17:31:27 MET DST

    On Fri, 4 Aug 2000, Christopher Cieri wrote:

    |I can certainly try to help. Because we expected the Santa Barbara
    |Corpus of Spoken American English (SBCSAE) to be used in multiple
    |research communities where different computer platforms and software are
    |common, we have tried to avoid depending upon any specific set of tools.
    |The corpus contains only data; there is no software to install. Indeed,
    |the data is stored on the CDs in uncompressed format so that you can
    |read the transcripts or listen to the audio directly from CD.

    Hmm. So instead of using pre-existing standards which at least have a
    chance of being implemented across different computer platforms, it's
    better to make up an entirely arbitrary set of codes of your own for
    which *everyone* has to write their own software?

    Ah well.

     ----------------------------------------------------------------
     Lou Burnard http://users.ox.ac.uk/~lou
     ----------------------------------------------------------------


