Re: Corpora: MS Word to text

Gabriel Pereira Lopes (gpl@di.fct.unl.pt)
Fri, 03 Sep 1999 00:31:21 +0100

GET IN CONTACT COM NUNO MARQUES (nmm@di.fct.unl.pt)

Marco Antonio Esteves da Rocha wrote:

> Dear all,
> Someone has collected a sizable corpus of literary works and documents
> written in Brazilian Portuguese throughout the nineteenth century. It is a
> valuable asset for us here and it is been all typed in MS Word, thus it is
> impossible to use all those software resources you all know. Does anyone
> know about a way to transform these .doc files into ASCII text files
> without having to do that one by one ? If you feel tempted to suggest
> sitting on the curb and crying, please don't.
> Marco Rocha
> marcor@cce.ufsc.br

--
José Gabriel Pereira Lopes
Departamento de Informática
Faculdade de Ciências e Tecnologia
Universidade Nova de Lisboa
Quinta da Torre
2825-114 Caparica
Portugal
Tel.:351-(0)1-294 85 36
Fax: 351-(0)1-294 85 41
e-mail: gpl@di.fct.unl.pt