Re: Corpora: PDF format

Lee Gillam (css1lg@ee.surrey.ac.uk)
Fri, 13 Aug 1999 13:01:58 +0100

The following URL describes one converter (to raw text) with the added
benefit of being free. There are also a number of commercial packages
available that will give other formats, those from Adobe probably
being the obvious recommendation.

http://www.research.digital.com/SRC/virtualpaper/pstotext.html
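
For a whole collection, a small wrapper script may save some typing. The
following is only a rough sketch in Python, not anything taken from the
pstotext documentation: it assumes pstotext (and the Ghostscript it relies
on) is installed and on the PATH, and that it prints the extracted text to
standard output when given a file name. The function names extract_text
and text_to_html are made up for illustration, and the HTML produced is
just escaped paragraphs with no attempt to recover the magazine layout.

```python
import html
import subprocess


def extract_text(pdf_path, command=("pstotext",)):
    """Run an external extractor on pdf_path and return what it prints.

    Assumption: the command takes the file name as its last argument and
    writes the raw text to standard output.
    """
    result = subprocess.run(
        [*command, str(pdf_path)],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the converter fails
    )
    return result.stdout


def text_to_html(text, title):
    """Wrap raw text in minimal HTML, one <p> per blank-line-separated block."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    body = "\n".join(f"<p>{html.escape(p)}</p>" for p in paragraphs)
    return (
        f"<html><head><title>{html.escape(title)}</title></head>\n"
        f"<body>\n{body}\n</body></html>\n"
    )
```

Something like text_to_html(extract_text("issue01.pdf"), "Issue 1") would
then give one HTML page per magazine, where issue01.pdf is a placeholder
file name. Proper SGML markup would still need to be added afterwards.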

> We have been given a collection of magazines in PDF. Is there any way
> of converting them to SGML / HTML or some other form in which the text
> is actually represented as text?
>
>
> **********************************************************************
> Martin Wynne Multilinguale Forschung
> Visiting Researcher Abteilung LEXIK
> wynne@ids-mannheim.de Institut fuer deutsche Sprache
> Tel: +49 621 1581 427 R5, 6-13
> Fax: +49 621 1581 415 D-68161 Mannheim
> +49 621 1581 200
> **********************************************************************

-- 

_/__/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
_/ L.Gillam@surrey.ac.uk
_/ http://www.mcs.surrey.ac.uk/showstaff?L.Gillam
_/__/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/

Lucky cake of the month: fish