Re: Corpora: Conversion of PDF files

From: Simon G. J. Smith (smithsgj@eee.bham.ac.uk)
Date: Thu May 24 2001 - 12:40:17 MET DST

  • Next message: Sattar.Izwaini@stud.umist.ac.uk: "Re: Corpora: Conversion of PDF files"

     MSword -- www.adobe.com will do free conversions FROM word (they get emailed back to you, and you can only do abt 5 per email address), but I don't know about the other way round.

    To extract text:

    from acrobat (mine is 4.0) choose the text select tool (capital T with a little box). Then just cut and paste the text you want. This works one page at a time.

    From ghostview (if it can read your particular PDF, sometimes doesn't work for me), do the whole thing at once by Edit|Text Extract. It's in the gsview help.

    You can convert whole pages to bitmaps with gsview, and I think in Acrobat you can select graphics from the pdf file (the Acrobat help says use the graphics select tool, but I can't find this tool). The bitmap file can then be viewed from Word.



    This archive was generated by hypermail 2b29 : Thu May 24 2001 - 12:35:47 MET DST