Re: Importing "Plaintext" from PDF
- To: mathgroup at smc.vnet.net
- Subject: [mg113507] Re: Importing "Plaintext" from PDF
- From: Helen Read <readhpr at gmail.com>
- Date: Sun, 31 Oct 2010 02:10:18 -0500 (EST)
- References: <iagldl$3cm$1@smc.vnet.net>
On 10/30/2010 4:36 AM, Mark Coleman wrote: > Hi, > > I'm attempting to use Mathematica (v7.01) to Import the text from a PDF file. > If I simply Import[] the file, it returns a list of graphics objects > representing each page of the file. If I use use "Plaintext" option of > Import[], it returns an empty list. My source pdf files were obtained > from Google's Patent Search function. Just wondering if I there is > some option I am missing or if Mathematica cannot Import text from pdf files. I just tried Import with the "Plaintext" option on a pdf that had text in it, and it worked fine. The pdf you are working with might have originated from a scanned document with each page saved as an image, in which case there isn't any Plaintext to Import. Wait for Mathematica 8, though :-) -- Helen Read University of Vermont