MathGroup Archive 2010

[Date Index] [Thread Index] [Author Index]

Search the Archive

Re: Importing "Plaintext" from PDF

  • To: mathgroup at smc.vnet.net
  • Subject: [mg113507] Re: Importing "Plaintext" from PDF
  • From: Helen Read <readhpr at gmail.com>
  • Date: Sun, 31 Oct 2010 02:10:18 -0500 (EST)
  • References: <iagldl$3cm$1@smc.vnet.net>

On 10/30/2010 4:36 AM, Mark Coleman wrote:
> Hi,
>
> I'm attempting to use Mathematica (v7.01) to Import the text from a PDF file.
> If I simply Import[] the file, it returns a list of graphics objects
> representing each page of the file. If I use use "Plaintext" option of
> Import[], it returns an empty list. My source pdf files were obtained
> from Google's Patent Search function. Just wondering if I there is
> some option I am missing or if Mathematica cannot Import text from pdf files.

I just tried Import with the "Plaintext" option on a pdf that had text 
in it, and it worked fine. The pdf you are working with might have 
originated from a scanned document with each page saved as an image, in 
which case there isn't any Plaintext to Import. Wait for Mathematica 8, 
though :-)

-- 
Helen Read
University of Vermont


  • Prev by Date: Re: Condensed syntax
  • Next by Date: Re: Condensed syntax
  • Previous by thread: Re: Importing "Plaintext" from PDF
  • Next by thread: Re: Importing "Plaintext" from PDF