 
 
 
 
 
 
Re: Excel Headers --> Mathematica
- To: mathgroup at smc.vnet.net
- Subject: [mg127491] Re: Excel Headers --> Mathematica
- From: Murta <rodrigomurtax at gmail.com>
- Date: Sun, 29 Jul 2012 03:05:01 -0400 (EDT)
- Delivered-to: l-mathgroup@mail-archive0.wolfram.com
- Delivered-to: l-mathgroup@wolfram.com
- Delivered-to: mathgroup-newout@smc.vnet.net
- Delivered-to: mathgroup-newsend@smc.vnet.net
- References: <juqs07$66i$1@smc.vnet.net>
On Thursday, July 26, 2012 4:35:35 AM UTC-3, Bill Rowe wrote:
> On 7/25/12 at 2:31 AM, nkormanik at gmail.com (Nicholas Kormanik) wrote:
> 
> 
> 
> >I have a large Excel .xls file.  Miniature version shown below:
> 
> 
> 
> >Test1   Test2
> 
> >75        64
> 
> >58        85
> 
> >81        55
> 
> >64        63
> 
> >70        94
> 
> >88        90
> 
> >78        96
> 
> >96        98
> 
> >54        97
> 
> >75        61
> 
> 
> 
> >I would like to do some statistical analysis on the file in
> 
> >Mathematica 8.0.4.
> 
> 
> 
> >Additionally I am using Mathematica Link for Excel 3.5.
> 
> 
> 
> I thought this software was designed to allow usage of
> 
> Mathematica functions within Excel. If so, it really isn't
> 
> relevant to the problem you describe below
> 
> 
> 
> >The issue I'm now attempting to address concerns the "headers," or
> 
> >"column labels."
> 
> 
> 
> Given your file apparently has only one row of header
> 
> information, probably the easiest thing to do is simple drop the
> 
> first row after reading the data into Mathematica. I would do
> 
> this as follows:
> 
> 
> 
> data = Import[filename.xls,"XLS"][[1,2;;]];
> 
> 
> 
> which assumes the data of interest is in the first worksheet of
> 
> the Excel file. Alternatively, if there is an indeterminate
> 
> number of header rows or I simply didn't want to count them for
> 
> whatever reason and the data of interest is all numeric, I would
> 
> do something like:
> 
> 
> 
> data = Cases[Import[filename.xls,"XLS"][[1]],{_?NumericQ,_?NumericQ}];
> 
> 
> 
> This will read the data in the first worksheet on get rid of any
> 
> rows that don't import as a pair of numbers. Note, this pattern
> 
> is very restrictive. Comments in any column would cause that row
> 
> to be dropped. If the data was in say columns 3 and 4 of your
> 
> worksheet, all data would be dropped and the result would be an
> 
> empty list assigned to the Mathematica variable data.
> 
> 
> 
> The first method is not so restrictive. But this also means if
> 
> there are additional rows below the data, information in other
> 
> columns etc, those rows will also get assigned to the variable
> 
> data and likely will cause issues with further Mathematica computations.
Isn't simple to use Rest function?
best regards
Murta

