Re: Excel Headers --> Mathematica
- To: mathgroup at smc.vnet.net
- Subject: [mg127455] Re: Excel Headers --> Mathematica
- From: Bill Rowe <readnews at sbcglobal.net>
- Date: Thu, 26 Jul 2012 03:33:45 -0400 (EDT)
- Delivered-to: l-mathgroup@mail-archive0.wolfram.com
- Delivered-to: mathgroup-newout@smc.vnet.net
- Delivered-to: mathgroup-newsend@smc.vnet.net
On 7/25/12 at 2:31 AM, nkormanik at gmail.com (Nicholas Kormanik) wrote: >I have a large Excel .xls file. Miniature version shown below: >Test1 Test2 >75 64 >58 85 >81 55 >64 63 >70 94 >88 90 >78 96 >96 98 >54 97 >75 61 >I would like to do some statistical analysis on the file in >Mathematica 8.0.4. >Additionally I am using Mathematica Link for Excel 3.5. I thought this software was designed to allow usage of Mathematica functions within Excel. If so, it really isn't relevant to the problem you describe below >The issue I'm now attempting to address concerns the "headers," or >"column labels." Given your file apparently has only one row of header information, probably the easiest thing to do is simple drop the first row after reading the data into Mathematica. I would do this as follows: data = Import[filename.xls,"XLS"][[1,2;;]]; which assumes the data of interest is in the first worksheet of the Excel file. Alternatively, if there is an indeterminate number of header rows or I simply didn't want to count them for whatever reason and the data of interest is all numeric, I would do something like: data = Cases[Import[filename.xls,"XLS"][[1]],{_?NumericQ,_?NumericQ}]; This will read the data in the first worksheet on get rid of any rows that don't import as a pair of numbers. Note, this pattern is very restrictive. Comments in any column would cause that row to be dropped. If the data was in say columns 3 and 4 of your worksheet, all data would be dropped and the result would be an empty list assigned to the Mathematica variable data. The first method is not so restrictive. But this also means if there are additional rows below the data, information in other columns etc, those rows will also get assigned to the variable data and likely will cause issues with further Mathematica computations.