Re: quartiles
- To: mathgroup at smc.vnet.net
 - Subject: [mg18232] Re: [mg18214] quartiles
 - From: "Wolf, Hartmut" <hwolf at debis.com>
 - Date: Thu, 24 Jun 1999 14:24:16 -0400
 - Organization: debis Systemhaus
 - References: <199906230041.UAA04362@smc.vnet.net.>
 - Sender: owner-wri-mathgroup at wolfram.com
 
Hello Tom, 
Tom De Vries schrieb:
> 
> Hello !
> 
> I am working with some statistics problems and have a question about finding
> quartiles for a set of data
> 
> Load in the package
> 
> Needs["Statistics`DescriptiveStatistics`"]
> 
> create a set of data
> 
> salaries =
> {250000,100000,60000,60000,40000,40000,40000,40000,25000,20000,20000,20000,
>   18000,16000,16000}
> 
> ask for the quartiles
> 
> Quartiles[salaries]
> 
> and this is the response
> {20000,40000,55000}
> 
> At this point I am probably revealing my ignorance of statistics....
> 
> 250000,100000,60000,60000,40000,40000, 40000
> 40000,  Median
> 25000,20000,20000,20000,18000,16000,16000
> 
> The lower quartile is the median of the values below the median, which I get
> with Mathematica
> 25000,20000,20000,
> 20000,
> 18000,16000,16000
> 
> The upper quartile should be the median of the numbers above the median, so
> why is it 55000?
> 250000,100000,60000,
> 60000,
> 40000,40000, 40000
> 
> Does Mathematica use some algorithm to get rid of outliers before finding
> quartiles,  or does it eliminate the median from the data set before finding
> the quartiles, .....?
> 
> The set of data I used as an example was taken from the math text I am using
> and the answer the text supplies, and the answer I think I should get, is
> different from the one Mathematica gets.  I would appreciate any advice on this!
> 
In[3]:= Needs["Statistics`DescriptiveStatistics`"]
In[4]:= salaries =
{250000,100000,60000,60000,40000,40000,40000,40000,25000,
 20000,20000,20000,18000,16000,16000}
In[5]:= Quartiles[salaries] 
Out[5]= {20000,40000,55000}
How is this to be interpreted? Look at
In[8]:= ?Quartiles
"Quartiles[list] gives a list of the interpolated .25, .50, and .75
quantiles of the entries in list."
Regard "interpolated". However if you do
In[9]:= Quantile[salaries,#]&/@{1/4,1/2,3/4}
Out[9]= {20000,40000,60000}
which might be what you expect. 
But I think your definitions of lower and upper quartile are imprecise
(what does "values below/above the median" mean exactly?) and sometimes
incorrect. See:
In[17]:= distr={1,1,1,2,3,3,3,3,3,3,4,4,4,4,4};
In[18]:= Quantile[distr,#]&/@{1/4,1/2,3/4}
Out[18]= {2,3,4}
But
In[19]:= Median[With[{m=Median[distr]},Select[distr,#<m&]]]
Out[19]= 1
In[20]:= Median[With[{m=Median[distr]},Select[distr,#>=m&]]]
Out[20]= 3
kind regards, hw
- References:
- quartiles
- From: "Tom De Vries" <tdevries@shop.westworld.ca>
 
 
 - quartiles