MathGroup Archive 2011

[Date Index] [Thread Index] [Author Index]

Search the Archive

Re: create/insert information for missing data points

  • To: mathgroup at smc.vnet.net
  • Subject: [mg119916] Re: create/insert information for missing data points
  • From: Ray Koopman <koopman at sfu.ca>
  • Date: Wed, 29 Jun 2011 05:29:48 -0400 (EDT)

On Mon, 27 Jun 2011 at 07:32:55 -0400 (EDT),
lillydogy <lillydogy at yahoo.com> wrote:
> how do I create and insert information for missing data?
>
> have an experiment running in military time, and sometimes it has
> missing data points, sometimes three missing data points in a row or
> more. I want to take the average of the two adjacent (joining) data
> points and insert for the missing point/s. for example, these would
> be the missing points inserted for the example below.
> {305,38.5},{311,40.5},{312,40.5},{313,40.5}.
>
> before :(
> listmissing = {{301,44}, {302,46}, {303,40}, {304,30}, {306,47},
>                {307,37}, {308,42}, {309,40}, {310,43}, {314,38}}
>
> after :)
> listfull = {{301,44}, {302,46}, {303,40}, {304,30}, {305,38.5},
>             {306,47}, {307,37}, {308,42}, {309,40}, {310,43},
>             {311,40.5}, {312,40.5}, {313,40.5}, {314,38}}
>
> btw, these are millitary times, so it goes 959,1000....1059,1100...
> etc because there are only 60 minutes in an hour. there would be
> no such thing as 1392, because 1:92pm does not exist.

Here are two utility functions.
The first converts military time to minutes.
The second goes the other way.

  mil2min[mil_] := Quotient[mil,100]*60 + Mod[mil,100]
  min2mil[min_] := Quotient[min,60]*100 + Mod[min,60]

Both functions are Listable.

  mil2min@{957,958,959,1000,1001,1002,1003}
  min2mil@%

  {597, 598, 599, 600, 601, 602, 603}
  {957, 958, 959, 1000, 1001, 1002, 1003}

The first step is to convert to minutes. Frem here on we work with xy.

  xy = MapAt[mil2min,#,1]&/@listmissing

  {{181,44}, {182,46}, {183,40}, {184,30}, {186,47},
   {187,37}, {188,42}, {189,40}, {190,43}, {194,38}}

This fills in the missing data by linear interpolation.
I know you asked for averages, but this is easier and
(IMHO) probably more reasonable.

  L = Interpolation[xy,InterpolationOrder->1];
  {min2mil@#,L@#}& /@ Range[xy[[1,1]],xy[[-1,1]]] /. y_Rational:>N@y

  {{301,44}, {302,46}, {303,40}, {304,30}, {305,38.5},
   {306,47}, {307,37}, {308,42}, {309,40}, {310,43},
   {311,41.75}, {312,40.5}, {313,39.25}, {314,38}}

Inserting averages is tricky because the function produced
by Interpolation does not follow the documentation when
InterpolationOrder->0 is specified: given the first x-value,
it returns the second y-value instead of the first.

  f = Interpolation[xy,InterpolationOrder->0];
  p = Prepend[ Rest[f/@Range@@xy[[{1,-1},1]]],xy[[1,2]]];
  g = Interpolation[Reverse[{-1,1}#&/@xy],InterpolationOrder->0];
  q = Append[Most[Reverse[g/@Range@@-xy[[{-1,1},1]]]],xy[[-1,2]]];
  Transpose@{min2mil[Range@@xy[[{1,-1},1]]],(p+q)/2} /. y_Rational:>N@y
  % == listfull

  {{301,44}, {302,46}, {303,40}, {304,30}, {305,38.5},
   {306,47}, {307,37}, {308,42}, {309,40}, {310,43},
   {311,40.5}, {312,40.5}, {313,40.5}, {314,38}}

  True

>
> this is another example of how the data looks
> {{957,6}, {958,4}, {959,5}, {1000,5}, {1001,4}, {1002,2}, {1003,3}}


  • Prev by Date: Re: How to find the difference between two lists (opposite of union[]?)
  • Next by Date: Re: How to find the difference between two lists (opposite of union[]?)
  • Previous by thread: create/insert information for missing data points
  • Next by thread: Stylesheet questions