MathGroup Archive: December 2010 [00767]

[Date Index] [Thread Index] [Author Index]

Re: newbie list question

To: mathgroup at smc.vnet.net
Subject: [mg115057] Re: newbie list question
From: ADL <alberto.dilullo at tiscali.it>
Date: Thu, 30 Dec 2010 04:08:22 -0500 (EST)
References: <iff4bh$rbh$1@smc.vnet.net>

On 29 Dic, 12:00, Carl Woll <ca... at wolfram.com> wrote:
> On 12/28/2010 5:53 AM, Daniel Lichtblau wrote:
>
>
>
> > ----- Original Message -----
>
> >> From: "Gareth Edwards"<gareth.edwa... at cubicmotion.com>
> >> To: mathgr... at smc.vnet.net
> >> Sent: Sunday, December 26, 2010 3:02:23 AM
> >> Subject:  newbie list question
> >> Hi,
>
> >> Liking Mathematica a lot, but struggling with the early part of the
> >> learning curve, i.e. don't know what I don't know...
>
> >> What would be the neatest syntax for finding the first location of
> >> elements from one list in another? For example:
>
> >> listA = { 4,5,8,2,6,4 }
> >> listB = { 8,4,2 }
>
> >> I would like a function to return {3,1,4} in this case (the first
> >> location in A of each element in B)
>
> >> Many thanks!
>
> > Neatest is subject to debate. A straightforward and simple method is to=
 Map Position (of element in first list) over the second list.
>
> > firstPositions1[l1_, l2_] := Flatten[Map[Position[l1, #, 1, 1]&, l2]]
>
> > This will be fine provided the second list is not too large; it scales as m1*m2 where mj is the length of list j. A method that is slower per element of list2, but scales as O(m1+m2), is shown below.
>
> > firstPositions2[l1_, l2_] := Module[
> >    {stor, elem, tab},
> >    tab = ConstantArray[0, Length[l2]];
> >    MapIndexed[If[! IntegerQ[stor[#1]], stor[#1] = #2[[1]]]&, l2];
> >    Do[
> >     elem = stor[l1[[j]]];
> >     If[IntegerQ[elem],
> >      stor[l1[[j]]] = False;
> >      tab[[elem]] = j;
> >      ], {j, Length[l1]}];
> >    tab]
>
> > Quick test that these work.
>
> > listA = {4, 5, 8, 2, 6, 4};
> > listB = {8, 4, 2};
>
> > In[85]:= firstPositions1[listA, listB] ==
> >   firstPositions2[listA, listB] ==
> >   IntegerDigits[IntegerPart[100*Pi]]
>
> > Out[85]= True
>
> > If the second list has 10 or so elements, the second method is noticeably slower than the first. But it does in fact scale much better. Here we compare speed when second list is 1000 elements (not necessarily unique).
>
> > l1 = RandomInteger[10^6, 5*10^5];
> > l2 = RandomInteger[10^6, 10^3];
>
> > In[73]:= Timing[pos1 = firstPositions1[l1, l2];]
> > Out[73]= {68.11000000000001, Null}
>
> > In[74]:= Timing[pos2 = firstPositions2[l1, l2];]
> > Out[74]= {1.933999999999969, Null}
>
> > Notice there is a difference in behavior beyond speed. If the second list contains values not found in the first, the second result will have zeros in those corresponding slots. I'm guessing this is probably desirable behavior, as compared to omitting them altogether (since you won't otherwise know what element in result corresponds to what element in second list).
>
> > In[75]:= pos1 == DeleteCases[pos2, 0]
> > Out[75]= True
>
> > In[76]:= Length[pos1]
> > Out[76]= 379
>
> > Daniel Lichtblau
> > Wolfram Research
>
> If we assume that all of the list elements are positive integers, then
> the following is much faster:
>
> firsts[l1_, l2_] :=  Normal[SparseArray[Automatic, {Max[l1]}, 0, {1,
> {{0, Length[l1]}, List /@ l1}, Range[Length[l1]]}][[l2]]]
>
> Example:
>
> l1 = RandomInteger[{1, 10^6}, 5*10^5];
> l2 = RandomInteger[{1, 10^6}, 10^3];
>
> In[130]:= r1 = firstPositions2[l1, l2]; // Timing
> r2 = firsts[l1, l2]; // Timing
> r1 === r2
>
> Out[130]= {2.387, Null}
>
> Out[131]= {0.109, Null}
>
> Out[132]= True
>
> Carl Woll
> Wolfram Research

Carl, I do not know enough about SparseArrays to fix the code myself,
but I notice this:

li1 = {4, 5, 8, 2, 6, 4, 1};
li2 = {8, 4, 2, 8, 5, 12};

firsts[li1, li2]

During evaluation of In[10]:= Part::partw: Part 12 of
SparseArray[Automatic,{8},0,{1,{{0,6},(4 5 8 2 6 1 )},{1,2,3,4,5,7}}]
[[{8,4,2,8,5,12}]] does not exist. >>
During evaluation of In[270]:= Part::partw: Part 12 of
{7,4,0,1,2,5,0,3} does not exist. >>
During evaluation of In[270]:= Part::partw: Part 12 of
{7,4,0,1,2,5,0,3} does not exist. >>
During evaluation of In[270]:= General::stop: Further output of
Part::partw will be suppressed during this calculation. >>
Out[270]= {7,4,0,1,2,5,0,3}[[{8,4,2,8,5,12}]]

So, the method appears to work only if li2 is a subset of li1.
Is it possible to fix this?

ADL

Prev by Date: Re: newbie list question

Next by Date: Re: newbie list question

Previous by thread: Re: newbie list question

Next by thread: Re: newbie list question