MathGroup Archive: July 1998 [00247]

[Date Index] [Thread Index] [Author Index]
Re: Re: Derivatives D[ ] as Functions: Summary (LONGISH)
To: mathgroup at smc.vnet.net
Subject: [mg13333] Re: [mg13253] Re: Derivatives D[ ] as Functions: Summary (LONGISH)
From: David Withoff <withoff>
Date: Mon, 20 Jul 1998 02:49:54 -0400
Sender: owner-wri-mathgroup at wolfram.com
> I recently posed the question of defining a function f2 as the
> derivative of another function f1, e.g.
> 
>      Remove["Global`*"];
> 
>      f1[a_,x_] :=a Cos[x] + a^2 Sin[x];
>      f2[a_,x_] := D[f1[a,x],x];
>      
> and then finding that f2 would not behave "as expected" in a subsequent 
> Plot[ ] or Table[ ].
> 
> Thanks to several people for their responses, and apologies for my 
> several typos in earlier postings.  It's clear what my difficulty was. 

So what was the difficulty?  I wasn't sure what you meant when you said
that f2 didn't behave as expected.  I don't know what you tried, or
what happened, or what you were expecting.  Unless the guesses below
address the problem, could you fill us in?

My initial guess was that you tried something like

In[1]:= f1[a_,x_] :=a Cos[x] + a^2 Sin[x]

In[2]:= f2[a_,x_] := D[f1[a,x],x]

In[3]:= Table[f2[1,x],{x,2}]

General::ivar: 1 is not a valid variable.

General::ivar: 2 is not a valid variable.

Out[3]= {D[Cos[1] + Sin[1], 1], D[Cos[2] + Sin[2], 2]}

which fails because the example has been set up so that Table inserts
numerical values for the differentiation variable before the
differentiation can be completed.  It's a question of the order in
which things happen.  This particular problem is resolved by doing the
differentiation first, as in

In[4]:= f2[a_,x_] = D[f1[a,x],x]

         2
Out[4]= a  Cos[x] - a Sin[x]

In[5]:= Table[f2[1,x],{x,2}]

Out[5]= {Cos[1] - Sin[1], Cos[2] - Sin[2]}

What left me puzzled, and led me to believe that I must have
misunderstood the question, was the subsequent question about FullForm.
Based on the above understanding of the question, FullForm is very
nearly irrelevant here, and I wasn't able to think of any other
interpretation of your question in which FullForm might be useful. 

With the following remark I am even more puzzled:

> However, let's look at the general issue a little further.  If for
> example we do the above and then look at the FullForm version of the
> two functions, they look very similar:
> 
>      f1[a,x]//FullForm
> 
>               Plus[Times[a,Cos[x]],Times[Power[a,2],Sin[x]]]
> 
>      f2[a,x] // FullForm
> 
>               Plus[Times[Power[a,2],Cos[x]],Times[-1,a,Sin[x]]]
> 
> Note that these FullForm results, if not misleading, certainly give no
> clue that f2 has a different  character than f1 and will behave totally
> differently if you try to use it  in Plot[ ]  or Table[ ] .  

This doesn't make any sense to me.  The FullForm looks quite explicit,
and tells me exactly how this expression will be treated in Plot or
Table.

My guess now is that my original interpretation of your question was
correct, and that all of the discussion of FullForm is not only
irrelevant, it has served to make a very simple and straightforward
issue appear to be something deep and difficult.

My guess is that this is an issue of evaluation order -- it is necessary
to evaluate the derivative before inserting numerical values for the
differentiation variable, and problems will come up (there will be
attempts to differentiate with respect to a number) if Table, Plot, or
some other operation replaces the differentiation variable by a number
before the differentiation is done.

On an unrelated issue:

>  A general and frequent and serious problem with Mathematica, in fact,
> is that it  often does things that may be totally logical and even
> necessary by its  rules, but that can be mysterious, non-intuitive, and
> frustrating  to ordinary users; and the nature of the beast often makes
> these problems obscure and hard  to find. 
> 
> [As another example of Mathematica's penchant for obscure difficulties, 
> note that if you follow the usual practice that you are allowed to use
> for (all? most?) other compound  expressions and include the "Remove[
> ]" expression in the same cell as  the two function definitions above,
> the two definitions will be Removed, even  though they come AFTER the
> Remove[ ].  Tell me why this makes sense? -- not what combination of
> rules makes Remove[ ] function this way, but why setting it up in this
> way makes sense?  Also, tell me where you're warned  about this in the
> Mathematica Book?]

One answer to those questions is that that is just the way that computer
languages work.  Input is processed in two stages.  In the first stage
the computer figures out what the input means, and in the second stage
the computer does what it was asked to do.  Anything that happens in
the first stage will inevitably be done before anything that happens in
the second stage.  It's another question of the order in which things
happen.

Consider, for example, processing the input x+y.  After reading only the
"x", or only the "x+", the computer still won't know what the input
means.  It needs to read all the way to the "y", and detect that there
is nothing after the "y", before it knows what you want to do.  After
reading the complete input "x+y", the computer knows what the input
means. Now it can go back and do what it was asked to do: look up the
value of x, look up the value of y, and add them.  The essential point
of this example is that, even though "y" comes after "x" in this input,
the computer looks at "y" *before* it looks up the value of x, and
anything that the computer does with the "y" when it first looks at it
will happen *before* the computer evaluates x.

That is also what happens in an input such as Remove[x];x=3.  This is an
ambiguous input.  If the computer processes this input all at once,
then it seems reasonable for the computer to guess that the first x
refers to the same thing as the second x.  That is the way that
Mathematica works. One can entertain a variety of alternate designs --
parsing and evaluating the compound expression in pieces, deferring the
creation of a symbol to represent "x", and so forth -- but all of these
alternatives are more complicated, and none of them completely
eliminate the problem.

> [Another general observation is that when problems -- or anyway
> "apparent problems" -- with Mathematica's ways of doing things are
> pointed out, I find it's much more common to get a detailed explanation
> of why Mathematica does what it does, rather than any discussion of the
> design choices, or design necessities, that make it necessary -- if it
> is necessary -- for Mathematica to function in this way.]

In Mathematica, I would describe all of the behaviors mentioned here as
"design necessities", in the sense that, while there are other ways to
do these things, none of the alternatives are very appealing.

Dave Withoff
Wolfram Research
Prev by Date: Re: Re: Re: coordinate transformation
Next by Date: Re: Re: Re: coordinate transformation
Previous by thread: Re: Derivatives D[ ] as Functions: Summary (LONGISH)
Next by thread: Re: Re: Derivatives D[ ] as Functions: Summary (LONGISH)