Understanding the total derivative in terms of analysis

In summary, the conversation is about the concept of Lagrangian mechanics and the difficulty in understanding the notation and interpretation of total derivatives. The participants discuss the idea that the x_i's in the notation may not actually be functions and the importance of rigor in understanding the problem. They also mention the potential energy function and how it can be expressed as a composition of functions. However, there is a disagreement about whether x can be considered a function or not. The conversation ends with the original poster expressing their appreciation for the responses and acknowledging their unique background.
  • #1
Tac-Tics
816
7
I'm trying to work through an introduction to Lagrangian mechanics. I get the idea. You have a particle at point A traveling to point B. You have a functional which maps every path between those points to a scalar called the action of the path. The universe then does some number crunching, figures out which path has attributed to it the smallest amount of action, and the particle takes that path. Conceptually, not hard.

Physicst's calculus notation left over from the 17- and 1800s, on the other hand, is very hard. For me at least. I'm hoping to get a slightly more rigorous understanding of how a total derivative works.

So, let's say I have my functional defined by [tex]F[x] = \int_A^B L(x, \dot{x}, t)dt[/tex] where L is the Legrangian. We all learn in calculus that [tex]\frac{d}{dt}f = \frac{\partial f}{\partial t} + \Sigma_i \frac{\partial f}{\partial x_i}\frac{dx_i}{dt}[/tex]. But the terms [tex]\frac{dx_i}{dt}[/tex] seem nonsensical to me. The [tex]x_i[/tex]'s aren't functions, and you can't take the derivative of them! The [tex]\partial x_i[/tex]'s in the denominators are a shrothand notation to indicate the direction of the derivative, or which variable you are taking the derivative with, holding the rest constant. How the heck should I interpret those [tex]\frac{dx_i}{dt}[/tex]'s?

I know, of course, that when working with this kind of problem, the parameters to the function aren't totally independent in the problem. Once we admit that, though, L(x, y, z) is no longer a legitimate function following the definition of a function described by set theory. (And yes, I do realize that the theory of calculus came well before set theory came to dominate mathematics).

So what I'm looking for is a better "framework" for interpreting the total derivative. Clearly, the dependencies granted to the Legrangian by the definition of the functional must be taken into account. The total derivative would seem not to be an operation on the Legrangian itself, then, but rather, it would be an operation on the expression of the functional which resides under the integral sign.

Any thoughts on this would be greatly appreciated!
 
Physics news on Phys.org
  • #2
Tac-Tics said:
The [tex]x_i[/tex]'s aren't functions, and you can't take the derivative of them!

Who says the Xs aren't functions? There is no reason we can't say they depend on other variables. Suppose that we have a function f(x,y), why can't we have x=h(u,v) and y=g(u,v)

Here's a specific example: Potential energy.
Potential energy is a function of N x's where N is the number of degrees of freedom of a system. The Xs are functions themselves X(t)s. This is because POSITION MAY CHANGE WITH TIME. So x is definitely a function of time. Therefore, potential energy is a function composition.

For example, let's say a particle is falling from rest in a uniform gravitational field. It's position(from high school physics) is x(t)=-(1/2)gt^2, while it's potential energy is V(x(t))=mgx, we could calculate dV/dt by plugging the first equation into the second.
i.e. V(x(t))=-(1/2)m(gt)^2 is is a function of t and we could easily see that dV/dt= -mgt
which we could directly calculate by using the chain rule(try it yourself).

Wiki Articles-
Function composition: http://en.wikipedia.org/wiki/Function_composition"
Chain Rule proof: http://en.wikipedia.org/wiki/Chain_rule#Proof_of_the_chain_rule"
 
Last edited by a moderator:
  • #3
The [tex]x_i[/tex]'s may appear to vary in the context of the problem, but this is just a notational phenomenon. You are appealing to a physical explanation, as if each variable was a bead on a abacus which you could freely slide back and forth. But variables in a rigorous proof never vary. Once you say x is a real number, it can't be a function too. The real story of what's going on is deeper than function composition.
 
  • #4
Tac-Tics said:
Once you say x is a real number, it can't be a function too. The real story of what's going on is deeper than function composition.

Isn't x still a function even if it is a real number? Say x is some function x(t) (where t is just a dummy variable, but could represent time). Then x(t)=5 is a function that says x is constant and will always be equal to the real number 5. With this in mind, [tex]\frac{dx_i}{dt} [/tex] is always valid and may be 0.
 
  • #5
sir_manning said:
Isn't x still a function even if it is a real number?

It isn't. It might be a real-valued function, but that is a different kind of beast than a real number. In short, you have to apply an argument to it to get a real out. The value of f, by itself, is a function. The value of f(x) is a real. Most importantly, [tex]f \neq f(x)[/tex]!

You may call it splitting hairs, but I call it rigor ;-) When you work out a problem in physics, you can fudge the math and produce the correct answer. But I'm not trying to do a homework problem here, so I can afford to be as pedantic as I want in my analysis, as long as it helps my understanding.
(By the way, I want to say thanks to you guys for responding to my post at all. I have a somewhat unique background and I approach these problems with an entirely different toolset than someone going to school for physics or engineering. I know I'm a pain in the butt for the regulars. They like questions to be asked the same way as they are written in textbooks, but I enjoy taking my own approach to things).
 
  • #6
The position vector x is assumed to be a function of t, with each coordinate being a function. Why do you question the existence of [tex]\frac{dx_i}{dt}[/tex] when differentiating f with respect to t, but let it slide when in the integral above you have the derivative with respect to time of the position vector already?
 
Last edited:
  • #7
Office_Shredder said:
The position vector x is assumed to be a function of t, with each coordinate being a function. Why do you question the existence of [tex]\frac{dx_i}{dt}[/tex] when differentiating f with respect to t, but let it slide when in the integral above you have the derivative with respect to time of the position vector already?

Perhaps I have missed something. The definition of the total derivative seems to type properly as long as the respected variable is a scalar and the remaining variables are all functions.

I will sit and think about this problem for a bit. That may have been the insight I needed. Thanks a lot!
 
  • #8
Yes, x is a "number" but it may change with time if it is moving.
For example,
at t=0, x=1
at t=1, x=4
at t=2, x=16
at t=3, x=64
(Exercise: Find the probable equation of position. And then find the velocity function. Then determine the force acting on the particle with mass m.)



As you see, for a particle, with each instant of time, there is a single position corresponding to it. This is the definition of a function.
 
  • #9
There's nothing deep here. Let's consider the Lagrangian, which is supposed to be a function of two variables q and q'. What does it mean to say that the action S is the integral of the lagrangian over time? L is not a function of time, so how do we integrate it over time? If we have a path γ : R → C, where C is the configuration space, then at each instant of time t, we have a position γ(t), and a velocity γ'(t). Now we create a new function of time as follows: M(t) = L(γ(t), γ'(t)). The action for the path γ is now defined as S = ∫ M(t) dt. This is where mathematicians may feel a little uneasy when reading physics books, because physicists use the same name for the Lagrangian function L and what we have called M, and they also use q(t) for γ(t) and q'(t) for γ'(t) . So they write S = ∫ L(q,q') dt. The way we have introduced a time dependence into the Lagrangian function is called 'implicit' time dependence, and a total derivative is simply differentiation with respect to an implicit variable. So the total derivative Dt of the Lagrangian function L(q,q') with respect to t is

Dt L = dM/dt

where M is defined as above.

In general, if we have a time dependent lagrangian L(t,q,q'), then the associated 'M function' will be M(t) = L(t,γ(t),γ'(t)), and by the usual calculus rules, we get

Dt L = dM/dt = ∂L/∂t + (dγ/dt)(∂L/∂q) + (dγ'/dt)(∂L/∂q')

and switching to physicist notation (q for γ, L for M etc.), this becomes

Dt L = dL/dt = ∂L/∂t + (dq/dt)(∂L/∂q) + (dq'/dt)(∂L/∂q')

So, to summarize, you are right in saying q and q' are variables, not functions, but it is just convenient notation to use the same letters for varables q, q' and the functions γ(t), γ'(t), and the functions L(q,q') and M(t) = L(γ(t), γ'(t)).
 
Last edited:

Related to Understanding the total derivative in terms of analysis

What is the total derivative in terms of analysis?

The total derivative in terms of analysis is a mathematical concept that describes how small changes in one variable can affect the value of a function. It is used to analyze the behavior of functions and understand their behavior at specific points.

How is the total derivative calculated?

The total derivative is calculated by taking the partial derivatives of a function with respect to each of its variables and then combining them using the chain rule. This will give the total derivative of the function at a specific point.

What is the significance of the total derivative in terms of analysis?

The total derivative is significant in analysis because it allows us to understand the behavior of functions at specific points. It helps us determine whether a function is increasing or decreasing, and how sensitive it is to changes in its variables.

How does the total derivative relate to the concept of limits?

The total derivative is closely related to the concept of limits. In fact, the total derivative can be defined as the limit of the difference quotient. This means that as the difference between the two points approaches zero, the total derivative becomes the same as the limit of the difference quotient.

Can the total derivative be used to approximate a function?

Yes, the total derivative can be used to approximate a function at a specific point. This is because the total derivative gives us information about how the function changes at that point, which can be used to approximate the value of the function at nearby points.

Similar threads

Replies
4
Views
2K
Replies
5
Views
1K
Replies
3
Views
1K
Replies
12
Views
2K
  • Calculus
Replies
9
Views
1K
Replies
6
Views
1K
  • Calculus
Replies
3
Views
2K
Replies
4
Views
3K
Replies
1
Views
1K
Replies
19
Views
2K
Back
Top