Variations, Euler-Lagrange, and Stokes

In summary, the variation principle states that the variation of a dynamical variable is zero on a region defined by a set of boundaries.
  • #1
haushofer
Science Advisor
Insights Author
2,959
1,505
Hi, I have some questions which I encountered during my thesis-writing, I hope some-one can help me out on this :)

First, I have some problems interpreting coordinate-transformations ( "active and passive") and the derivation of the Equations of Motion. We have

[tex]
S = \int L(\phi, \partial_{\mu} \phi) d^{4}x
[/tex]

and

[tex]
\delta S = 0
[/tex]

, in which the variation of the field is arbitrary. My question is: how exactly is this variation defined? One has 2 options:

[tex]
\delta \phi = \phi^{'}(x^{'}) - \phi (x)
[/tex]
or
[tex]
\delta \phi = \phi^{'}(x) - \phi (x)
[/tex]

where the difference lies in the argument. In notes of Aldrovandi and Pereira ( Notes for a classical course on fields ) option 1 is choosen. And in Inverno they say that option 2 is a coordinate transformation. Why is that? I tend to choose for option two, because here you actually change the field; I would say that a scalar quantity is invariant under transformation 1, so here you just state general covariance in stead of obtaining the Equations of Motion. At the other hand, you can always choose x=x'. In the end I want to look at why one is able to commute variations and partial derivatives, so I need the exact definition of the variation of the field.

Another question concerns Stokes theorem. I understand the theorem totally for n-forms ( where you define the boundary of your region with chains etc ), but why is it also valid for example in the derivation of the Euler-Lagrange equations or tensor densities?

Many thanks in forward :)

-edit My TeX-code is not working properly for some reason, but I hope it is clear.
 
Last edited:
Physics news on Phys.org
  • #3
Ok, let me put it differently. I know about pull-backs, push-forwards, diffeomorphisms etc. I always regarded a push-forward of a vector as some sort of "generalized coordinatetransformation", but still as passively. Why is a push-forward an active transformation, and a coordinatetransformation just passive?
 
  • #4
I suggest a good textbook -- Analysis, Manifolds and Physics is good. It covers differential geometry over finite and infinite dimensional spaces, which are what you need for the formal treatment of variational calculus.
 
  • #5
genneth said:
I suggest a good textbook -- Analysis, Manifolds and Physics is good. It covers differential geometry over finite and infinite dimensional spaces, which are what you need for the formal treatment of variational calculus.

I followed a course on differential geometry, and have the idea that I have enough knowledge of understanding this, but that I'm overlooking something.
 
  • #6
haushofer said:
Hi, I have some questions which I encountered during my thesis-writing, I hope some-one can help me out on this :)

First, I have some problems interpreting coordinate-transformations ( "active and passive") and the derivation of the Equations of Motion. We have

[tex]
S = \int L(\phi, \partial_{\mu} \phi) d^{4}x
[/tex]

and

[tex]
\delta S = 0
[/tex]

, in which the variation of the field is arbitrary. My question is: how exactly is this variation defined? One has 2 options:

[tex]
\delta \phi = \phi^{'}(x^{'}) - \phi (x)
[/tex]
or
[tex]
\delta \phi = \phi^{'}(x) - \phi (x)
[/tex]

where the difference lies in the argument. In notes of Aldrovandi and Pereira ( Notes for a classical course on fields ) option 1 is choosen. And in Inverno they say that option 2 is a coordinate transformation. Why is that? I tend to choose for option two, because here you actually change the field; I would say that a scalar quantity is invariant under transformation 1, so here you just state general covariance in stead of obtaining the Equations of Motion. At the other hand, you can always choose x=x'. In the end I want to look at why one is able to commute variations and partial derivatives, so I need the exact definition of the variation of the field.

Coordinate transformations and (therefore) the transformation properties (tensorial characters) of the dynamical variables play no role in the formulation of the variation principle of stationary action:

[tex]\delta \int_{D} d^{4}x \mathcal{L}(x) = 0[/tex]

We obtain the E-L equation in D by varying the dynamical variables;

[tex]\bar{u}_{i}(x) = u_{i}(x) + \delta u_{i}(x)[/tex]

and assuming that the variation, [itex]\delta u_{i}[/itex], vanishes on [itex]\partial D[/itex] (the surface of the 4-volume over which the integral is taken) but arbitrary in D.

In this case, it is trivial to verify that

[tex]\left[ \delta , \partial \right] = 0[/tex]

The variation principle leads to an expression of the form;

[tex]
0 = \int_{D} d^{4}x \ \frac{\delta \mathcal{L}}{\delta u_{i}} \ \delta u_{i} + \int_{D} d^{4}x \ \partial_{a} \{ F^{ai}(u) \delta u_{i} \}
[/tex]

But

[tex]\int_{D} d^{4}x \ \partial_{a} \{ F^{ai}(u) \delta u_{i} \} = \int_{\partial D} d \Sigma_{a} \ F^{ai}(u) \ \delta u_{i}[/tex]

and, since

[tex] \delta u_{i} = 0 \ \mbox{on} \ \partial D [/tex]

Thus

[tex]\int_{D} d^{4}x \ \frac{ \delta \mathcal{L}}{\delta u_{i}} \ \delta u_{i} = 0[/tex]

Now, since the variation, [itex]\delta u_{i}[/itex], are arbitrary in D, hence

[tex] \frac{\delta \mathcal{L}}{\delta u_{i}} \equiv \frac{\partial \mathcal{L}}{\partial u_{i}} - \partial_{a} \left( \frac{\partial \mathcal{L}}{\partial \partial_{a} u_{i}} \right) = 0[/tex]

If you want to see how the action integral changes under some group of coordinate transformations then you do need to know how the dynamical variables transform under the given group, i.e., what representation do they carry.

Consider an arbitrary infinitesimal transformations;

[tex]\bar{x}^{a} = x^{a} + \delta x^{a} \ \ 1[/tex]
[tex] \delta x^{a} = \omega_{A} f^{Aa}(x) \ \ 1'[/tex]

where A is a multi-index taken value in {0,1,2,...}, i.e., A = no-index, single-index, double-index, ..., [itex]\omega_{A}[/itex] is the parameter of the transformations, and [itex]f^{Aa}(x)[/itex] is an arbitrary function.

Normally, the dynamical variables belong to some finite dimensional (matrix) representation of the group;

[tex]\bar{u}_{i}(\bar{x}) = R_{i}{}^{j}(\omega) u_{j}(x) \ \ 2[/tex]

The infinitesimal version of this law is obtained by expanding the transformation matrix, near the identity, to 1st order in the parameter;

[tex] R_{i}{}^{j} = \delta^{j}_{i} + \omega_{A} D_{i}^{Aj} \ \ 3[/tex]

where [itex]D^{A}[/itex] are set of matrices satisfying the Lie algebra of the group. They act on the space of the dynamical variables. for example, if we are dealing with Lorentz group, then [itex]D^{A}[/itex] is the appropriate spin matrix for [itex]u_{i}(x)[/itex].

From (3) and (2) we find

[tex]
\delta^{\ast} u_{i}(x) \equiv \bar{u}_{i}(\bar{x}) - u_{i}(x) = \omega_{A} D_{i}^{Aj} u_{j}(x) \ \ 4
[/tex]

So, [itex]\delta^{\ast} u_{i}[/itex] describes an infinitesimal change in u brought about by the coordinate transformations. Notice that [itex]x[/itex] and [itex]\bar{x}[/itex] represent the same geometrical point, i.e., [itex]\delta^{\ast}[/itex] compares two functions at the same point, hence the name local variation.

Since [itex]u_{i}[/itex] and [itex]\partial u_{i}[/itex] belong to different representation matrices, then [itex]\delta^{\ast}[/itex] does not, in general, commute with [itex]\partial[/itex]. Indeed you can show that

[tex]\left[ \partial_{a} , \delta^{\ast} \right] = \omega_{A} \partial_{a} f^{Ab} \partial_{b} \ \ 5[/tex]

This vanishes only for the translation group ( f(x) is constant).

We can also define a non-local variation symbol, [itex]\delta[/itex], that compares the two functions [itex]( u_{i}, \bar{u}_{i})[/itex] at two different points;

[tex]\delta_{PM} u_{i} \equiv \bar{u}_{i}(P) - u_{i}(M)[/tex]

If the two points have the same value of the coordinate x, then

[tex]
\delta u_{i}(x) = \bar{u}_{i}(\bar{x})|_{\bar{x} = x} - u_{i}(x) = \bar{u}_{i}(x) - u_{i}(x)
[/tex]

represents the infinitesimal change of "shape" of the dynamical variable. It is easy to see that this functional-form-variation is related to the local variation by a "drag";

[tex]\delta u_{i} = - \delta x^{a} \partial_{a} u_{i} + \delta^{\ast} u_{i}[/tex]

Indeed, for scalar, vector and tensor fields, [itex] \delta[/itex] is nothing but a Lie derivative.
I prefer the name total variation; notice the orbital and the spin contributions in;

[tex]\delta u_{i}(x) = - \omega_{A} \{ f^{Aa}(x) \partial_{a} u_{i} - D_{i}^{Aj} u_{j} \}[/tex]

One can also show that

[tex] \delta u = \left[ \omega_{A} G^{A} , u \right][/tex]

where the G's are the infinitesimal generatores of the group in question.

For more details, see post #9 and #12 in

https://www.physicsforums.com/showthread.php?t=172461

regards

sam
 
Last edited:
  • #7
samalkhaiat said:
haushofer said:
Coordinate transformations and (therefore) the transformation properties (tensorial characters) of the dynamical variables play no role in the formulation of the variation principle of stationary action:

[tex]\delta \int_{D} d^{4}x \mathcal{L}(x) = 0[/tex]

We obtain the E-L equation in D by varying the dynamical variables;

[tex]\bar{u}_{i}(x) = u_{i}(x) + \delta u_{i}(x)[/tex]

and assuming that the variation, [itex]\delta u_{i}[/itex], vanishes on [itex]\partial D[/itex] (the surface of the 4-volume over which the integral is taken) but arbitrary in D.

In this case, it is trivial to verify that

[tex]\left[ \delta , \partial \right] = 0[/tex]

The variation principle leads to an expression of the form;

[tex]
0 = \int_{D} d^{4}x \ \frac{\delta \mathcal{L}}{\delta u_{i}} \ \delta u_{i} + \int_{D} d^{4}x \ \partial_{a} \{ F^{ai}(u) \delta u_{i} \}
[/tex]

But

[tex]\int_{D} d^{4}x \ \partial_{a} \{ F^{ai}(u) \delta u_{i} \} = \int_{\partial D} d \Sigma_{a} \ F^{ai}(u) \ \delta u_{i}[/tex]

and, since

[tex] \delta u_{i} = 0 \ \mbox{on} \ \partial D [/tex]

Thus

[tex]\int_{D} d^{4}x \ \frac{ \delta \mathcal{L}}{\delta u_{i}} \ \delta u_{i} = 0[/tex]

Now, since the variation, [itex]\delta u_{i}[/itex], are arbitrary in D, hence

[tex] \frac{\delta \mathcal{L}}{\delta u_{i}} \equiv \frac{\partial \mathcal{L}}{\partial u_{i}} - \partial_{a} \left( \frac{\partial \mathcal{L}}{\partial \partial_{a} u_{i}} \right) = 0[/tex]

If you want to see how the action integral changes under some group of coordinate transformations then you do need to know how the dynamical variables transform under the given group, i.e., what representation do they carry.

Consider an arbitrary infinitesimal transformations;

[tex]\bar{x}^{a} = x^{a} + \delta x^{a} \ \ 1[/tex]
[tex] \delta x^{a} = \omega_{A} f^{Aa}(x) \ \ 1'[/tex]

where A is a multi-index taken value in {0,1,2,...}, i.e., A = no-index, single-index, double-index, ..., [itex]\omega_{A}[/itex] is the parameter of the transformations, and [itex]f^{Aa}(x)[/itex] is an arbitrary function.

Normally, the dynamical variables belong to some finite dimensional (matrix) representation of the group;

[tex]\bar{u}_{i}(\bar{x}) = R_{i}{}^{j}(\omega) u_{j}(x) \ \ 2[/tex]

The infinitesimal version of this law is obtained by expanding the transformation matrix, near the identity, to 1st order in the parameter;

[tex] R_{i}{}^{j} = \delta^{j}_{i} + \omega_{A} D_{i}^{Aj} \ \ 3[/tex]

where [itex]D^{A}[/itex] are set of matrices satisfying the Lie algebra of the group. They act on the space of the dynamical variables. for example, if we are dealing with Lorentz group, then [itex]D^{A}[/itex] is the appropriate spin matrix for [itex]u_{i}(x)[/itex].

From (3) and (2) we find

[tex]
\delta^{\ast} u_{i}(x) \equiv \bar{u}_{i}(\bar{x}) - u_{i}(x) = \omega_{A} D_{i}^{Aj} u_{j}(x) \ \ 4
[/tex]

So, [itex]\delta^{\ast} u_{i}[/itex] describes an infinitesimal change in u brought about by the coordinate transformations. Notice that [itex]x[/itex] and [itex]\bar{x}[/itex] represent the same geometrical point, i.e., [itex]\delta^{\ast}[/itex] compares two functions at the same point, hence the name local variation.

Since [itex]u_{i}[/itex] and [itex]\partial u_{i}[/itex] belong to different representation matrices, then [itex]\delta^{\ast}[/itex] does not, in general, commute with [itex]\partial[/itex]. Indeed you can show that

[tex]\left[ \partial_{a} , \delta^{\ast} \right] = \omega_{A} \partial_{a} f^{Ab} \partial_{b} \ \ 5[/tex]

This vanishes only for the translation group ( f(x) is constant).

We can also define a non-local variation symbol, [itex]\delta[/itex], that compares the two functions [itex]( u_{i}, \bar{u}_{i})[/itex] at two different points;

[tex]\delta_{PM} u_{i} \equiv \bar{u}_{i}(P) - u_{i}(M)[/tex]

If the two points have the same value of the coordinate x, then

[tex]
\delta u_{i}(x) = \bar{u}_{i}(\bar{x})|_{\bar{x} = x} - u_{i}(x) = \bar{u}_{i}(x) - u_{i}(x)
[/tex]

represents the infinitesimal change of "shape" of the dynamical variable. It is easy to see that this functional-form-variation is related to the local variation by a "drag";

[tex]\delta u_{i} = - \delta x^{a} \partial_{a} u_{i} + \delta^{\ast} u_{i}[/tex]

Indeed, for scalar, vector and tensor fields, [itex] \delta[/itex] is nothing but a Lie derivative.
I prefer the name total variation; notice the orbital and the spin contributions in;

[tex]\delta u_{i}(x) = - \omega_{A} \{ f^{Aa}(x) \partial_{a} u_{i} - D_{i}^{Aj} u_{j} \}[/tex]

One can also show that

[tex] \delta u = \left[ \omega_{A} G^{A} , u \right][/tex]

where the G's are the infinitesimal generatores of the group in question.

For more details, see post #9 and #12 in

https://www.physicsforums.com/showthread.php?t=172461

regards

sam

A late reply, but thank you very much for your effort, I will definitely look at it ! I have the feeling that this is something which is for many people quite trivial, and then it's hard to find a good explanation for it.
 

Related to Variations, Euler-Lagrange, and Stokes

1. What is the concept of variations in mathematics?

Variations are a mathematical concept used to find extreme values or optimal solutions for a given function. They involve finding the rate of change of a function with respect to small changes in its input variables.

2. How is the Euler-Lagrange equation used in mathematics?

The Euler-Lagrange equation is used to solve problems involving variations, particularly in calculus of variations. It is used to find the function that minimizes or maximizes a given integral, subject to certain constraints.

3. What is the significance of the Euler-Lagrange equation in physics?

The Euler-Lagrange equation has significant applications in physics, particularly in the fields of mechanics and quantum mechanics. It is used to derive the equations of motion for physical systems and to find the stationary states of quantum systems.

4. How does the Stokes theorem relate to the concepts of variations and Euler-Lagrange?

The Stokes theorem is a generalization of the fundamental theorem of calculus, which relates the integral of a function over a region to its boundary. It is used to derive the Euler-Lagrange equation in higher dimensions and has applications in solving problems involving variations in vector fields.

5. What are some common real-world applications of variations, Euler-Lagrange, and Stokes?

Variations, Euler-Lagrange, and Stokes have numerous real-world applications, particularly in fields such as physics, engineering, and economics. Examples include finding the optimal path for a rocket to reach its destination, determining the shape of a bridge that can withstand the most weight, and finding the most efficient route for a delivery truck to take.

Similar threads

  • Special and General Relativity
Replies
8
Views
187
  • Special and General Relativity
Replies
15
Views
3K
  • Special and General Relativity
Replies
4
Views
1K
  • Special and General Relativity
Replies
3
Views
2K
Replies
5
Views
538
  • Special and General Relativity
Replies
7
Views
1K
  • Advanced Physics Homework Help
Replies
6
Views
508
  • Special and General Relativity
Replies
11
Views
376
  • Special and General Relativity
Replies
9
Views
4K
Replies
1
Views
1K
Back
Top