Canonical transformations and generating functions

In summary, the author discusses canonical transformations in Hamiltonian mechanics and notes that the integrands of the corresponding actions should differ by at most a total time derivative of the generating function, leading to the equation (1). They then re-express this equation as (2), which implies that the variables ##p## and ##P## are now expressed in terms of the generating function, and the new Hamiltonian is given by the original Hamiltonian plus the total time derivative of the generating function. The author also clarifies that in order for this to hold, the coefficients of the independent variables must be identically zero.
  • #1
Frank Castle
580
23
I've been reading about canonical transformations in Hamiltonian mechanics and I'm a bit confused about the following:

The author considers a canonical transformation $$q\quad\rightarrow\quad Q\quad ,\quad p\quad\rightarrow\quad P$$ generated by some function ##G##. He then considers the case in which ##G## is a function of the (independent) variables ##(p,Q)## and notes that the integrands of the corresponding actions should differ by at most a total time derivative of ##G##, such that $$p\dot{q}-H(q,p)=P\dot{Q}-\mathcal{H}(Q,P)+\frac{dG(q,Q)}{dt}\qquad (1)$$ Noting that ##\frac{dG(q,Q)}{dt}=\frac{\partial G}{\partial q}\dot{q}+\frac{\partial G}{\partial Q}\dot{Q}+\frac{\partial G}{\partial t}##, he re-expresses ##(1)## as $$\left(p-\frac{\partial G}{\partial q}\right)\dot{q}-\left(P+\frac{\partial G}{\partial Q}\right)\dot{Q}=H(q,p)-\mathcal{H}(Q,P)+\frac{dG(q,Q)}{dt}\qquad (2)$$ and then immediately states that it follows that the remaining variables ##(p,P)## are now expressed by $$p=\frac{\partial G}{\partial q}\quad ,\quad P=-\frac{\partial G}{\partial Q}$$ and that the new Hamiltonian is given by $$\mathcal{H}(Q,P)=H(q,p)+\frac{dG(q,Q)}{dt}.$$

I get that this has something to do with the fact that ##q## and ##Q## are independent variables, but it is not immediately obvious to me why the above relations follow (i.e. why the coefficients on the left hand side vanish)?!

Is it simply due to the fact that the only way the time derivatives of ##p## and ##Q## can be related in general is if their coefficients are identically zero (since ##\dot{q}## and ##\dot{Q}## can vary independently of one another, but equation ##(2)## is an identity and so the only what that both sides can be equal for all values of ##\dot{q}## and ##\dot{Q}## is if their corresponding coefficients are identically zero)?
 
Physics news on Phys.org
  • #2
Is it simply due to the fact that ...

Yes.
 
  • #3
dextercioby said:
Yes.

Ah ok, cool. Thanks.

@dextercioby As an aside, is it correct to say that, at a given instant in time, ##t##, the function ##q(t)## and its derivative ##\dot{q}(t)## are independent of one another (since one would need to know the value of ##q## at two different times, either side of ##t##, in order to determine the value of ##\dot{q}## at ##t##, entirely from the knowledge of ##q##)?!
 
Last edited:
  • #4
Well (2), and consequently the equation for the Hamiltonian are wrong. More generally you have an explicitly time-dependent generator, ##G##, and then (1) leads to
$$p \cdot{q}-H=P \dot{Q}-H'+\frac{\mathrm{d}}{\mathrm{d} t} G=P \dot{Q}-H'+\dot{Q} \partial_Q G+ \dot{q} \partial_q G + \partial_t G,$$
where the latter partial time derivative means the derivative due to the explicit time dependence. Now writing this as a total differential and ordering you get
$$(p-\partial_q G) \mathrm{d} q - (P+\partial_Q G) \mathrm{d} Q + (H'-H -\partial_t G) \mathrm{d} t=0.$$
Now taking ##q##, ##Q##, and ##t## as the independent variables as "natural" for this choice of the generator, this implies that
$$p=\partial_q G, \quad P=-\partial_Q G, \quad H'=H+\partial_t G.$$
 
  • #5
vanhees71 said:
Well (2), and consequently the equation for the Hamiltonian are wrong. More generally you have an explicitly time-dependent generator, ##G##, and then (1) leads to
$$p \cdot{q}-H=P \dot{Q}-H'+\frac{\mathrm{d}}{\mathrm{d} t} G=P \dot{Q}-H'+\dot{Q} \partial_Q G+ \dot{q} \partial_q G + \partial_t G,$$
where the latter partial time derivative means the derivative due to the explicit time dependence. Now writing this as a total differential and ordering you get
$$(p-\partial_q G) \mathrm{d} q - (P+\partial_Q G) \mathrm{d} Q + (H'-H -\partial_t G) \mathrm{d} t=0.$$
Now taking ##q##, ##Q##, and ##t## as the independent variables as "natural" for this choice of the generator, this implies that
$$p=\partial_q G, \quad P=-\partial_Q G, \quad H'=H+\partial_t G.$$

Whoops, yes I had meant to write (2) with a partial derivative (I copied and pasted and forgot to change the LaTex).

Is the point that since ##q##, ##Q## and ##t## are independent, their respective differentials ##dq##, ##dQ## and ##dt## will be mutually independent and so the only way they can be related by the equation $$(p-\partial_q G) \mathrm{d} q - (P+\partial_Q G) \mathrm{d} Q + (H'-H -\partial_t G) \mathrm{d} t=0$$ is if their coefficients are identically zero?!

Is it also correct to say that ##q## and ##\dot{q}##, evaluated at a given instant in time, ##\tau## are mutually independent, since one would require the knowledge of the values of ##q## at an earlier (or later) time as well as at time ##\tau## in order to determine the value of ##\dot{q}## at ##\tau## solely from ##q##?!
 
  • #6
Yes, in the Lagrange formalism ##q## and ##\dot{q}## are independent variables. Only when inserting a trajectory ##q=q(t)## you get ##\dot{q}=\mathrm{d} q/\mathrm{d} t##. This is often leading to confusion, and that's why it is so important to distinguish the meaning between ##\partial_t## and ##\mathrm{d}/\mathrm{d} t##.

In the Hamiltonian formalism you trade the ##\dot{q}## by the canonical momenta ##p##. The Lagrange formalism and the Hamilton formalism are connected by a Legendre transformation.
 
  • #7
vanhees71 said:
Yes, in the Lagrange formalism qq and ˙q\dot{q} are independent variables. Only when inserting a trajectory q=q(t)q=q(t) you get ˙q=dq/dt\dot{q}=\mathrm{d} q/\mathrm{d} t. This is often leading to confusion, and that's why it is so important to distinguish the meaning between ∂t\partial_t and d/dt\mathrm{d}/\mathrm{d} t.

In the Lagrange formalism though isn't the point that one considers points, ##(q,\dot{q})## in the tangent bundle over some manifold, where ##q## and ##\dot{q}## are points in the base manifold and the tangent space over ##q## respectively. As such they can (locally) be varied independently of one another (one can choose a tangent vector, ##\dot{q}## and then a point ##q##, or vice verse - the choice of one does not affect the choice of the other). It is only when one chooses to evaluate them along a trajectory in the tangent bundle, such that ##(q,\dot{q})=\left(q(t),\dot{q}(t)\right)## and further stipulates that the trajectory be the tangent lift of some trajectory in the base manifold, such that ##\dot{q}(t)=\frac{dq(t)}{dt}## (such that it corresponds to a physical trajectory), that the two variables become connected?!

I think what has confused me in this case is that the author has already considered ##q## and ##Q## as functions of ##t## and then taken their time derivatives, ##\dot{q}=\frac{dq}{dt}## and ##\dot{Q}=\frac{dQ}{dt}##. In this case it doesn't seem immediately obvious, in the form that is written in the text, $$
\left(p-\frac{\partial G}{\partial q}\right)\dot{q}-\left(P+\frac{\partial G}{\partial Q}\right)\dot{Q}=H(q,p)-\mathcal{H}(Q,P)+\frac{\partial G(q,Q)}{\partial t}\qquad$$ that the coefficients of ##\frac{dq}{dt}## and ##\frac{dQ}{dt}## must vanish identically?! I get that they are independent of one another and hence their derivatives should not (in general) be related, but what the right hand side of the equation is a function of ##q## and ##Q## and so couldn't the left hand side be related to the right in some non-trivial way (since ##q## and ##Q## and ##\dot{q}## and ##\dot{Q}## are related)?!
 

Related to Canonical transformations and generating functions

1. What are canonical transformations?

Canonical transformations are mathematical transformations that preserve the equations of motion in classical mechanics. They are used to transform the coordinates and momenta of a system into a new set of variables, while still describing the same physical system.

2. How are canonical transformations related to generating functions?

Generating functions are mathematical functions that are used to generate new coordinates and momenta in a canonical transformation. They are used to simplify the process of finding the new variables, and can also help determine the form of the transformation equations.

3. What is the importance of canonical transformations in physics?

Canonical transformations are important in physics because they allow us to simplify complicated equations of motion, making it easier to analyze and solve problems. They also help to reveal symmetries and conserved quantities in a system, providing insight into the underlying physical principles.

4. Can canonical transformations be applied to any physical system?

Yes, canonical transformations can be applied to any physical system described by classical mechanics. However, they are not applicable to quantum mechanical systems.

5. Are there any limitations to canonical transformations?

One limitation of canonical transformations is that they can only be applied to systems with a finite number of degrees of freedom. Additionally, some transformations may not be physically meaningful or may lead to equations that are difficult to solve.

Similar threads

Replies
3
Views
612
  • Classical Physics
Replies
21
Views
1K
Replies
4
Views
643
  • Classical Physics
Replies
7
Views
914
  • Classical Physics
Replies
4
Views
742
Replies
7
Views
1K
Replies
2
Views
3K
Replies
1
Views
970
Replies
7
Views
824
  • Classical Physics
Replies
0
Views
227
Back
Top