From the Hamilton’s Variational Principle to the Hamilton ... · From the Hamilton’s Variational Principle to ... Recipe for solving problems in mechanics ... 4.6 Universality

A. La Rosa Lecture Notes PSU-Physics PH 411/511 ECE 598

I N T R O D U C T I O N T O Q U A N T U M M E C H A N I C S

________________________________________________________________________ From the Hamilton’s Variational Principle to the Hamilton Jacobi Equation

4.1. The Lagrange formulation and the Hamilton’s variational principle 4.1A Specification of the state of motion

4.1B Time evolution of a classical state: Hamilton’s variational principle Definition of the classical action The variational principle leads to the Newton’s Law The Lagrange equation of motion obtained from the variational principle

Example: The Lagrangian for a particle in an electromagnetic field 4.1C Constants of motion Cyclic coordinates and the conservation of the generalized momentum Lagrangian independent of time and the conservation of the Hamiltonian Case of a potential independent of the velocities: Hamiltonian is the

mechanical energy

4.2 The Hamilton formulation of mechanics 4.2A Legendre transformation

4.2B The Hamilton Equations of Motion Recipe for solving problems in mechanics Properties of the Hamiltonian 4.2C Finding constant of motion before calculating the motion itself Looking for functions whose Poisson bracket with the Hamiltonian vanishes Cyclic coordinates

4.2D The modified Hamilton’s principle: Derivation of the Hamilton’s equations from a variational principle.

4.3 The Poisson bracket 4.3A Hamiltonian equations in terms of the Poisson brackets 4.3B Fundamental brackets 4.3C The Poisson bracket theorem: Preserving the description of the classical

motion in terms of a Hamiltonian a) Example of motion described by no Hamiltonian b) Change of coordinates to attain a Hamiltonian description

4.4 Canonical transformations 4.4A Canonoid transformations (i.e. not quite canonical)

Preservation of the canonical equations with respect to a particular Hamiltonian)

4.4B Canonical transformations Definition Canonical transformation theorem Canonical transformation and the invariance of the Poisson bracket

4.4C Restricted canonical transformation

4.5 How to generate (restricted) canonical transformations

4.5A Generating function of transformations

4.5B Classification of (restricted) canonical transformations

4.5C Time evolution of a mechanics state viewed as series of canonical transformations

The generator of the identity transformation Infinitesimal transformations The Hamiltonian as a generating function of canonical transformations Time evolution of a mechanical state viewed as a canonical transformation

4.6 Universality of the Lagrangian 4.6A Invariant of the Lagrangian equation with respect to the configuration space

coordinates 4.6B The Lagrangian equation as an invariant operator

4.7 The Hamilton Jacobi equation The Hamilton principal function Further physical significance of the Hamilton principal function

From the Hamilton’s Variational Principle to the Hamilton Jacobi Equation

4.1 The Lagrange formulation and the Hamilton’s variational principle

4.1A Classical specification of the state of motion

The spatial configuration of a system composed by N point masses is completely described by

3N Cartesian coordinates (x1, y1, z1), (x2, y2, z2), …, (xN , yN , zN ).

If the system is subjected to constrains then the 3N Cartesian coordinates are not

independent variables. If n is the least number of variables necessary to specify the most general motion of the system, then the system is said to have n degrees of

freedom.

The configuration of a system with n degrees of freedom is fully specified by n generalized position-coordinates q1, q2, … , qn

The objective in classical mechanics is to find the trajectories,

q = q (t) = 1 , 2, 3, …, n (1)

or simply q = q (t) where q stands collectively for the set (q1, q2, … qn)

4.1B Time evolution of a classical state: Hamilton’s variational principle One of the most elegant ways of expressing the condition that determines the

particular path )(tq that a classical system will actually follow, out of all other possible

paths, is the Hamilton’s principle of least action, which is described below.

The classical action

One first expresses the Lagrangian L of the system in terms of the generalized

position and the generalized velocities coordinates q and q ( =1,2,…, n).

L( q , q , t ) (2)

Typically L=T-V, where T is the kinetic energy and V is the potential energy of the system. For example, for a particle of mass m moving in a potential V(x,t), the Lagrangian is,

L (x, x , t) = (1/2)m x 2 - V(x,t) (3)

Then, for a couple of fixed end points (a,t1) and (b, t2) the classical action S is defined as,

S (q ) ≡

)(

)(

2

1

t b,

ta,L( )(tq , )(tq , t ) dt Classical action (4)

Function or set of functions (n-dimension case)

Number (1-dimension case) or set of numbers (n-dimension case)

Notice S varies depending on the arbitrarily selected path q that joins the ending point a and b, at the corresponding times t1 and t2. See figure 1 below.

We emphasize that in (4), q stands for a function (or set of functions, in the n-dimension case) Indistinctly we will also call it a path

q(t) is a number (or set of numbers, in the n-dimension case) It is the value of the function q when the argument is t.

S is evaluated at a function), i.e. S (q)

L is evaluated at a number (or set of numbers, in the n-dimension case)

Hamilton’s variational principle for conservative systems

Out of all the possible paths that go from (a, t1) to (b, t2), the system takes only one.

On what basis such a path is chosen?

Answer: The path followed by the system is the one that makes the functional S an extreme (i. e. a maximum or a minimum).

S qq = 0 The variational principle (5)

That is, out of all possible paths by which the system could travel from an initial position at time t1 to a final position at time t2, it will actually travel along the path for which the integral is an extremum, whether a maximum or a minimum. The figure below shows the case for the one-dimensional motion.

x(t)

t t1 t2

xC

a

b

xC(t)

Figure 1. For a fixed points (a, t1) and (b, t2), among all the possible paths with the same end points, the path xC makes the action S an extremum.

Example: The variational principle leads to the Newton’s law Consider a particle moving under the influence of a conservative force F (be

gravitational force, spring force, …) whose associate potential is V (i.e. F = -dx

dV.)

Since the kinetic energy is given by T= (1/2) m x 2, where x means dx/dt, then

L(x, x , t) = (1/2) m x 2 - V(x), and

S(x) ≡ )(

)()(

2

1

tb,

ta,dtt,xx,L =

)(

)( 2

12

1

][ )(tb,

ta,dtxVxm 2 (6)

On the left side, S is evaluated on a path x.

On the right side, the values x(t) and x (t) have to be used inside the integral, but for simplicity we have just written x

and x respectively

Different paths x give, in general, different values for S (for fixed (a, t1) and (b, t2.)

Let’s assume xC is the path that makes the value of S extremum. One way to obtain

a more explicit form of xC is to probe expression (6) with a family of trial paths that are infinitesimally neighbors to that particular path. Let’s try, for example,

x( = xC + h (7)

Scalar

(parameter)

Function

which means

x(( t) = xC(t) + h(t) (7)’

where h(t) is an arbitrary function subjected to the condition,

h(t1)= h(t2) =0 (8)

and is an arbitrary scalar parameter,

In expression (7), when choosing small values for , x( becomes a path just a

“bit” different than the path xC.

Note: In the literature the path difference [x( - xC ] is sometimes called x.

Here we are opting to use h ( a number, and h a function) instead, just

to emphasize that [x(-xC] is a difference between two paths (and not the difference of just two numbers).

Evaluating expression (6) at the path x(( gives,

S(x() = )(

)(

2

1

t b,

t a,{ m

2

1 [ x C(t) + h (t) ]2

– V( xC(t ) + h(t ) ) }dt (9)

The condition that xC is an extremum becomes,

00

d

dS (10)

x(t)

t t1 t2

h(t)

a

b xC(t)

Figure 2. For an arbitrary function h, a parametric family of trial paths x(t ) =

xC(t ) + h(t ) is used to probe the action S given by expression (6).

From (6),

dα

dS

)(

)(

2

1

t b,

t a,{ m

2

12 [ x C(t) + h (t)] h (t) –V ’h(t) }dt

where V’ stands for the derivative of the potential V

= )(

)(

2

1

t b,

t a,{ m x C (t) h (t) – V ’ h(t) }dt +

)(

)(

2

1

t b,

t a, m [ h (t)]2dt

The last term will be cancelled when evaluated at =0.

0d

dS

)(

)(

2

1

t b,

t a,{ m x C (t) h (t) – V ’(xC) h(t) }dt (11)

The first term on the right side of the equality above can be integrated by parts,

)(

)(

2

1

t b,

t a,x C (t) h (t) dt = x C (t)

2

1

)(t

tth -

)(

)(

2

1

t b,

t a,x C (t) h(t) dt

According to (8), h(t) vanishes at t1 and t2 ; therefore the first terms on the right side vanishes,

)(

)(

2

1

t b,

t a,x C (t) h (t) dt = -

)(

)(

2

1

t b,

t a,x C (t)h(t) dt

Expression (11) becomes,

0d

dS –

)(

)(

2

1

t b,

t a,{ m x C (t) + V ’(xC)} h(t)dt (12)

Since this last expression has to be equal to zero for any function h(t), it must happen

that m x C (t) + V ’(xC) =0, or,

x C (t) = – dx

dV(xC) = F

That is,

S ≡ )(

)(

2

1

t b,

t a,t)dt,xL(x, =

)(

)( 2

11

1

][ )(t b,

t a,dtxVxm 2 ….

takes an extreme value when the path x(t) . (13)

satisfies the Newton’s Law x (t) = – dx

dV = F

(13)

In the next section, a key expression we will use through the derivation is the following:

Consider an arbitrary function, )( t,,SS vu ,

where )u,u,u( 321u and )v,v,v( 321v

For arbitrary increments , , and the value of

)( dt t , , S δvΔu

can be approximated by,

)( )( t,,Sdtt , , S vuδvΔu +

+ 3

3

2

2

1

1 uuu

SSS +

+ 3

3

2

2

1

1 vvv

SSS

+ dtt

S

Sometimes the following notation is used,

)u

,u

,u

( 321

SSSS

u

Accordingly,

)( )( t,,Sdtt , , S vuδvΔu + v

δu

Δ

SS + dt

t

S

(13)

The Lagrange equations of motion obtained from the Hamilton’s variational principle In a more general case, the system may be composed by n particles. Using,

x (t)= (x1(t), x2(t), … , xn(t), 1x (t), 2x (t), ,…, nx (t) )

the action is defined as

S( x

) ≡ )(

)(

2

1

t ,

t ,

b

aL( x (t), x (t) , t) dt (14)

where a and b are two fixed point in the configuration space. Using a family of trial functions of the form

x ( (t) = x (t) + h (t) (15)

where h (t1) = h (t2) = 0

x (t)= ( x1(t), x2(t), … , xn(t), 1x (t), 2x (t), ,…, nx (t) )

h (t)= ( h1(t), h2(t), … , hn(t), 1h (t), 2h (t), ,…, nh (t) )

we look for an extreme value of the function

S() = 2

1

t

tL( x (

(t), x ((t), t) dt (16)

From (15) and (16) one obtains,

d

dS

2

1

t

t{ i

ii

i

ii

hx

Lh

x

L

} dt

Notice, the term t

L

does not appear in the last expression.

Integrating by parts the second term,

d

dS

2

1

t

t{ i

ii

hx

L

} dt +

2

1

t

t

i

ii

hx

L

-

2

1

t

t{ i

ii

hx

L

dt

d)(

} dt

Since h

(t1) = h

(t2) = 0, one obtains

d

dS

2

1

t

t{ i

ii

hx

L

} dt -

2

1

t

t{ i

ii

hx

L

dt

d)(

} dt

= i

2

1

t

t{

ix

L

- )(

ix

L

dt

d

} ih dt

In the last expression ix

L

and )(

ix

L

dt

d

are evaluated at x

(t) + )(th

.

0d

dS

i

2

1

t

t{

ix

L

- )(

ix

L

dt

d

} )(thi dt

In the last expression ix

L

and )(

ix

L

dt

d

are evaluated at x

(t).

The variational principle requires that,

0d

dS

i

2

1

t

t{

ix

L

- )(

ix

L

dt

d

} )(thi dt = 0 (17)

for arbitrary trial functions )(thi .

This could be satisfied only if,

0x

L

x

L

dt

d

ii

for i=1,2, …, n. The Lagrange Equations (18)

These are second order equations. The motion is completely specified if the initial

values of the n coordinates xi and the n velocities ix are specified. That is, the xi and ix

form a complete set of 2n independent variables for describing the motion.

Remark: Hamilton’s variational principle involves physical quantities T

and V, which can be defined without reference to a particular set of generalized coordinates. The set of Lagrange equations is therefore invariant with respect to the choice of coordinates. !

A key expression we will use through the derivation is the following:

For an arbitrary function, )( t,,SS vu ,

3

3

2

2

1

1

uu

uu

uu

SSS

dt

dS

+ 3

3

2

2

1

1

vv

vv

vv

SSS+

+ t

S

Sometimes, the following notation is used )u

,u

,u

( 321

SSSS

u

Accordingly,

dt

dS

vv

uu

SS +

t

S

Example: The Lagrangian for a particle in an electromagnetic field This is a case where the force depend on the velocity,

)v ( BEF q (19)

Since the Maxwell’s equation states that 0 B , then B can be expressed in terms of

a vector potential )( t,xAA

AB (20)

(because the divergence of the rotational is identically equal to zero.)

On the other hand, the third Maxwell’s equation 0/ tBE can then be written

as 0)(

t

AE , or 0)/( tAE . The quantity with vanishing curl can be

written as the gradient of some scalar function, that is t/AE , or

t

AE

(21)

where )( t,x

With these alternative expressions for E and B the equation of motion

)v ( BEx qm

can be expressed as,

t

qqm

Ax + )( Ax q (22)

It can be demonstrated (see homework assignment) that,

)( Ax )( -

Ax

x

Ax (23)

Accordingly, the vector expression (22) can be expressed in term of the components,

t

A

xx

q

m

+ )(

Ax

x

Ax = 1, 2, 3

The second and the fourth terms on the right side constitute a

exact differential of A

x

+

dt

dA

x

Ax (24)

We have to figure out the Lagrangian L, such that the Lagrangian equations (18) lead to

(24). If there were no magnetic field, L would be )( )( 2xx qm1/2L . The magnetic

field introduces a term depending on the velocity. Hence, let’s try,

)()( 2

2

1t,,fqmL xxxx (25)

where ),,( 322 xxxx

x

f

xq

x

L

x

fxm

x

L

x

f

tx

f

x

fxm

x

L

dt

d

)()(

xx

xx

where x

stands for ),,(

321 xxx

x

stands for ),,(

321 xxx

0

x

L

x

L

dt

d

implies

x

f

tx

f

x

f

x

f

xqxm

)()(

xx

xx (26)

which should be compared with (24)

xm x

q

+

dt

dAq

xq

Ax

Notice the right side of (26) contains a term in x , which is not present in (24). Therefore

we require

x

f

)(

xx = 0

Or, equivalently,

x

f

x

for = 1, 2, 3 (27)

A particular solution of (27) is,

321 GxGxGxt,f 321 )( xGx (28)

Replacing (28) in (26),

Gt

Gxx

qxm

)(

xx

Gx

The last two terms on the right side constitute a

exact differential of G

Gdt

d

xxqxm

Gx (29)

which should be compared with (24),

xm x

q

+

dt

dAq

xq

Ax

A comparison between (24) and (29) indicates that AG q . Replacing this solution in

(28) gives,

)( t,qf xAx (30)

The Lagrangian in (25) is then given by,

)( )( )( 2

2

1t,qqmt,,L xAxxxxx (31)

Lagrangian for a particle in an electromagnetic field described by a scalar potential and vector potential A.

t

AE

AB

4.1C Constants of motion Cyclic coordinates and the conservation of the generalized momentum

There is another way to identify constant of motion. For example, if the Lagrangian

of the system does not contain a given coordinate x, the corresponding Lagrangian

equation 0

x

L

x

L

dt

d

becomes

0

x

L

dt

d

It means that the generalized momentum x

L

is constant

If a coordinate x does not appear in the Lagrangian, the variable is said to be cyclic.

p = constant for a cyclic coordinate (32)

Lagrangian independent of time and conservation of the Hamiltonian

Consider a Lagrangian that does not depend explicitly on time, i.e. t

L

= 0.

The total time derivative of

L L(x, x ) (33) is,

dt

dL

i

[ ix

L

dt

dxi +

ix

L

dt

xd i

]

From the Lagrange equation ii x

L

dt

d

x

L

, the previous equation can be written as

dt

dL

i

[ (ix

L

dt

d

)

dt

dxi +

ix

L

dt

xd i

]

i

[ (ix

L

dt

d

)

dt

dxi +

ix

L

dt

xd i

]

i

( )(i

ix

Lx

dt

d

dt

d

i

i

ix

Lx

This implies,

i

i

ix

Lx

L = constant (34)

when L does not depend on t explicitly.

We have just found a constant of motion.

The quantity on the left of expression (34) is called the Hamiltonian H of the system.

H ≡ i

i

ix

Lx

L (35)

A more rigorous definition of the Hamiltonian function will be given in the following sections.

Expression (34) indicates that, when L does not depend explicitly on time, the Hamiltonian quantity H is a constant of motion. Case of a potential independent of the velocities

For the case in which V = V(x),

L(x, x ) = T( x ) V(x)

= k

2

2

1kk xm V(x) (36)

ix

L

= ii xm

i

i

ix

Lx

=

i

2

ii xm

= 2 T (x)

H ≡ i

i

ix

Lx

L

= 2 T ( x ) L

= 2 T ( x ) ( T( x ) V(x) )

= T( x ) V(x) = Energy (37)

Thus, when the potential does not depend on the the velocity, and L does not depend on the time explicitly, H is the energy of the system and at the same time a constant of motion.

4.2 The Hamiltonian formulation of mechanics An alternative to the Lagrangian description of mechanics, outlined above, is the

Hamiltonia formulation. Instead of dealing with a set of n differential equations of 2nd-order given in (18), one is resorted to solve 2n differential equations of 1st order, as we will see below. However one may end up with a similar intensity of difficulty when solving the corresponding equations. The advantages of the Hamiltonian formulation lie not necessarily in its use as a calculation tool, but rather in the deeper insight it affords into the formal structure of mechanics.1 Its more abstract formulation is of interest because of their essential role in constructing the more modern theories in physics. In this course it is used as a point of departure for elaborating a quantum theory.

In this section we show that the new formulation is implemented through:

a change of variables (to be specified later),

(x1, x2, …, xn, 1x , 2x , …, nx , t) later specifiedbe to (x1, x2, … xn, p1, p2, … pn,, t)

and, instead of ),,( tL xx , the use another function

H=H(x1, x2, …, xn, p1, p2, …,pn, t) = H(x, p, t)

How to obtain such a transformation? How to choose H? To get an idea of what transformation is convenient (i. e. what particular combination between the ),,( txx and ),,( tpx variables is suitable for our purpose), let’s familiarize

with the following, much simpler, Legendre transformation

4.2A Legendre transformation Consider a physical quantity is described by a two-variable function

L=L(x, y) (32)

A differential change of L is given by,

dL =x

L

dx +

y

L

dy

= u dx + v dy (33)

It may happen that a description could more convenient in terms of x and y

L

.

Accordingly, we would like, then, to perform a change of variables

(x, y) (x, v) (34)

where v = y

y)(x,L

(We will assume that the relationship v = y

y)(x,L

allow us to put y in terms of x and v).

One way to obtain a quantity B (to replace L) whose differential-change could be expressed directly in terms of dx and dv (instead of dx and dy) is by defining B=B(x, v) as follows,

B(x, v) ≡ L(x, y) – y v (35)

= L(x, y) – y y

y)(x,L

Its corresponding differential is given by

dB = dL – y dv – v dy

using (33)

= u dx + v dy – y dv – v dy

dB = u dx – y dv (36)

(As we said above, we assumed that the definition v = y/yx, L )( allows to

put y in terms of x and v. Also, since u= x L/ , the above assumption ensures u can be put in terms of x and v as well.)

Thus, dB end ups being expressed in terms of the differentials dx and dv (the new independent variables,) which is what we wanted.

The last expression also gives,

u =x

B

and y = –

v

B (37)

In summary,

(x, y) (x, v)

(x, v) = (x, y/ L )

L(x, y) B(x, v)

B(x, v) ≡ L(x, y) – y v

= L(x, y) – y y/ L

dL = ( x L/ ) dx + ( y L/ )dy = u dx + v d y

dB = u dx – y dv

(38)

With this background, we will see below that the transformation to transit from the Lagrangian formulation to the Hamiltonian formulation is of the Legendre’s type. (Just identify the x variable with the y variable we have used in this section.)

4.2B The Hamilton Equations of Motion

The equation of motion of a system is described by a Lagrange function

),,( tLL xx , leading to a set of differential equations of second order,

0

ii x

L

x

L

dt

d

, i = 1, 2, …, n

Consider the following transformation of coordinates,

),,( ),,( tt pxxx

(39)

where ),,(

tx

L

i

ip xx

H L

),,( ),,( i tt LH px xx-pxi

i

(40)

xi has to be expressed in terms of ),,( tpx

) , ( ),,(),,( , tpx tt Li pxpx xx-i

i

The Hamiltonian function

On one hand,

),,( tHH px

implies,

dH = dtt

Hdp

p

Hdx

x

Hi

ii

i

ii

(41)

On the other hand, the expression for ),,( ),,( i tLt pxi

iH xx-px given in (40)

implies,

dH = dtt

Lxd

x

Ldx

x

Ldpxxdp i

ii

i

iii

ii

i

ii

)(ix

L

dtd

ip

)( ipdtd

ip

Thus, the first and fourth sum-terms cancel out

dH = dtt

Ldxpdpx ii

ii

ii

(42)

From (41) and (42) one obtains,


i

) , ( ),,(),,( , tpx tt Li pxpx xx-i

i

Hamilton canonical equations (2n first-order equations) (43)

i

ip

Hx

i

ix

Hp

i = 1, 2, …, n and

(44) t

L

t

H

Notice if L does not depend on t explicitly, neither does H.

Recipe for solving problems in mechanics

i) Set up the Lagrangian ),,( tL xx .

ii) Obtain the canonical momenta ),,(

tx

L

i

ip xx

iii) Obtain ),,( ),,( i tLt pxi

iH xx-px in terms of x

and p

.

Example. Two dimensional motion of a particle in a central potential V( r

) =V(r).

L( ),,, rr )= T - V = 2

1m ( 222 rr ) - V(r).

Using the definition (39),

r

Lpr

= m r and

Lp = m 2r

Using the definition (40),

),,,( ),,,( r rrLrr pppprH -

ppr r

2

1m ( 222 rr ) V(r).

The expressions r

Lpr

and

Lp are inverted to

express r and in terms of ),,,( ppr

p

pp

p

mrm

2rr

2

1m ( 2

2

22r )()(mr

rm

pp )

),,,( pprrH = 2

22

r

2

2 mrm

pp V(r).

Using (30) one obtains the equation of motion for each of the 4 independent variables

pprr ,,, .

rp

Hr

=

m

pr

p

H

= 2

mr

p

r

Hpr

=

r

V

Hp r = 0

Properties of the Hamiltonian The significance of the Hamiltonian was already observed in Section 4.1C, in which,

for the case when 0

t

L the Hamiltonian constitutes a constant of motion. Now that

we have a more formal definition of the Hamiltonian, given in expression (40), we can re-state it the following manner:

t

H

dt

dH

(45)

Proof: dt

dH

t

Hp

p

Hx

x

Hi

ii

i

ii

- ip ix

The first and second sum-term cancel out, which proves the statement. That is,

(59)

a Hamiltonian that does not depend explicitly on time constitutes a constant of motion

H is a constant of motion if . (46)

L does not depend explicitly on t.

This follows from the two general results (44) t

L

t

H

and (45)

t

H

dt

dH

,

Sin 0

t

L implies 0

t

H, and, hence 0

dt

dH.

For a Lagrangian L=T-V where the potential V is independent

of x

and the kinetic energy T is homogeneous quadratic (47)

in ,x

then H is the total energy

2)( ii

i

xkT and L then has the form )( )( 2x

VxkLi

ii

One obtains, ix

L

= ii xk 2 and

i

ii

i i

i xkx

Lx 2)(2

= 2T

H = i

i

i x

Lx

L = 2T – L= 2T – ( T – V )= T + V .

H may be a constant of the motion but not the energy (if T is not homogeneous quadratic.) .

H may be the energy but not constant (if 0

t

L). .

H may be neither the energy nor a constant of motion.

In the Hamiltonian approach,

A mechanical system is completely specified at any time by giving all the

px

and coordinates.

That is, the state of a system is specified by a point ) ,( px

in phase space.

The task is to find how this point moves in time, i.e. the motion in phase space

The initial condition tell us where in phase space the system starts, but it is the Hamiltonian which tells us, though the canonical equations (43), how it proceeds from there.

( p

,x )

t=to

t

phase space

Alternatively, the Hamiltonian determines all the possible motions the system can perform in phase space, the initial conditions picking out the particular motion which is the solution to a particular problem.

4.2C Finding constant of motion before calculating the motion itself

a) Looking for functions whose Poisson bracket with the Hamiltonian vanishes It is possible to use the Hamiltonian to determine directly how a given dynamical

function varies along the solution-motion, even before calculating the motion itself. (For example, determining whether that quantity is a constant of motion or not.) To that effect, let’s consider a dynamical function F= F ) ,( t,px

and calculate its total

differential change with time,

dt

dF

t

Fp

p

Fx

x

Fi

ii

i

ii

ip

H

ix

H

-

t

F

x

H

p

F

p

H

x

F

iiiiii

dt

dF

t

F

x

H

p

F

p

H

x

F

i iiii

(48)

The first term on the right side is called the Poisson bracket between F and H (to be described in more detailed in the sections below).

i iiii x

H

p

F

p

H

x

FHF, (49)

In terms of the Poisson bracket, the time dependence of a physical quantity is expressed as,

dt

dF HF,

t

F

(48)

A quantity F that does not depend explicitly on time, will be a constant of motion if the Poisson bracket between F and the Hamiltonian H vanishes. Certainly, one would have to have a lot of intuition to figure out such a function F. We will see later, however, that there exist systematic methods to find just that. Here we just want to show that the possibility of finding constant of motion without solving the equations.

To see how this works, let’s take the familiar example of the two dimensional, symmetric simple harmonic oscillator. 2 For simplicity take k=1 and m=1. Then,

)()(),,,,(2

2

2

1

2

2

2

121212

1

2

1xxxxtxxxxL (49)

ii xtxxxxx

L

i

p

),,,,( 2121 for i= 1,2 (50)

),,( ),,( tL - xt i

i

i pH xxpx

),,,,( ),,,,( 212122112121 txxxxL -x xtppxx ppH

2211 pp x x )()(2

2

2

1

2

2

2

12

1

2

1xxxx

where we have to replace the ix in terms of the ip given in (50)

2211 pppp )()(2

2

2

1

2

2

2

12

1

2

1xxpp

),,,,( 2121 tppxxH )()(2

2

2

1

2

2

2

12

1

2

1xxpp (51)

Without finding the explicit solution, let’s show that expression (48) tell us that the angular momentum F,

12212121 ),,,,( pxpxtppqqF angular momentum (52)

Is a constant of the motion.

First, let’s calculate,

1212

1111

xxppx

H

p

F

p

H

x

F

2121

2222

xxppx

H

p

F

p

H

x

F

0

t

F

which gives

t

F

x

H

p

F

p

H

x

F

iiii

2

1i

=0

Accordingly, expression (48) gives,

0 dt

dF (53)

That is, without explicitly calculating the solution, we know that, for this problem, the angular momentum is a constant of motion.

b) Cyclic coordinates According to the definition given in Section 4.1C, a cyclic coordinate xi is one that does

not appear in the Lagrangian. The Lagrangian equation ii x

L

dt

d

x

L

then implies that

the generalized momentum pi =ix

L

is a constant of motion. But the Hamiltonian

equation i

ix

Hp

implies also that in this case 0

ix

H; that is,

a coordinate the is cyclic (i.e. absent in the Lagrangian) (54) will also be absent from the Hamiltonian

This conclusion can also be obtained from the definition of the Hamiltonian,


i

. Notice, H differ from – L by ipxi

i , which does

not involved the coordinates xi explicitly. Hamiltonian with full cyclic coordinates The solution of the Hamilton’s equation is trivial for the particular case in which the Hamiltonian does not depend explicitly on time (it is a constant of motion) and all coordinates xi are cyclic.

xi i=1, …, n does not appear in the Hamiltonian

Under those conditions all the conjugate momenta i

ix

Hp

will be constant.

iip

The Hamiltonian then may be written in the form

) , ... , ,( n21 HH

and the equation of motion for the coordinates x i will be,

) ( constantH

xi

i i

whose solutions are, ii txi

It is true that rarely occurs in practice that all the coordinates are cyclic (for, thus, taking advantage of the easy way to find the solutions.) However, a given problem can be described by different sets of coordinates. It becomes important then to find a systematic procedure for transforming from one set of variables to another set of variables where the solution is more conveniently tractable. That will be the subject of using canonical transformations, to be described in section 4.4. 4.2D The modified Hamilton’s principle: Derivation of the Hamilton’s equations from a

variational principle. Hamilton’s canonical equation can be obtained from a variational principle, similar

to the way the Lagrange equations were obtained in Section 4.1B above. However, the variations will be over paths in the ) ,( px phase-space, which has 2n dimensions, twice

the n dimensions of the x

configuration-space. Interestingly enough, the function inside the integral, upon which the variational principle will be applied, is again the Lagrangian L, but now considered as a function of ),, ,( t,ppxx .

) ( ),, ,( ,, tpxL Ht, i p-ppxx xi

i (55)

As a first step, let’s apply the variational principle to

),( pxS ) ≡

)(

)(

222

111

t,

t, ,

, , px

px

),, ,( t,L ppxx dt (56)

Inside the integral, we have just written ),, ,( t,ppxx for

simplicity, but ) , , , ( )()()()( t,tttt ppxx should be used

instead, respectively.

In applying the variational principle, we realized it is very similar to the case when we applied it in the configuration space. This time we just have more independent variables. The result is,

0x

L

x

L

dt

d

ii

and 0

p

L

p

L

dt

d

ii

, for i=1,2, …, n. (57)

Applying (57) to the Lagrangian given in (55), ) ( ),, ,( ,, tpxL Ht, i p-ppxx xi

i ,

ipix

L

,

ii x

H

x

L

-

0x

L

x

L

dt

d

ii

implies

ix

Hip

(58a)

On the other hand,

0

ip

L

,

i

i

i p

Hx

p

L

-

0p

L

p

L

dt

d

ii

implies

i

ip

Hx

(58b)

That is, we obtain in (58) the canonical Hamiltonian equations (43).

4.3 The Poisson bracket Expression (48) evaluates how a given dynamic quantity F= F ) ,( t,px

varies as a

function of time, while x

and p

evolves according to the Hamilton equations. The first

term of the right hand in (48) turns out to be an important expression in itself; it is

called the Poisson bracket of F and H.

In general, the Poisson bracket [S,R] of the dynamical variable S= S ) ,( px

with the

dynamical variable R= R ) ,( px

is defined as,

i iiii x

R

p

S

p

R

x

SRS, (55)

Poisson bracket of the dynamic quantities R and S

Note: Sometimes it will be convenient to express the Poisson bracket as

[S,R](x,p) (56)

to emphasize that the derivatives in the bracket are taken with respect to the variables ) ,( px

. For, it may happen that an (invertible) transformation of

coordinates

),,( tpxQQ

and ),,( tpxPP

may have taken place. Therefore the dynamical quantities will have a

dependence on Q

and P

, and the Poisson bracket of the S and R can be taken

with respect to the new variables,

[S,R](Q,P) (57)

In terms of the Poisson bracket, expression (48) can be expressed as,

dt

dF

t

FHF

],[ (58)

4.3A The Hamiltonian equations in terms of the Poisson brackets

In the particular case that F = x , expression (58) gives,

][][ H,xt

xH,x

dt

dxα

αα

α

But notice that ][ H,xα =

i ii

α

ii

α

x

H

p

x

p

H

x

x=

p

H

Therefore dt

dxα =p

H

= ][ H,xα

In the particular case that F=p, expression (58) gives,

],[],[

Hpt

pHp

dt

dp

But notice that ][ H,pα =

i ii

α

ii

α

x

H

p

p

p

H

x

p=

x

H

Therefore ][ H,pdt

dpα

α =x

H

Thus,

(59)

][ H,xdt

dxi

i

],[ Hpdt

dpi

i

Hamilton canonical equations (2n first-order equations)

i = 1, 2, …, n

constitutes an alternative way to express the Hamilton canonical equations.

4.3B Fundamental brackets

i iiii x

x

p

x

p

x

x

xxx

, = 0 for any 1, 2, ..., n

i iiii x

p

p

p

p

p

x

ppp

, = 0 for any 1, 2, ..., n

i iiii x

p

p

x

p

p

x

xpx

, = 0 for ≠

i iiii x

p

p

x

p

p

x

xpx

, = 1 for =

That is,

0, , )()( px,px, ][ ][ ppxx (60)

, )( px,][ px … ..

Remark. Notice, the result (60) appears to be trivial. It is. In fact, this is a property of the Poisson bracket itself, regardless of the existence of a Hamiltonian.

It turns out, however, that if the bracket of the variables px , were calculated with

respect to other arbitrary variables (Q,P), the value of )(, PQ,][ px would be, in general,

different than .

(61) For an arbitrary transformation (x,p) (Q,P)

, )( PQ,][ px (61)

But, for a particular type of transformation of coordinates, called canonical transformations (which are introduced in connection to Hamiltonian description of motion, to be described in detail in the following sections), the Poisson bracket has the

remarkable property of remaining constant, that is

, )( PQ,][ px , regardless of

the new canonical variables used to describe the motion. Hence the usefulness of the Poisson bracket; it helps to identify such important canonical transformations.

Here we derive the chain rule that relates the Poisson brackets evaluated with respect to different coordinates.

If S=S ) ,( PQ

and R=R ) ,( PQ

where ),,( tpxQQ

and ),,( tpxPP

i iiii

px,

x

R

p

S

p

R

x

SRS )(, =

i

)( )( i

β

βi

β

βα i

α

αi

α

α p

P

P

R

p

Q

Q

R

x

P

P

S

x

Q

Q

S

i

)( )( i

β

βi

β

βi

α

αi

α

α x

P

P

R

x

Q

Q

R

p

P

P

S

p

Q

Q

S

)

(

i

β

βi

α

αi

β

βi

α

α

i

β

βi

α

αi

β

βi

α

α

p

P

P

R

x

P

P

S

p

Q

Q

R

x

P

P

S

p

P

P

R

x

Q

Q

S

p

Q

Q

R

x

Q

Q

S

i

)

(

i

β

βi

α

αi

β

βi

α

α

i

β

βi

α

αi

β

βi

α

α

x

P

P

R

p

P

P

S

x

Q

Q

R

p

P

P

S

x

P

P

R

p

Q

Q

S

x

Q

Q

R

p

Q

Q

S

i

β

px,

βα

αβ

px,

βα

α P

RP,Q

Q

S

Q

RQ,Q

Q

S )()( ][][ (

+ ) ][][ )()(

β

px,

βα

αβ

px,

βα

α P

RP,P

P

S

Q

RQ,P

P

S

Thus,

For S=S ) ,( PQ

and R=R ) ,( PQ

where ),,( tpxQQ

and ),,( tpxPP

(62)

P

RP,P

P

S

Q

RQ,P

P

S

P

RP,Q

Q

S

Q

RQ,Q

Q

S

β

px,

βα

αβ

px,

βα

α

β

px,

βα

αβ

px,

βα

αβα

)()(

)()(

][][

][][

i i iii

px,

x

R

p

S

p

R

x

S

RS,)(

Notice, if a particular transformation of coordinates (x,p) (Q,P) satisfies,

)(],[ px,

αβ PQ = ,

)(],[ px,

αβ QQ = 0, and (63)

)(],[ px,

αβ PP = ,

then (62) gives )( px,

RS,

Q

R

P

S

P

R

Q

S

iiiii

. That is,

)( px,

RS, ) ( PQ,RS, Valid only for those particular trans- (64)

formation (x,p) (Q,P) that satisfy (63)

Thus, we have found that those transformation of coordinates (x,p) (Q,P) satisfying (63) are very special: the bracket of two arbitrary dynamical quantities R and S remain invariant, independent of whether we use (x,p) or (Q,P) to evaluate the bracket.

4.3C The Poisson bracket theorem: Preserving the description of the classical motion

in terms of a Hamiltonian a) Example of a motion for which there is not Hamiltonian to describe it

Given a Hamiltonian ),,( tH px

we say that the time evolution of the classical system

is generated by H according to the canonical equations (43) and (59). There are cases, however, in which no Hamiltonian generates a particular motion. Consider, for example, the following motion:

22 )( xpx (65)

22

2)(

)(21 xp 2x-

xp

xp

That there is no H whose associated canonical equation ends up in (60) can be seen by first requiring,

22 )( xp p

H x

22

2)(

)(21 xp 2x-

xp

x

x

H p

The first equation gives,

)(4)( 2222

xpxxp x

px

H

and the second equation gives,

])()(2

1[ 22

2

2

xp 2x- xp

x

p

xp

H

xp 4x- xp

x - )(

)(21 2

22

The fact that

xp

H

px

H

22

(66)

implies that such a function H does not exist.

b) Change of coordinates to attain a Hamiltonian description

It is interesting to note in the example above that under a proper change of coordinates (x,p) (Q,P), a Hamiltonian function K can be found such that

P

K Q

and

Q

K- P

. The following transformation will do the trick.

xQ (67)

22 )( xp P

The equations of motion for Q and P are,

xQ according to (65)

22 )( xp

= P

)2()( 2 xxpxp2 P

= 2P1/2

( p +2QQ ) according to (65)

= 2P1/2

( 22

2)(

)(21 xp 2x-

xp

x

+ 2QQ )

= 2P1/2

( PQ 2 - P

Q

1/221 2QQ )

But PQ

= 2P1/2

( P

Q

1/221 )

Q P

In short, the motion described, in the x,p coordinates, as 22 )( xpx

22

2)(

)(21 xp 2x-

xp

xp

for which there is not a Hamiltonian,

is alternatively described in the new coordinates Q , P by the following equation of motion,

PQ (68)

Q P

These equations do have a Hamiltonian. Indeed,

setting P P

K Q

, gives K(Q,P) = (1/2)P2 + f (Q)

setting Q Q

K P

, gives K(Q,P) = (1/2)Q2 + g (P)

Therefore,

K(Q,P) = (1/2) Q2 + (1/2) P2 (69)

c) The Poisson bracket theorem The above remark indicates that, when a Hamiltonian cannot be found, it may be

that we are using the “wrong” coordinates. “Wrong” in the sense that it does not exist a Hamiltonian K to describe the motion in terms of the canonical equations (43) and (59). The following Poisson bracket theorem comes handy, then, as a guidance to recognize situations in which the motion, in the coordinates being used to describe it, can be generated by a Hamiltonian.

(70)

Let )(,)( tt px

be the time development of a system on phase space.

This development is generated by a Hamiltonian ),,( tH px

if and only if every pair of dynamic variables ),,( tR px

, ),,( tS px

satisfies the

relation

], [ ] [][dt

dSR S,

dt

dRSR,

dt

d

An outline of the proof is given in the Appendix-1, at the end of this chapter. This theorem is used to verify whether or not a given motion is a Hamiltonian motion. If we knew x=x(t) and p = p(t), we would apply (70) with the choice of R=q and S=p. If (70) were not fulfilled, then there will be no Hamiltonian to describe such a motion.

4.4 Canonical Transformations In many occasions, it may be convenient to make a transformation of coordinates

(x,p) (Q,P). The motivation may not be necessarily to simplify the mathematical burden in solving the Hamiltonian equations in the new coordinates; instead, more often it is to identify more clearly those physical quantities that remain constants

throughout the motion (even without solving the equations of motion.) In other occasions, the transformation of coordinates allows a better interpretation of classical mechanics formalism as a point of departure for elaborating a quantum theory. The latter is more pertinent to this course.

The example given in the previous section illustrate, however, that, if we want to keep a Hamiltonian formalism to describe the motion, we have to be careful when making a proper change of coordinates. Otherwise we may end up with no Hamiltonian associated to that motion when described in those new coordinates. In this section we study those types of transformations of coordinates that allow keeping the description of the classical motion within a Hamiltonian formalism, i. e. its description according to the canonical equations (43) and (59). They are called canonical transformations. We will find that the systematic procedure for generating such transformations, involves the participation of four types of so called generating functions. Each generating function produces a corresponding type of canonical transformation. In the course of constructing a consistent formalism for obtaining canonical transformations, a fortunate turn of events occurs. It turns out that, finding the proper generating function (to obtain a canonical transformation of specific characteristics) is equivalent to finding the solution to the canonical Hamiltonian equations! (This will be shown in Section 4.5C below.) Further, a particular canonical transformation will lead to a new Hamiltonian that is identically equal to zero; the equation that such a generating function must satisfy is the celebrated Hamilton Jacobi equation (this will be addressed in Section 4.7). In the subsequent chapters, when elaborating a quantum mechanics formalism, we will require that the quantum mechanics equation should have the Hamilton Jacobi equation as a limit. The latter justify our current effort for attaining a god understanding of the Hamilton Jacoby equation. We start by addressing the concept of canonical transformations first. Transformation of coordinates Consider that a classical motion is described by

)( ,)( tt px (71)

for which a Hamiltonian exists; i.e. the changes of )(tx

and )(tp

are governed by a

Hamiltonian H:

i

ip

Hx

and

i

ix

Hp

(72)

Consider a transformation of coordinates,

),,( ),,( tt P

Qpx (73)

),,( ; ),,( tpxtpxQQ

PP

As )(tx

and )(tp

change, the variables )(tQ

and )(tP

also change accordingly.

However, nothing ensures that the time evolution of the latter variables preserves the

Hamiltonian formalism given in expression (43). That is, it doesn’t always exists a

function K such that i

iP

KQ

and

i

iQ

KP

.

Those particular transformations that allow preserving the Hamiltonian form of the equation of motion are called canonical transformation. In this section we address,

a method to generate such transformations, and

inquire how to select a canonical transformation that renders a Hamiltonian K that is a constant function, for, in such a case, the equation of motion becomes obviously very simple.

( p

,x )

t=to

t

phase space ( P

,Q )

t=to

t

phase space

H

Transformation

Figure 3. For

),,( ; ),,( tpxtpxQQ

PP

4.4A Canonoid transformations (i.e. not quite canonical)

Preservation of the canonical equations with respect to a particular Hamiltonian)

Consider a Hamiltonian H = H ).,,( tpx

An invertible transformation of variables ),,( ),,( tt P

Qpx such

that the time evolution of the new variables )(tiQ

and )(tP

(74)

preserve the form of the Hamiltonian equations,

i.e. there exists a function K such that i

iP

KQ

and

i

iQ

KP

,

is called canonoid (i.e. not quite canonical) with respect to H.

. Note: It turns out, a transformation that is canonoid with respect to a given Hamiltonian need not be so with respect to another.

4.4B Canonical transformations

Definition A transformation that is canonoid with respect to all Hamiltonians (75) is called canonical.

Canonical transformation theorem3 Let ),( px

be a set of general coordinates on phase space. The Poisson brackets of any

two dynamic variables R and S will be specified as,

i iiii

px

x

R

p

S

p

R

x

SS

),(R,

Consider an invertible transformation

),,( ; ),,( tpxtpxQQ PP

The following three statements are equivalent:

a) The transformation ),,( ; ),,( tt pxpxQQ PP is canonical. (76)

b) There exists a nonzero constant z such that any dynamical variables R and S satisfy,

),(),(R, R,

pxQSzS

P (77)

(That is, the Poisson bracket is practically independent of the coordinates used to calculate it.)

The transformation ),,( ; ),,( tt pxpxQQ PP is canonoid with

respect to all quadratic Hamiltonians of the form,

H= C +

n

pcxc1

)'(

+

+

n n

pppxxx1 1

)"'(2

1

2

1

(78)

where = , ' = ', ” = ”

An outline of the proof is given in the Appendix-2.

Canonical transformation and the invariance of the Poisson bracket In short, the theorem above establishes that,

A transformation is canonical if and only if (79)

It preserves the Poisson bracket (to within a constant factor z.)

4.4C Restricted canonical transformations A canonical transformation ),,( ; ),,( tt pxpxQQ PP is called (80)

restricted-canonical transformation if in expression (78) z=1.

That is, ),(),(R, R,

pxQSS

P.

Note: In the literature the restricted canonical transformations are often simply called canonical transformations.

4.5 How to generate (restricted) canonical transformations

4.5A Generating functions of transformations In the remaining of the notes, whenever referring to canonical transformations, we

will assume that they are restricted (z=1); that is ),(),(R, R,

pxQSS

P. A way to

generate restricted canonical transformations will result along the way of attempting to classify them. Toward this end, let’s analyze first the following expression

ii P i

i

i

i Qpx - (81)

where the variables correspond to the transformation (73),

),,( ; ),,( tpxtpxQQ PP (82)

It turns out, when (81) is written as a function of ),,( tpx

it reveals much about the

transformation. It is found that when the transformation (82) is (restricted) canonical,

expression (81) becomes an exact total derivative of a scalar function F. The latter is then used to generate the transformation

The strategy is to show that there exists a function F that allows express (81) in the following alternative form,

ii P i

i

i

i Qpx - = else ...t

Fp

p

Fx

x

Fi

i

i

i

ii

(83)

To this end, let’s expand the left side in terms of the old coordinates,

ii P i

i

i

i Qpx - =

= ik P )(t

Q

p

Q

x

Q i

k

i

k

ii

i

i pxpx

ki k

k-

iQ

)PPP( imimi

i

i

i i

m

mm i

i

i

m

i

it

Q

p

Q

x

Qpxpx -

i

i

m i

m

ii m

i

i

mi

t

Q

p

Q

x

Qpxp iimm P)P()P( (84)

i i

The next step is then to identify, term by term, expressions (83) and (84).

Appendix-3 outlines the demonstration that there exists such a scalar function F such that,

im

m

i

mi

x

F P

x

Q p

im

m

i

m

p

F P

p

Q

and

KHt

FP

t

Q

i

ii

(85)

Further, replacing (85) in (84),

ii P i

i

i

i Qpx - = t

Fp

p

Fx

x

Fi

i

i

i

(

ii

+ H - K)

Or

H - i

i

i px -

KPQ i

i

i =

t

Fp

p

Fx

x

Fi

i

i

i

ii

H - i

i

i px -

KPQ i

i

i =

dt

dF (86)

What is remarkable is that the left side is an exact differential. The function F is called the generating function of the transformation for, as we will see, once F is given the transformation equations (82).

4.5B Classification of (restricted) canonical transformations Expression (86) suggests that, in order to effect the transformation between the two

sets of canonical variables, F must be a function of both the old and the new variables. F is a function of 4n variables plus the time. But only 2n of these are independent, because the two sets ) , P(Q and ),( px are connected by the 2n transformation

equations (82). Accordingly, there are four potential forms to express F, which will depend on the circumstances dictated by the specific problem:

),(1 t ,F Qx = ) ,( ) ,( t ,F Qxpx

),(2 t ,F Px = ), ,( ) ,( t F Pxpx ,

),(3 t ,F Qp = ) ,( ) ,( t ,F px Qp

),(4 t ,F Pp = ) ,( ) ,( t ,F px p P

Case 1: Assume the function ),(1 t ,F Qq is given.

In this case, (86) states,

H - i

i

i px -

KQ i

i

iP = ),(1 t ,dt

dFQx

=t

FQ

Q

Fx

x

Fi

i

i

i

111

ii

(87)

Since the xi and the Qi are independent, then the coefficients of ix and iQ should be

equal,

),(1 t ,x

F

i

ip Qx

i= 1, 2, … , n (88a)

These are used to solve for the n variables Qi as a function of ),( px

),(1P t ,Q

F

i

i Qx

i= 1, 2, … , n (88b)

Since the Qi have been determined in (88a), this expression gives the n variables Pi as a function of ),( px

which leaves (87) with,

t

FHK

1 (88c)

Case 2: Assume the function ),(2 t ,F Px is given.

In this case, (86) states,

H - i

i

i px -

KQ i

i

iP = ),(2 t ,dt

dFPx

=t

FFx

x

Fi

i

i

i

222 P

P

ii

(89)

On the left side, we expand the term i

i

iQ P . For convenience we write it as m

m

mQ P ,

H - i

i

i px -

KQ m

m

mP = ),(2 t ,dt

dFPx

=t

FFx

x

Fi

i

i

i

222 P

P

ii

(89)’

where we will replace PP

mmi

i

i

i i

m

Qx

x

QQ

i

m

m

mQ P = m

m

i

i

i

i i

Qx

x

QP ) P

P ( mm

i

Inverting the order of the summation

m

m

mQ P = P P

PP mm

i

im

ii

im

m i

Qx

x

Q m

(90)

Replacing (90) in (89)’

H - i

i

i px -

i

im

ii

im

m i

Qx

x

QP )

P ( ) ( PP mm

m

+ K = ),(2 t ,dt

dFPx

= t

FFx

x

Fi

i

i

i

222 P

P

ii

(91)

Since the xi and the Pi are independent in (91), then the coefficients of ix and iP

should be equal,

),( P 2m t ,x

F

x

Q

i

m

m i

ip Px

PP

mm

i

Q

m

= ),(P

2 t ,F

i

Px

(92)

K - H = t

F

2

It is not straightforward to visualize that from the first equation the Pi can be solved as a function of ),( px , since the Qm are involved there. Hence, we perform an extra step.

First, notice that the first equation in (92) can be written as,

),( P 2m t ,

x

FQ

x i

m

mi

ip Px

)P( m2 m

mi

i QFx

p

(93a)

Also notice in the second equation of (92),

m

i

m

ii

m QQQ

mmm P

PP

P

P

)P( mmm

mm

i

QQ

m

im

m

PP

m

im

i

QQ

P

P m

m

Hence, the second equation in (92) can be expressed as,

im

P

)P(Q

Q

i

m

m

= ),(P

2 t ,F

i

Px

iQ =

m

)P(P

m2 m

i

QF (93b)

In term of the function F2’,

m

)P( ' m22 mQFF ,

expressions (92a) and (92b) give,

i

ix

Fp

' 2

iQ = i

F

P

'2

(94)

K - H = t

F

' 2

At the end, there will be no interest in the function F2. We already found a F2’ that, if expressed in terms of (x,P) as independent variables, it will generate a canonical transformation. Hence, let’s just rename the F2’ as F2. In summary

),( 2 t ,x

F

i

ip Px

, iQ = ),(

P

2 t ,F

i

Px

, K - H =

t

F

2

solve for Once the Pi are known,

this gives

),,(PP t ii px ),,( t QQ ii px

Invert them to obtain

),, t P x(Qx ),, t K P (Q = ),,( t H px +t

F

' 2 ),( t ,Px

),, t P p(Qp

),, t K P (Q = ), ,( ),,),, t H t t P (QP (Q px +

+t

F

' 2 ),( ),, t ,t PP (Qx

4.5C Evolution of the mechanical state viewed as series of canonical transformations i) The generator of the identity transformation Consider the generating function

i

x i2 P),( ixt ,F P (95)

Let’s find out the canonical transformation it generates. Using (94),

i

i

ix

Fp P 2

(96)

iQ = i

i

xF

P

2

We find the new coordinates ) ,( PQ are the same old coordinates ),( px . That is, the

function in i

x i2 ),( Pxt ,F iP generates the identity transformation.

),( px I

) ,( PQ

ii) Infinitesimal transformations Consider the generating function

i

x i2 ),( Pxt ,F iP + ),( t ,G px (97)

where is an infinitesimal number, and ),( t ,G px an arbitrary function (to be specified

later.)

Due to the small value of , and since i

iPxi generates the identity transformation, we

expect the new coordinates ) ,( PQ will differ from the old ones ),( px also by

infinitesimal values; that is,

iQ = iix (98)

iP = iip

Let’s figure out the values of i and i .

Applying (94) to (97) gives,

ii

ix

G

x

Fp

i

2 P (99a)

iQ = ),(PP

)(2 t ,Gx

F

i

i

i

Ppx

based on (99a) one obtains

),( )( t ,Gp

xi

i Ppx

ip

iP

),( )( t ,Gp

xi

i Ppx

(99b)

In short, we have obtained:

The function i

x i2 ),( Pxt ,F iP + ),( t ,G px generates

the transformation

i

ix

Gp

Pi (100)

iQ i

ip

Gx

iii) The Hamiltonian as a generating function of canonical transformations

Notice in (100), if G is the Hamiltonian ),( t ,H px , and is chosen to be the incremental

time differential dt , one obtains,

i

ix

Gp

Pi =

i

ix

Hdtp

)(

Using the Hamiltonian equations (43)

= )( )( ii pdtp

But )( dtpi is the increment of p due to the motion

= ii dpp

iQ i

ip

Gx

=

i

ip

Hdtx

)(

Using the Hamiltonian equations (43)

= )( )( ii xdtx

= ii dxx

That is, we have obtained:

(70)

The function i

x i2 ),( Pxt ,F iP + (dt) ),( t ,H px generates the

transformation ),( px ) ,( PQ

ii dxx Q i (101)

ii dpp Pi

where dxi and dpi are, respectively, the changes in xi and pi due to the motion governed by the Hamiltonia H.

Notice in (101) that, basically, the Hamiltonian can be viewed as the generator of the transformation. It transforms the value of the coordinates

),( px at the time t, to the value of those coordinates at the time t +dt.

This result is very interesting. It tells us that the evolution of the state of

motion ) ,( )()( tt ii px can be viewed as a series of canonical transformations

generated by the function H, (as the result applying (101) successively one after another differential time dt.)

iv) Time evolution of a mechanical state viewed as a canonical transformation Symbolically, the result in (101) can be expressed this way:

Defining )( t't,F i

iPxi + (t’-t) ),( t ,H px (102)

x (t), p(t) )( t't,F

x(t’), p(t’) )( t"t,F

x(t”), p((t”) , …. , (103)

Evolution of the classical state x (t), p(t) over time. The Hamiltonian in (102) constitutes the generator of the state’s evolution with time.

Expression (103) also hints an approach, at least conceptually, of how to attain a solution of the equation of motion. In effect, since the successive applications of

canonical transformations is a canonical transformation, finding the solution x (t), p(t)

of the Hamilton equations can be view as the task of finding a canonical transformation

that takes the initial conditions x(t0), p(t0) (old coordinates) to the values of the state at

the time t, x (t), p(t) (the new coordinates).

Canonical transformation

x(t0), p(t0) )( 0tt,F

x(t), p(t) (104) old coordinates new coordinates

4.6 Universality of the Lagrangian 4.6A Invariant of the Lagrangian equation with respect to the coordinates used in the

configuration space It was stated in the sections 4.1B above that the Lagrangian equations

0x

L

x

L

dt

d

ii

(105)

have the particular interesting property of being independent of the particular coordinates used in the configuration space. Regardless of the coordinates being used, the Lagrangian equations look the same. From each new coordinates we would be able to obtain a corresponding Hamiltonian, by following the procedure of Section 4.2B. Such a statement may appear to be at odds with the concepts outlined in Section 4.4, where we had to be careful in not making the wrong transformation of coordinates, otherwise we would end up with a no Hamiltonian describing the system. The transformation had to be canonical. The following example helps clarify the issue. Consider the generating function,

m

mm t ft ,F P)(),(2 xx P (106)

where the fi are arbitrary functions.

Using (94),

iQ = )(P

2 t fF

i

i

x

(107)

ii

ix

Fp

2 =

m

m

i

m

x

t fP

)(x

That is, the new coordinates Qi depend only on the old coordinates and time, but do not involve the old momenta. Such a transformation is then an example of the class of point transformation, done in the configuration space, which led to the consideration of the invariant of the Lagrangian equation with respect to a transformation of coordinates. In this context, such a point transformation (107) are canonical, and therefore preserve the Hamiltonian description of the motion. We have therefore a view of the invariance of the Lagrangian from a canonical transformation of coordinates perspective.

4.6B The Lagrangian equation as an invariant operator

In Section 4.2D, the application of the modified Hamilton’s variational principle to the generalized Lagrangian ),, ,( t,L ppxx led to

0x

L

x

L

dt

d

ii

and 0

p

L

p

L

dt

d

ii

, for i=1,2, …, n (106)

For simplification purposes in this section let’s use, ζpx ) ,(

So we express the Lagrangian as,

),( t,L ζζ (107)

And the equation (106) take the compact form,

0 ),(

t,

dt

dL

ii

ζζ

, for i=1,2, …, 2n (108)

For new coordinates obtained through a canonical transformation

),,( ),,( tt P

Qpx

which in our new notation will be expressed as, ),( ),( tηtζ (109)

The application of the variational principle to the Lagrangian

),( t,L' ηη (110)

will lead to

0 ),(

t,

dt

dL'

ii

ηη

, for i=1,2, …, 2n (111)

On the other hand, expression (86) tell us that

)( t,,H - i

i

i px px -

)( t,,KPQ i

i

i PQ = dt

dF

or, in the new notation,

),( ),( dt

dFt,t, 'LL ηη-ζζ (112)

Now come the time to highlight further the importance of expression (112). It turns out that replacing (112) in (108), one obtains,

0 ),(

t,

dt

dL'

ii

ηη

(113)

That 0 ),(

t,

dt

dL

ii

ζζ

and 0 ),(

t,

dt

dL'

ii

ηη

are the same is

remarkable.

It was obtained not obtained because 0 ),(

t,

dt

dL

ii

ζζ

and

0 ),(

t,

dt

dL'

ii

ηη

are equations for the same set of motion in the

phase-space.

It was obtained only because L and L’ differ by a function dt

dF

It suggest that

iidt

d

is a kind of operator independent of the particular

coordinates being used. For, if the Lagrangian is expressed as ),( t,L ζζ or

),( t,L' ηη , the application of the operator

iidt

d

renders the

description of the same set of motion in the phase-space.

4.7 The Hamilton Jacobi equation

The Hamilton principal function We have been envisioning ways to obtain simple ways to solve the Hamiltonian

equations. In one case, we alluded to the convenience of finding a systematic way to transform from one set of coordinates to another set of coordinates in such a way that the new Hamiltonian ends up having all the coordinates xi 1=1, 2,…,n being cyclic. Another approach was outlined in Section 4.5D, with a canonical transformation relating the old coordinates (taken as the state variables at a given time t) to the new coordinates (taken as the state variables at a time t’.)

We will follow this latter approach for being more convenient, since it applies even when the Hamiltonian depends on time (i.e. it is not a constant of motion.) The outlook will have a slight variation. We will be looking for a canonical transformation F such that

it transform the coordinates x(t), p(t) to a new set of coordinates x(t0), p(t0) (the latter are constant values of the initial conditions). In our custom terminology,

( x(t), p(t) ) )( 0tt,F

( Q , P ) (114)

( x(t0) , p(t0) )

( x0 , p0 )

New coordinates constant in time

H(x , p ) )( 0tt,F

K( Q, P)

One can automatically ensure that the new variables (Q , P) are constant in time by requiring that the new Hamiltonian K( Q, P) be identically zero!, for then the equations of motion are,

0P

i

KQi

(115)

0Q

P

i

Ki

K is related to the old Hamiltonian H and the generating function by,

t

FHK

which will be zero if,

0 ) , ,(

t

FtH px (116)

It is convenient to choose ),( Px as the independent coordinates; that is,

),(2 t ,F Px = ), ,( ) ,( t F Pxpx (117)

which gives the

i

ix

Fp

2 ),( t ,Px (118)

iQ = i

F

P

2

),( t ,Px

Equation (116) takes the form,

0 , , .... , , 2

n

2

1

2 )(

t

F t

x

F

x

FH x

Given H, this constitutes an equation for F2. It is customary to denote the solution of this equation by S, which is called the Hamilton’s principal function.

0 , , .... , , )(n1

t

S t

x

S

x

SH x (119)

Hamilton-Jacobi Equation ),( t ,SS Px with P = constant

Hamilton’s principal function

When solving (119), the n constants of integration can be taken as the Pi s.

Expression i

ix

Sp

),( t ,Px evaluated at t0 gives a relationship between the

constant value of P and the initial conditions x0 , p0.

Then using expression (118), iQ =i

F

P

2

),( t ,Px , one has a relationship to find the

Qis. In particular the Qis can be obtained by evaluating this expression at t0.

Inverting iQ =i

F

P

2

),( t ,Px , one obtains,

) ,( t ,PQxx (120)

which gives the the coordinates as a function of time and the initial conditions. In short, when solving the Hamilton-Jacobi equation, we are at the same time obtaining a solution of the mechanical problem

Further physical significance of the Hamilton principal function Let’s evaluate the total derivative of ),( t ,SS Px .

dt

dS

t

Sx

x

Si

i

i

because the Pis are constant .

Using expression (118), i

ix

Sp

),( t ,Px , gives

dt

dS

t

Sxp ii

i

Using the Hamilton Jacobi equation 0 , , )(

t

S tH px , o

dt

dS Hxp ii

i

The term on the right iis the Lagrangian )( , , tL px ; thus,

dt

dS L (121)

L is then an indefinite time integral of the Lagrangian,

dt' t't't'LS , , )( )()( pxt

(122)

Appendix-1 Poisson bracket theorem

Let (t)(t) ) , ( px

be the time development of a system on phase space. This

development is generated by some Hamiltonian ),,( tH px

if and only if

every pair of dynamic variables ),,( tR px

, ),,( tS px

satisfies the relation

],[],[],[ SRSRSRdt

d (1)

Outline of the proof:

H implying (1) is straightforward. Hint: The existence of a Hamiltonian implies that any variable F changes

according to dt

dF

t

FHF

],[ ). Use the last expression with F= [R,S].

That (1) implies the existence of a Hamiltonian governing the variations of (t)x

and (t)p

is more elaborated.

The variable (t)x , for example, will have the form,

(t)x = (t)(t) ) , ( px

= 1, 2, …, n

Similarly,

(t)pβ = (t)(t) ) , ( px

= 1, 2, …, n

(Below we demonstrate that (1) implies

x

=

p

)( )

We want to prove and can be obtained from a function H such that

pH/ and xH/ .

We need to demonstrate the existence of H.

Note: It may be sufficient to demonstrate that

x

=

p

)( (2)

For it will ensure not contradicting the fact that, if H existed, it would

have to satisfy xp

H

px

H

22

Since, regardless of the existence of a Hamiltonian or not,

, ][ px , we

start with the following result: 0],[ pxdt

d. Expression (1) implies

],[],[],[0 pxpxpxdt

d

Using x = and p = one obtains,

],[],[0 xp

i iiii x

p

pp

p

x

+

i iiii xp

x

px

x =0

x

+

p

=0

or

x

=

p

)(

In other words, (2) does not contradict the quest for finding a function H

that satisfies pH/ and xH/ .

________________________________________________________________________ Appendix-2 Canonical transformation theorem

Let ),( px

be a set of general coordinates on phase space. The Poisson brackets

of any two dynamic variables R and S will be specified as,

i iiii x

R

p

S

p

R

x

SS

),(R,

px

Consider an invertible transformation

),,( ; ),,( tt pxpxQQ

PP

The following three statements are equivalent:

a) The transformation ),,( ; ),,( tt pxpxQQ

PP is canonical.

b) There exists a nonzero constant z such that any dynamical variables R and S satisfy,

),(),(R, R,

pxQ

SzS P

(1)

(That is, the Poisson bracket is practically independent of the coordinates used to calculate it.)

c) The transformation ),,( ; ),,( tt pxpxQQ

PP is canonoid with

respect to all quadratic Hamiltonians of the form,

H= C +

n

pcxc1

)'(

+

n n

pppxxx1 1

)"'(2

1

2

1

(2)

where = , ' = ', ” = ”

Proof:

That b) implies a) is a bit straightforward. All it needs to be done is to show

),(),(),( ],[],[],[ PPP

QQQ SRSRSRdt

d . For, according to the Poisson bracket

theorem (shown above), the latter implies that the evolution of )(tQ

and )(tP

is generated by a Hamiltonian.

From b:) ),(),( R, ],[pxQ

Szdt

dSR

dt

dP

),(R,

px

Sdt

dz

Assuming that the evolution of ),( px

is governed by

a Hamiltonian, Poisson bracket theorem implies,

} ],[],[ {R, ),(),(),( pxpxpx

SRSRzSdt

dz

Using b) again,

],[],[ ),(),( PP

QQ SRSR

That c) implies b) is more elaborated.

Exploiting the fact that ),,( ; ),,( tt pxpxQQ

PP is canonoid with respect to

any quadratic Hamiltonian, we will figure out first the value of

),(],[ P

Q

xx = ? ,),(],[ P

Q

px =?, and ),(],[ P

Q

pp =?,

which then be used in the result for ),(],[ P

QSR obtained in (36).

Lets’ start considering the case that ),,( ; ),,( tt pxpxQQ

PP

is canonoid with respect to H=C (constant).

),,( tH px

constant implies,

0

ii

ip

const

p

Hx and 0

ii

ix

const

x

Hp (3)

Since ),,(),,( ; tt pxpxQQ

PP is canonoid with respect to the

Hamiltonian ),,( tH px

=const, it means (according to the Poisson bracket

theorem) that there exist a Hamiltonian (K= ),( PQK ) according to which a

quantities like ),(],[ P

Q

xx and ),(],[ P

Q

px would evolve in time as,

),(),(),( ],[],[],[ PPP

QQQ

xxxxxxdt

d

),(),(),( ],[],[],[ PPP

QQQ

pxpxpxdt

d

),(),(),( ],[],[],[ PPP

QQQ

ppppppdt

d

But according to (40) all the 0ix and 0ip , which gives

0],[ ),( P

Q

xxdt

d;

0],[ ),( P

Q

pxdt

d; and (4)

0],[ ),( P

Q

ppdt

d

Let’s put ),(],[ P

Q

xx , ),(],[ P

Q

px , and ),(],[ P

Q

pp as a

function of ),,( tpx

),(],[ P

Q

xx ≡ f ),,( tpx

(5)

),(],[ P

Q

px ≡ g ),,( tpx

),(],[ P

Q

pp ≡ s ),,( tpx

For a Hamiltonian H ),,( tpx

=const,

0],[ ),( px

Hf , 0],[ ),( px

Hg , 0],[ ),( px

Hs ; and

the time variation of f, g, and s, are given by,

ft

Hffdt

d

),(],[ px

= ft

gt

Hggdt

d

),(],[ px

= gt

st

Hssdt

d

),(],[ px

= gt

Since, according to (4), 0 fdt

d, 0 g

dt

d, and 0 s

dt

d, one

obtains,

0],[ ),(

P

Q

xxt

ft

; .

0],[ ),(

P

Q

pxt

gt

; and .. (6)

0],[ ),(

P

Q

ppt

st

…

So, neither ),(],[ P

Q

xx , ),(],[ P

Q

px

nor ),(],[ P

Q

pp depend explicitly on the

time variable. . Note: Notice that, although we have used the Hamiltonian H=constant to

obtain the result (6), neither of the values ),(],[ P

Q

xx , ),(],[ P

Q

px , or

),(],[ P

Q

pp depend on the particular Hamiltonian used. They depend

only on the transformation coordinates.

Consider now the Hamiltonian H=

n

pcxc1

)'(

This implies

i

i

i cp

Hx '

and i

i

i cx

Hp

(7)


PP is canonoid with respect to this

Hamiltonian H, it means (according to the Poisson bracket theorem) that

there exist a Hamiltonian (K’= ),(' PQK ) according to which a quantities like

),(],[ P

Q

xx , ),(],[ P

Q

px and ),(],[ P

Q

pp would evolve in time as,

),(),(),( ],[],[],[ PPP

QQQ

xxxxxxdt

d ,

),(),(),( ],[],[],[ PPP

QQQ

pxpxpxdt

d , and

),(),(),( ],[],[],[ PPP

QQQ

ppppppdt

d

Using (7),

0],'[],[ ),(),( PP

QQ

xcxx ;

0],[],[ ),(),( PP

QQ

cxpx ; and

0],[],[ ),(),( PP

QQ

cppp

Thus,

0],[ ),( fdt

dxx

dt

d P

Q

0],[ ),( gdt

dpx

dt

d P

Q

0],[ ),( sdt

dpp

dt

d P

Q

Hence, in terms of the Hamiltonian H=H ),,( tpx

ft

Hffdt

d

),(],[0 px

gt

Hggdt

d

),(],[ 0 px

st

Hssdt

d

),(],[ 0 px

Using (6),

0],[ ),( px

Hf

0],[1

)'(

n

ipcxc iiiif

011

],['],[

n

i

ii

n

i

ii pfcxfc

011

'

n

k ki

n

k kk x

fc

p

fc

This should be true for all arbitrary values of ck and c’k, which implies,

0

kx

f and 0

kx

f for k = 1, 2, …. , n

Similarly, (8)

0

kx

g and 0

kx

g for k = 1, 2, …. , n.

From (6) and (8), the quantities ,

),(],[ P

Q

xx , ),(],[ P

Q

px and ),(],[ P

Q

pp (9)

do not depend on the ),,( tpx

variables. .

That is, they are constant values. .

Now we need to prove that all those constant values are of the same magnitude │z│. To that effect consider the Hamiltonian

H =

n n

pppxxx1 1

)"'(2

1

2

1

)""'(2

1

2

1

ppxp

Hx iii

i

i

)"'(

px ii (10)

)'(

pxx

Hp ii

i

i

)'((

px ii (10)’


PP is canonoid with respect to this

Hamiltonian H, it means

),(),(),( ],[],[],[ PPP

QQQ

xxxxxxdt

d =0

),(),(),( ],[],[],[ PPP

QQQ

pxpxpxdt

d =0

),(),(),( ],[],[],[ PPP

QQQ

ppppppdt

d =0

(All these quantities are equal to zero, according to (9).)

Using (10), ),(),( ],[],[0 PP

QQ

xxxx implies,

),(

),(

)]"'(,[

]),"'([0

P

P

Q

Q

kkkk

k

kkkk

k

pxx

xpx

),(),(

),(),(

)],["],['(

],["],['(

PP

PP

QQ

QQ

kkkk

k

kkkk

k

pxxx

xpxx

0 = ('F) + (F') - (G”) + (G”) (11)

’=0 implies (G”) = (G”) ”=I implies G is symmetric

”=0 implies (’F) = (F’)’=I implies F in symmetric

Similarly, using (10) ),(),( ],[],[0 PP

QQ

pxpx implies,

),(

),(

)]'((,[

]),"'([0

P

P

Q

Q

kkkk

k

kkkk

k

pxx

ppx

),(),(

),(),(

)],['],[

],["]['(0

PP

PP

QQ

QQ

kkkk

k

kkkk

k

pxxx

pppx

0 = (’G) + (”S)- (F)- (G’) (12)

’=”=0 implies F = 0

=’=0 implies ”S=0; ”=I implies S=0

=”=0 implies ’G =G’ ; ’=I implies G=G

Using (10), ),(),( ],[],[0 PP

QQ

pppp implies

),(

),(

)]'((,[]),'(([0 PP

QQ

kkkk

k

kkkk

k

pxpppx

),(

),(

),(),(

)],['],[

],['],[0

PP

PP

QQ

QQ

kkkk

k

kkkk

k

ppxp

pppx

0 = (G) + (’S)- (G) + (S’) (13)

’=0 implies (G) = (G); =I implies G is symmetric

=0 implies ('S) = -(S’); '=I implies S= -S =0

From (11), (12) and (13)

F=0 or f ≡ ),(],[ P

Q

xx = 0

S=0 or s ≡ ),(],[ P

Q

pp = 0 (14)

G symmetric g≡ ),(],[ P

Q

px = g

We can further determine the g values for ≠.

For example:

Since the matrix is arbitrary, let’s consider one in which only 13=31 are the only elements ≠0.

......00

...

...000

...

...........

...

...

...

.........0

...00

...000

...00

133112311131

331332133113

333231

232221

131211

31

13

ggg

ggg

ggg

ggg

ggg

G

Expression (13) requires (G) = (G)which implies

g11= g33 and g32= g3n =0 for n>3

We arrive to the conclusion then that

g = const (15)

The constant value depends on the particular transformation used, and it is referred to as z in the text of the theorem.

Thus,

F=0 or f ≡ ),(],[ P

Q

xx = 0

S=0 or s ≡ ),(],[ P

Q

pp = 0 (16)

G symmetric g≡ ),(],[ P

Q

px = z

From the chain rule, expression (40) in the main text of this chapter,

i iiii Q

R

P

S

P

R

Q

SS

),(

R, P

Q

. … ..

],[ ],[

],[ ],[

),(),(

),(),(

p

Rpp

p

S

x

Rxp

p

S

p

Rpx

x

S

x

Rxx

x

S

PP

PP

QQ

QQ

),(

R, P

QS

],[],[ ),(),(

x

Rxp

p

S

p

Rpx

x

S PP

QQ

] [] [ xx

x

Rz

p

S

p

Rz

x

S

x

x

R

p

S

p

R

x

Sz

which gives,

),(x

),(R,R,

pP

xQSS z

The Poisson bracket is invariant under a canonical transformation (up to a constant value z.)

___________________________________________________________________ Appendix-3 How to generate canonical transformations

A way to generate restricted canonical transformations will result along the way of attempting to classify them. Toward this end, lets analyze the following expression,

ii P i

i

i

i Qpx - (1)

where the variables correspond to the transformation,

),,( ; ),,( tt pxpxQQ

PP .

It turns out, when (1) is written as a function of ),,( tpx

it reveals much about the

transformation.

Basically, the strategy consist in showing that, when the transformation is (restricted) canonical (that is, there exist a

H=H ),,( tpx

and a K=K )( , P

Q ),

expression (1) turns out to be an exact total derivative of a scalar function F=F ),,( tpx

.

ii P i

i

i

i Qpx - =

expanding the second term

= )( PPP iikiit

Q

p

Q

x

Q i

k k

i

i k

k

k

i

i

i pxpx

-

,

ik

andmi

Renanimng

,

ik

andmi

Renanimng

)Pt

QP

p

QP

x

Q -

i

ii

i

mi

i

m

m

m

m i

i

i

mi

i

i pxpx

i

i

m i

m

ii m

i

i

mi

t

Q

p

Q

x

Qpxp iimm P)P()P( (2)

i i

The objective is to show that there exists a function F that makes (2) equal to

t

Fp

p

Fx

x

Fi

i

i

i

ii

m i

mi

x

Qp mi P implies

m ki

m

m ik

m

m i

m

kk

i

xx

Q

xx

Q

x

Q

xx

mm

2

m

P)P()P(

m k

mk

x

Qp mk P implies

m ik

m

m ki

m

m k

m

ii

k

xx

Q

xx

Q

x

Q

xx

mm

2

m

P)P()P(

which gives,

PQ

,mm },{PP

ki

m i

m

km ik

m

i

k

k

i xxx

Q

xxx

Q

xx

the latter is the Lagrange bracket

Without proof we state:

The transformation ),,( ; ),,( tt pxpxQQ

PP is canonical

if and only if (3)

ikkikiki pxppxx PQPQPQ

,,, },{ and 0},{},{

Therefore, one obtains

0

i

k

k

i

xx

This indicates that there exist a function F such that,

x

F

(4)

Similarly

m i

mi

p

QmP implies,

m ki

m

m ik

m

k

i

pp

Q

pp

Q

p

mm

2 P)()P(

m k

mk

p

QmP implies,

m ik

m

m ki

m

i

k

pp

Q

pp

Q

p

mm

2 P)()P(

which gives,

PQ

,mm },{)P

)P

)( ki

m ki

m

ik

m

i

k

k

i pppp

Q

pp

Q

pp

= 0

This indicates that there exist a function F such that,

p

F

(5)

i

i

t

QΨ iP implies,

)P( i

2

k

ii

i k

i

k x

P

t

Q

tx

Q

x

Ψ

and )P( i

2

k

ii

i k

i

k p

P

t

Q

tp

Q

p

Ψ

m k

mk

x

Qp mk P implies

m k

mk

p

QmP implies

)P( m

2

t

P

x

Q

xt

Q

t

m

k

m

m k

mk

and )P( m

2

t

P

p

Q

pt

Q

t

m

k

m

m k

mk

This gives,

)(t

P

x

Q

x

P

t

Q

xt

i

k

i

i k

ii

k

k Ψ

and )(

t

P

p

Q

p

P

t

Q

pt

i

k

i

i k

ii

k

k Ψ

(6)

On one hand iQ can be determined by the Hamiltonian K=K )( , P

Q

i

iP

KQ

On the other hand )(tQi is a quantity that can also be tracked by the

Hamiltonian H=H ),,( tpx

t

QHQQ i

ii

),(],[ p

x

Thus

) (

],[],[ ),(),(

x

H

p

Q

p

H

x

Q

P

K

HQP

KHQQ

t

Q

ii

i

i

i

iii pp

xx

Similarly,

) (

],[],[ ),(),(

x

H

p

P

p

H

x

P

Q

K

HPQ

KHPP

t

P

ii

i

i

i

iii pp

xx

Further,

k

iii

k

i

ik

ii

x

P

x

H

p

Q

p

H

x

Q

x

P

P

K

x

P

t

Q

) (

) (x

H

p

P

p

H

x

P

x

Q

Q

K

x

Q

t

P

x

Q ii

k

i

ik

ii

k

i

Upon subtraction

k

iiiii

k

i

ik

i

k

i

i

i

k

i

k

ii

x

P

x

H

p

Q

p

H

x

Q

x

H

p

P

p

H

x

P

x

Q

Q

K

x

Q

x

P

P

K

t

P

x

Q

x

P

t

Q

) () (

) () (

x

H

x

P

p

Q

p

H

x

P

x

Q

x

H

x

Q

p

P

p

H

x

Q

x

P

Q

K

x

Q

x

P

P

K

t

P

x

Q

x

P

t

Q

k

ii

k

ii

k

ii

k

ii

ik

i

k

i

i

i

k

i

k

ii

x

H

x

Q

p

P

x

P

p

Q

p

H

x

P

x

Q

x

Q

x

P

Q

K

x

Q

x

P

P

K

t

P

x

Q

x

P

t

Q

k

ii

k

ii

k

ii

k

ii

ik

i

k

i

i

i

k

i

k

ii

) () (

Now, summing on the i variable (see expression (6)),

x

H

x

Q

p

P

x

P

p

Q

p

H

x

P

x

Q

x

Q

x

P

Q

K

x

Q

x

P

P

K

t

P

x

Q

x

P

t

Q

k

ii

k

ii

ik

ii

k

ii

ik

i

k

i

ii

i

k

i

k

ii

) () (

i

i

k

k

k

kk

ki

i

k

i

k

ii

x

HK

x

H

x

K

x

Hpx

p

Hxx

x

K

t

P

x

Q

x

P

t

Q

)(

0

},{},{ ,,

PQPQ

kk

k

x

HK

xt

Ψ

)(

(7)

But, according to (4), k

kx

F

kkk x

HK

xx

F

t

Ψ

)(

t

F

xx

HK

x kkk

Ψ

)(

0

) (0t

FHK

xΨ

k

(8)

Similarly, we expect to obtain

) (0t

FHK

pΨ

k

(9)

This implies that ) (t

FHKΨ

is a function of t alone. However, (4) and (5)

determine F up to an additive function f(t); the latter can be choose such that,

t

FHKΨ

(10)

From (2), which defines , and (10)

t

F

t

QHK

i

i

iP

From (2), which defines i , and (4)

im i

m

x

F

x

Qp

mi P

From (2), which defines i , and (5)

im i

m

p

F

p

Q

mP

___________________________________________________________________ 1 H. Goldstein, Classical Mechanics, Addison-Wesley Publishing (1959).

2 Saletan and Cromer, “Theoretical Mechanics” Wiley, 1971.), page 180.

3 Ref.2, page188.

Documents

From the Hamilton’s Variational Principle to the Hamilton ... · From the Hamilton’s Variational Principle to ... Recipe for solving problems in mechanics ... 4.6 Universality