In this first section we will discuss various aspects of classical fields. We will cover only the bare minimum ground necessary before turning to the quantum theory, and will return to classical field theory at several later stages in the course when we need to introduce new ideas.
A field is a quantity defined at every point of space and time . While classical particle mechanics deals with a finite number of generalized coordinates , indexed by a label , in field theory we are interested in the dynamics of fields
(1.3) |
where both and are considered as labels. Thus we are dealing with a system with an infinite number of degrees of freedom — at least one for each point in space. Notice that the concept of position has been relegated from a dynamical variable in particle mechanics to a mere label in field theory.
The most familiar examples of fields from classical physics are the electric and magnetic fields, and . Both of these fields are spatial 3-vectors. In a more sophisticated treatement of electromagnetism, we derive these two 3-vectors from a single 4-component field where shows that this field is a vector in spacetime. The electric and magnetic fields are given by
(1.4) |
which ensure that two of Maxwell’s equations, and , hold immediately as identities.
The dynamics of the field is governed by a Lagrangian which is a function of , and . In all the systems we study in this course, the Lagrangian is of the form,
(1.5) |
where the official name for is the Lagrangian density, although everyone simply calls it the Lagrangian. The action is,
(1.6) |
Recall that in particle mechanics depends on and , but not . In field theory we similarly restrict to Lagrangians depending on and , and not . In principle, there’s nothing to stop depending on , , , etc. However, with an eye to later Lorentz invariance, we will only consider Lagrangians depending on and not higher derivatives. Also we will not consider Lagrangians with explicit dependence on ; all such dependence only comes through and its derivatives.
We can determine the equations of motion by the principle of least action. We vary the path, keeping the end points fixed and require ,
(1.7) | |||||
The last term is a total derivative and vanishes for any that decays at spatial infinity and obeys . Requiring for all such paths yields the Euler-Lagrange equations of motion for the fields ,
(1.8) |
Consider the Lagrangian for a real scalar field ,
where we are using the Minkowski space metric
(1.10) |
Comparing (1.1.1) to the usual expression for the Lagrangian , we identify the kinetic energy of the field as
(1.11) |
and the potential energy of the field as
(1.12) |
The first term in this expression is called the gradient energy, while the phrase “potential energy”, or just “potential”, is usually reserved for the last term.
To determine the equations of motion arising from (1.1.1), we compute
(1.13) |
The Euler-Lagrange equation is then
(1.14) |
which we can write in relativistic form as
(1.15) |
This is the Klein-Gordon Equation. The Laplacian in Minkowski space is sometimes denoted by . In this notation, the Klein-Gordon equation reads .
An obvious generalization of the Klein-Gordon equation comes from considering the Lagrangian with arbitrary potential ,
(1.16) |
We could also consider a Lagrangian that is linear in time derivatives, rather than quadratic. Take a complex scalar field whose dynamics is defined by the real Lagrangian
(1.17) |
We can determine the equations of motion by treating and as independent objects, so that
(1.18) |
This gives us the equation of motion
(1.19) |
This looks very much like the Schrödinger equation. Except it isn’t! Or, at least, the interpretation of this equation is very different: the field is a classical field with none of the probability interpretation of the wavefunction. We’ll come back to this point in Section 2.8.
The initial data required on a Cauchy surface differs for the two examples above. When , both and must be specified to determine the future evolution; however when , only and are needed.
We may derive Maxwell’s equations in the vacuum from the Lagrangian,
(1.20) |
Notice the funny minus signs! This is to ensure that the kinetic terms for are positive using the Minkowski space metric (1.10), so . The Lagrangian (1.20) has no kinetic term for . We will see the consequences of this in Section 6. To see that Maxwell’s equations indeed follow from (1.20), we compute
(1.21) |
from which we may derive the equations of motion,
(1.22) |
where the field strength is defined by . You can check using (1.4) that this reproduces the remaining two Maxwell’s equations in a vacuum: and . Using the notation of the field strength, we may rewrite the Maxwell Lagrangian (up to an integration by parts) in the compact form
(1.23) |
In each of the examples above, the Lagrangian is local. This means that there are no terms in the Lagrangian coupling directly to with . For example, there are no terms that look like
(1.24) |
A priori, there’s no reason for this. After all, is merely a label, and we’re quite happy to couple other labels together (for example, the term in the Maxwell Lagrangian couples the field to the field). But the closest we get for the label is a coupling between and through the gradient term . This property of locality is, as far as we know, a key feature of all theories of Nature. Indeed, one of the main reasons for introducing field theories in classical physics is to implement locality. In this course, we will only consider local Lagrangians.
The laws of Nature are relativistic, and one of the main motivations to develop quantum field theory is to reconcile quantum mechanics with special relativity. To this end, we want to construct field theories in which space and time are placed on an equal footing and the theory is invariant under Lorentz transformations,
(1.25) |
where satisfies
(1.26) |
For example, a rotation by about the -axis, and a boost by along the -axis are respectively described by the Lorentz transformations
(1.27) |
with . The Lorentz transformations form a Lie group under matrix multiplication. You’ll learn more about this in the “Symmetries and Particle Physics” course.
The Lorentz transformations have a representation on the fields. The simplest example is the scalar field which, under the Lorentz transformation , transforms as
(1.28) |
The inverse appears in the argument because we are dealing with an active transformation in which the field is truly shifted. To see why this means that the inverse appears, it will suffice to consider a non-relativistic example such as a temperature field. Suppose we start with an initial field which has a hotspot at, say, . After a rotation about the -axis, the new field will have the hotspot at . If we want to express in terms of the old field , we need to place ourselves at and ask what the old field looked like where we’ve come from at . This is the origin of the inverse transformation. (If we were instead dealing with a passive transformation in which we relabel our choice of coordinates, we would have instead ).
The definition of a Lorentz invariant theory is that if solves the equations of motion then also solves the equations of motion. We can ensure that this property holds by requiring that the action is Lorentz invariant. Let’s look at our examples:
For a real scalar field we have . The derivative of the scalar field transforms as a vector, meaning
where . This means that the derivative terms in the Lagrangian density transform as
(1.29) | |||||
The potential terms transform in the same way, with . Putting this all together, we find that the action is indeed invariant under Lorentz transformations,
(1.30) |
where, in the last step, we need the fact that we don’t pick up a Jacobian factor when we change integration variables from to . This follows because . (At least for Lorentz transformation connected to the identity which, for now, is all we deal with).
In the first-order Lagrangian (1.17), space and time are not on the same footing. ( is linear in time derivatives, but quadratic in spatial derivatives). The theory is not Lorentz invariant.
In practice, it’s easy to see if the action is Lorentz invariant: just make sure all the Lorentz indices are contracted with Lorentz invariant objects, such as the metric . Other Lorentz invariant objects you can use include the totally antisymmetric tensor and the matrices that we will introduce when we come to discuss spinors in Section 4.
Under a Lorentz transformation . You can check that Maxwell’s Lagrangian (1.23) is indeed invariant. Of course, historically electrodynamics was the first Lorentz invariant theory to be discovered: it was found even before the concept of Lorentz invariance.
The role of symmetries in field theory is possibly even more important than in particle mechanics. There are Lorentz symmetries, internal symmetries, gauge symmetries, supersymmetries…. We start here by recasting Noether’s theorem in a field theoretic framework.
Every continuous symmetry of the Lagrangian gives rise to a conserved current such that the equations of motion imply
(1.31) |
or, in other words, .
(1.32) |
which one can immediately see by taking the time derivative,
(1.33) |
assuming that sufficiently quickly as . However, the existence of a current is a much stronger statement than the existence of a conserved charge because it implies that charge is conserved locally. To see this, we can define the charge in a finite volume ,
(1.34) |
Repeating the analysis above, we find that
(1.35) |
where is the area bounding and we have used Stokes’
theorem. This equation means that any charge leaving must be
accounted for by a flow of the current 3-vector out of
the volume. This kind of local conservation of charge holds in any
local field theory.
Proof of Noether’s Theorem: We’ll prove the
theorem by working infinitesimally. We may always do this if we
have a continuous symmetry. We say that the transformation
(1.36) |
is a symmetry if the Lagrangian changes by a total derivative,
(1.37) |
for some set of functions . To derive Noether’s theorem, we first consider making an arbitrary transformation of the fields . Then
(1.38) | |||||
When the equations of motion are satisfied, the term in square brackets vanishes. So we’re left with
(1.39) |
But for the symmetry transformation , we have by definition . Equating this expression with (1.39) gives us the result
(1.40) |
Recall that in classical particle mechanics, invariance under spatial translations gives rise to the conservation of momentum, while invariance under time translations is responsible for the conservation of energy. We will now see something similar in field theories. Consider the infinitesimal translation
(1.41) |
(where the sign in the field transformation is plus, instead of minus, because we’re doing an active, as opposed to passive, transformation). Similarly, once we substitute a specific field configuration into the Lagrangian, the Lagrangian itself also transforms as
(1.42) |
Since the change in the Lagrangian is a total derivative, we may invoke Noether’s theorem which gives us four conserved currents , one for each of the translations with ,
(1.43) |
is called the energy-momentum tensor. It satisfies
(1.44) |
The four conserved quantities are given by
(1.45) |
where is the total energy of the field configuration, while is the total momentum of the field configuration.
Consider the simplest scalar field theory with Lagrangian (1.1.1). From the above discussion, we can compute
(1.46) |
One can verify using the equation of motion for that this expression indeed satisfies . For this example, the conserved energy and momentum are given by
(1.47) | |||||
(1.48) |
Notice that for this example, came out symmetric, so that . This won’t always be the case. Nevertheless, there is typically a way to massage the energy momentum tensor of any theory into a symmetric form by adding an extra term
(1.49) |
where is some function of the fields that is anti-symmetric in the first two indices so . This guarantees that so that the new energy-momentum tensor is also a conserved current.
One reason that you may want a symmetric energy-momentum tensor is to make contact with general relativity: such an object sits on the right-hand side of Einstein’s field equations. In fact this observation provides a quick and easy way to determine a symmetric energy-momentum tensor. Firstly consider coupling the theory to a curved background spacetime, introducing an arbitrary metric in place of , and replacing the kinetic terms with suitable covariant derivatives using “minimal coupling”. Then a symmetric energy momentum tensor in the flat space theory is given by
(1.50) |
It should be noted however that this trick requires a little more care when working with spinors.
In classical particle mechanics, rotational invariance gave rise to conservation of angular momentum. What is the analogy in field theory? Moreover, we now have further Lorentz transformations, namely boosts. What conserved quantity do they correspond to? To answer these questions, we first need the infinitesimal form of the Lorentz transformations
(1.51) |
where is infinitesimal. The condition (1.26) for to be a Lorentz transformation becomes
(1.52) | |||||
So the infinitesimal form of the Lorentz transformation must be an anti-symmetric matrix. As a check, the number of different anti-symmetric matrices is , which agrees with the number of different Lorentz transformations (3 rotations + 3 boosts). Now the transformation on a scalar field is given by
(1.53) | |||||
from which we see that
(1.54) |
By the same argument, the Lagrangian density transforms as
(1.55) |
where the last equality follows because due to anti-symmetry. Once again, the Lagrangian changes by a total derivative so we may apply Noether’s theorem (now with ) to find the conserved current
(1.56) | |||||
Unlike in the previous example, I’ve left the infinitesimal choice of in the expression for this current. But really, we should strip it out to give six different currents, i.e. one for each choice of . We can write them as
(1.57) |
which satisfy and give rise to 6 conserved charges. For , the Lorentz transformation is a rotation and the three conserved charges give the total angular momentum of the field.
(1.58) |
But what about the boosts? In this case, the conserved charges are
(1.59) |
The fact that these are conserved tells us that
(1.60) | |||||
But we know that is conserved, so , leaving us with the following consequence of invariance under boosts:
(1.61) |
This is the statement that the center of energy of the field travels with a constant velocity. It’s kind of like a field theoretic version of Newton’s first law but, rather surprisingly, appearing here as a conservation law.
The above two examples involved transformations of spacetime, as well as transformations of the field. An internal symmetry is one that only involves a transformation of the fields and acts the same at every point in spacetime. The simplest example occurs for a complex scalar field . We can build a real Lagrangian by
(1.62) |
where the potential is a general polynomial in . To find the equations of motion, we could expand in terms of and and work as before. However, it’s easier (and equivalent) to treat and as independent variables and vary the action with respect to both of them. For example, varying with respect to leads to the equation of motion
(1.63) |
The Lagrangian has a continuous symmetry which rotates and or, equivalently, rotates the phase of :
(1.64) |
where the latter equation holds with infinitesimal. The Lagrangian remains invariant under this change: . The associated conserved current is
(1.65) |
We will later see that the conserved charges arising from currents of this type have the interpretation of electric charge or particle number (for example, baryon or lepton number).
Consider a theory involving scalar fields , all with the same mass and the Lagrangian
(1.66) |
In this case the Lagrangian is invariant under the non-Abelian symmetry group . (Actually in this case). One can construct theories from complex fields in a similar manner that are invariant under an symmetry group. Non-Abelian symmetries of this type are often referred to as global symmetries to distinguish them from the “local gauge” symmetries that you will meet later. Isospin is an example of such a symmetry, albeit realized only approximately in Nature.
There is a quick method to determine the conserved current associated to an internal symmetry for which the Lagrangian is invariant. Here, is a constant real number. (We may generalize the discussion easily to a non-Abelian internal symmetry for which becomes a matrix). Now consider performing the transformation but where depends on spacetime: . The action is no longer invariant. However, the change must be of the form
(1.67) |
since we know that when is constant. The change in the action is therefore
(1.68) |
which means that when the equations of motion are satisfied (so for all variations, including ) we have
(1.69) |
We see that we can identify the function as the conserved current. This way of viewing things emphasizes that it is the derivative terms, not the potential terms, in the action that contribute to the current. (The potential terms are invariant even when ).
The link between the Lagrangian formalism and the quantum theory goes via the path integral. In this course we will not discuss path integral methods, and focus instead on canonical quantization. For this we need the Hamiltonian formalism of field theory. We start by defining the momentum conjugate to ,
(1.70) |
The conjugate momentum is a function of , just like the field itself. It is not to be confused with the total momentum defined in (1.45) which is a single number characterizing the whole field configuration. The Hamiltonian density is given by
(1.71) |
where, as in classical mechanics, we eliminate in favour of everywhere in . The Hamiltonian is then simply
(1.72) |
For the Lagrangian
(1.73) |
the momentum is given by , which gives us the Hamiltonian,
(1.74) |
Notice that the Hamiltonian agrees with the definition of the total energy (1.47) that we get from applying Noether’s theorem for time translation invariance.
In the Lagrangian formalism, Lorentz invariance is clear for all to see since the action is invariant under Lorentz transformations. In contrast, the Hamiltonian formalism is not manifestly Lorentz invariant: we have picked a preferred time. For example, the equations of motion for arise from Hamilton’s equations,
(1.75) |
which, unlike the Euler-Lagrange equations (1.8), do not look Lorentz invariant. Nevertheless, even though the Hamiltonian framework doesn’t look Lorentz invariant, the physics must remain unchanged. If we start from a relativistic theory, all final answers must be Lorentz invariant even if it’s not manifest at intermediate steps. We will pause at several points along the quantum route to check that this is indeed the case.