We would now like to quantize the Dirac Lagrangian,
(5.439) |
We will proceed naively and treat as we did the scalar field. But we’ll see that things go wrong and we will have to reconsider how to quantize this theory.
We start in the usual way and define the momentum,
(5.440) |
For the Dirac Lagrangian, the momentum conjugate to is . It does not involve the time derivative of . This is as it should be for an equation of motion that is first order in time, rather than second order. This is because we need only specify and on an initial time slice to determine the full evolution.
To quantize the theory, we promote the field and its momentum to operators, satisfying the canonical commutation relations, which read
(5.441) |
It’s this step that we’ll soon have to reconsider.
Since we’re dealing with a free theory, where any classical solution is a sum of plane waves, we may write the quantum operators as
(5.442) |
where the operators create particles
associated to the spinors , while
create particles associated to . As with the scalars,
the commutation relations of the fields imply commutation
relations for the annihilation and creation operators
Claim: The field commutation relations (5.441) are
equivalent to
(5.443) |
with all other commutators vanishing.
Note the strange minus sign in the term. This means that we can’t define the ground state as something annihilated by , because then the excited states would have negative norm. To avoid this, we will have to flip the interpretation of and , with the vacuum defined by and the excited states by . This, as we will see, will be our undoing.
Proof: Let’s show that the and
commutators reproduce the field commutators
(5.441),
(5.444) | |||||
At this stage we use the outer product formulae (4.435) and (4.436) which tell us and , so that
where, in the second term, we’ve changed under the integration sign. Now, using we have
(5.445) |
as promised. Notice that it’s a little tricky in the middle there, making sure that the terms cancel. This was the reason we needed the minus sign in the commutator terms in (5.443).
To proceed, let’s construct the Hamiltonian for the theory. Using the momentum , we have
(5.446) |
which means that agrees with the conserved energy computed using Noether’s theorem (4.399). We now wish to turn the Hamiltonian into an operator. Let’s firstly look at
where, for once we’ve left the sum over implicit. There’s a small subtlety with the minus signs in deriving this equation that arises from the use of the Minkowski metric in contracting indices, so that . Now we use the defining equations for the spinors and given in (4.412) and (4.418), to replace
(5.447) |
so we can write
(5.448) |
We now use this to write the operator Hamiltonian
where, in the last two terms we have relabelled . We now use our inner product formulae (4.429), (4.431) and (4.434) which read
giving us
(5.449) | |||||
(5.450) |
The term is familiar and easily dealt with by normal ordering. The term is familiar and we can check that create positive energy states as expected,
The minus sign in front of the term should make us nervous. If we think of as creation operators then there’s no problem since, using the commutation relation (5.443), we still find that creates positive energy states,
However, as we noted after (5.443), these states have negative norm. To have a sensible Hilbert space, we need to interpret as the creation operator. But then the Hamiltonian is not bounded below because
This is a disaster. Taken seriously it would tell us that we could tumble to states of lower and lower energy by continually producing particles. As the English would say, it’s all gone a bit Pete Tong. (No relation).
Since the above calculation was a little tricky, you might think that it’s possible to rescue the theory to get the minus signs to work out right. You can play around with different things, but you’ll always find this minus sign cropping up somewhere. And, in fact, it’s telling us something important that we missed.
The key piece of physics that we missed is that spin particles are fermions, meaning that they obey Fermi-Dirac statistics with the quantum state picking up a minus sign upon the interchange of any two particles. This fact is embedded into the structure of relativistic quantum field theory: the spin-statistics theorem says that integer spin fields must be quantized as bosons, while half-integer spin fields must be quantized as fermions. Any attempt to do otherwise will lead to an inconsistency, such as the unbounded Hamiltonian we saw in (5.450).
So how do we go about quantizing a field as a fermion? Recall that when we quantized the scalar field, the resulting particles obeyed bosonic statistics because the creation and annihilation operators satisfied the commutation relations,
(5.451) |
To have states obeying fermionic statistics, we need anti-commutation relations, . Rather than (5.441), we will ask that the spinor fields satisfy
(5.452) |
We still have the expansion (5.442) of and in terms of and . But now the same proof that led us to (5.443) tells us that
(5.453) |
with all other anti-commutators vanishing,
(5.454) |
The calculation of the Hamiltonian proceeds as before, all the way through to the penultimate line (5.449). At that stage, we get
(5.455) | |||||
The anti-commutators have saved us from the indignity of an unbounded Hamiltonian. Note that when normal ordering the Hamiltonian we now throw away a negative contribution . In principle, this could partially cancel the positive contribution from bosonic fields. Cosmological constant problem anyone?!
Just as in the bosonic case, we define the vacuum to satisfy,
(5.456) |
Although and obey anti-commutation relations, the Hamiltonian (5.455) has nice commutation relations with them. You can check that
(5.457) |
This means that we can again construct a tower of energy eigenstates by acting on the vacuum by and to create particles and antiparticles, just as in the bosonic case. For example, we have the one-particle states
(5.458) |
The two particle states now satisfy
(5.459) |
confirming that the particles do indeed obey Fermi-Dirac statistics. In particular, we have the Pauli-Exclusion principle . Finally, if we wanted to be sure about the spin of the particle, we could act with the angular momentum operator (4.403) to confirm that a stationary particle does indeed carry intrinsic angular momentum 1/2 as expected.
“In this attempt, the success seems to have been on the side of Dirac rather than logic”
Pauli on Dirac
Let’s pause our discussion to make a small historical detour. Dirac originally viewed his equation as a relativistic version of the Schrödinger equation, with interpreted as the wavefunction for a single particle with spin. To reinforce this interpretation, he wrote as
(5.460) |
where and . Here the operator is interpreted as the one-particle Hamiltonian. This is a very different viewpoint from the one we now have, where is a classical field that should be quantized. In Dirac’s view, the Hamiltonian of the system is defined above, while for us the Hamiltonian is the field operator (5.455). Let’s see where Dirac’s viewpoint leads.
With the interpretation of as a single-particle wavefunction, the plane-wave solutions (4.411) and (4.417) to the Dirac equation are thought of as energy eigenstates, with
(5.461) |
which look like positive and negative energy solutions. The spectrum is once again unbounded below; there are states with arbitrarily low energy . At first glance this is disastrous, just like the unbounded field theory Hamiltonian (5.450). Dirac postulated an ingenious solution to this problem: since the electrons are fermions (a fact which is put in by hand to Dirac’s theory) they obey the Pauli-exclusion principle. So we could simply stipulate that in the true vacuum of the universe, all the negative energy states are filled. Only the positive energy states are accessible. These filled negative energy states are referred to as the Dirac sea. Although you might worry about the infinite negative charge of the vacuum, Dirac argued that only charge differences would be observable (a trick reminiscent of the normal ordering prescription we used for field operators).
Having avoided disaster by floating on an infinite sea comprised of occupied negative energy states, Dirac realized that his theory made a shocking prediction. Suppose that a negative energy state is excited to a positive energy state, leaving behind a hole. The hole would have all the properties of the electron, except it would carry positive charge. After flirting with the idea that it may be the proton, Dirac finally concluded that the hole is a new particle: the positron. Moreover, when a positron comes across an electron, the two can annihilate. Dirac had predicted anti-matter, one of the greatest achievements of theoretical physics. It took only a couple of years before the positron was discovered experimentally in 1932.
Although Dirac’s physical insight led him to the right answer, we now understand that the interpretation of the Dirac spinor as a single-particle wavefunction is not really correct. For example, Dirac’s argument for anti-matter relies crucially on the particles being fermions while, as we have seen already in this course, anti-particles exist for both fermions and bosons. What we really learn from Dirac’s analysis is that there is no consistent way to interpret the Dirac equation as describing a single particle. It is instead to be thought of as a classical field which has only positive energy solutions because the Hamiltonian (4.399) is positive definite. Quantization of this field then gives rise to both particle and anti-particle excitations.
This from Julian Schwinger:
“Until now, everyone thought that the Dirac equation referred directly to physical particles. Now, in field theory, we recognize that the equations refer to a sublevel. Experimentally we are concerned with particles, yet the old equations describe fields…. When you begin with field equations, you operate on a level where the particles are not there from the start. It is when you solve the field equations that you see the emergence of particles.”
Let’s now move to the Heisenberg picture. We define the spinors at every point in spacetime such that they satisfy the operator equation
(5.462) |
We solve this by the expansion
(5.463) |
Let’s now look at the anti-commutators of these fields. We define the fermionic propagator to be
(5.464) |
In what follows we will often drop the indices and simply write , but you should remember that is a matrix. Inserting the expansion (5.463), we have
(5.465) | |||||
where to reach the final line we have used the outer product formulae (4.435) and (4.436). We can then write
(5.466) |
in terms of the propagator for a real scalar field which, recall, can be written as (2.165)
(5.467) |
Some comments:
For spacelike separated points , we have already seen that . In the bosonic theory, we made a big deal of this since it ensured that
(5.468) |
outside the lightcone, which we trumpeted as proof that our theory was causal. However, for fermions we now have
(5.469) |
outside the lightcone. What happened to our precious causality? The best that we can say is that all our observables are bilinear in fermions, for example the Hamiltonian (5.455). These still commute outside the lightcone. The theory remains causal as long as fermionic operators are not observable. If you think this is a little weak, remember that no one has ever seen a physical measuring apparatus come back to minus itself when you rotate by degrees!
At least away from singularities, the propagator satisfies
(5.470) |
which follows from the fact that using the mass shell condition .
By a similar calculation to that above, we can determine the vacuum expectation value,
(5.471) |
We now define the Feynman propagator , which is again a matrix, as the time ordered product,
(5.472) |
Notice the minus sign! It is necessary for Lorentz invariance. When , there is no invariant way to determine whether or . In this case the minus sign is necessary to make the two definitions agree since outside the lightcone. We have the 4-momentum integral representation for the Feynman propagator,
(5.473) |
which satisfies , so that is a Green’s function for the Dirac operator.
The minus sign that we see in (5.472) also occurs for any string of operators inside a time ordered product . While bosonic operators commute inside , fermionic operators anti-commute. We have this same behaviour for normal ordered products as well, with fermionic operators obeying . With the understanding that all fermionic operators anti-commute inside and , Wick’s theorem proceeds just as in the bosonic case. We define the contraction
(5.474) |
The interaction between a Dirac fermion of mass and a real scalar field of mass is governed by the Yukawa theory,
(5.475) |
which is the proper version of the baby scalar Yukawa theory we looked at in Section 3. Couplings of this type appear in the standard model, between fermions and the Higgs boson. In that context, the fermions can be leptons (such as the electron) or quarks.
Yukawa originally proposed an interaction of this type as an effective theory of nuclear forces. With an eye to this, we will again refer to the particles as mesons, and the particles as nucleons. Except, this time, the nucleons have spin. (This is still not a particularly realistic theory of nucleon interactions, not least because we’re omitting isospin. Moreover, in Nature the relevant mesons are pions which are pseudoscalars, so a coupling of the form would be more appropriate. We’ll turn to this briefly in Section 5.7.3).
Note the dimensions of the various fields. We still have , but the kinetic terms require that . Thus, unlike in the case with only scalars, the coupling is dimensionless: .
We’ll proceed as we did in Section 3, firstly computing the amplitude of a particular scattering process then, with that calculation as a guide, writing down the Feynman rules for the theory. We start with:
Let’s study scattering. This is the same calculation we performed in Section (3.3.3) except now the fermions have spin. Our initial and final states are
(5.476) |
We need to be a little cautious about minus signs, because the ’s now anti-commute. In particular, we should be careful when we take the adjoint. We have
(5.477) |
We want to calculate the order terms from the S-matrix element .
(5.478) |
where, as usual, all fields are in the interaction picture. Just as in the bosonic calculation, the contribution to nucleon scattering comes from the contraction
(5.479) |
We just have to be careful about how the spinor indices are contracted. Let’s start by looking at how the fermionic operators act on . We expand out the fields, leaving the fields alone for now. We may ignore the pieces in since they give no contribution at order . We have
(5.480) | |||||
where we’ve used square brackets to show how the spinor indices are contracted. The minus sign that sits out front came from moving past . Now anti-commuting the ’s past the ’s, we get
(5.481) | |||||
Note, in particular, the relative minus sign that appears between these two terms. Now let’s see what happens when we hit this with . We look at
The term in (5.481) doubles up with this, cancelling the factor of in front of (5.478). Meanwhile, the terms cancel the relativistic state normalization. Putting everything together, we have the following expression for
where we’ve put the propagator back in. Performing the integrals over and , this becomes,
And we’re almost there! Finally, writing the S-matrix element in terms of the amplitude in the usual way, , we have
which is our final answer for the amplitude.
It’s important to bear in mind that the calculation we just did kind of blows. Thankfully the Feynman rules will once again encapsulate the combinatoric complexities and make life easier for us. The rules to compute amplitudes are the following
To each incoming fermion with momentum and spin , we associate a spinor . For outgoing fermions we associate .
![]() |
![]() |
To each incoming anti-fermion with momentum and spin , we associate a spinor . For outgoing anti-fermions we associate .
![]() |
![]() |
Each vertex gets a factor of .
Each internal line gets a factor of the relevant propagator.
(5.482) |
The arrows on the fermion lines must flow consistently through the diagram (this ensures fermion number conservation). Note that the fermionic propagator is a matrix. The matrix indices are contracted at each vertex, either with further propagators, or with external spinors , , or .
Impose momentum conservation at each vertex, and integrate over undetermined loop momenta.
Add extra minus signs for statistics. Some examples will be given below.
Let’s run through the same examples we did for the scalar Yukawa theory. Firstly, we have
For the example we worked out previously, the two lowest order Feynman diagrams are shown in Figure 25.
We’ve drawn the second Feynman diagram with the legs crossed to emphasize the fact that it picks up a minus sign due to statistics. (Note that the way the legs point in the Feynman diagram doesn’t tell us the direction in which the particles leave the scattering event: the momentum label does that. The two diagrams above are different because the incoming legs are attached to different outgoing legs). Using the Feynman rules we can read off the amplitude.
(5.483) |
The denominators in each term are due to the meson propagator, with the momentum determined by conservation at each vertex. This agrees with the amplitude we computed earlier using Wick’s theorem.
Let’s now look at . The two lowest order Feynman diagrams are shown in Figure 26. Applying the Feynman rules, we have
Since the internal line is now a fermion, the propagator contains factors. This is a matrix which sits on the top, sandwiched between the two external spinors. Now the exchange statistics applies to the final meson states. These are bosons and, correspondingly, there is no relative minus sign between the two diagrams.
For , the two lowest order Feynman diagrams are of two distinct types, just like in the bosonic case. They are shown in Figure 27.
The corresponding amplitude is given by,
(5.484) |
As in the bosonic diagrams, there is again the difference in the momentum dependence in the denominator. But now the difference in the diagrams is also reflected in the spinor contractions in the numerator.
More subtle are the minus signs. The fermionic statistics mean that the first diagram has an extra minus sign relative to the scattering of Figure 25. Since this minus sign will be important when we come to figure out whether the Yukawa force is attractive or repulsive, let’s go back to basics and see where it comes from. The initial and final states for this scattering process are
(5.485) |
The ordering of and in these states is crucial and reflects the scattering , as opposed to which would differ by a minus sign. The first diagram in Figure 27 comes from the term in the perturbative expansion,
where we’ve neglected a bunch of objects in this equation like and exponential factors because we only want to keep track of the minus signs. Moving the annihilation operators past the creation operators, we have
(5.486) |
Repeating the process by expanding out the and fields and moving them to the left to annihilate , we have
where the minus sign has appeared from anti-commuting past . This is the overall minus sign found in (5.484). One can also follow similar contractions to compute the second diagram in Figure 27.
Finally, we can also compute the scattering of which, as in the bosonic case, picks up its leading contribution at one-loop. The amplitude for the diagram shown in the figure is
Notice that the high momentum limit of the integral is , which is no longer finite, but diverges logarithmically. You will have to wait until next term to make sense of this integral.
There’s an overall minus sign sitting in front of this amplitude. This is a generic feature of diagrams with fermions running in loops: each fermionic loop in a diagram gives rise to an extra minus sign. We can see this rather simply in the diagram
which involves the expression
After passing the fermion fields through each other, a minus sign appears, sitting in front of the two propagators.
We saw in Section 3.5.2, that the exchange of a real scalar particle gives rise to a universally attractive Yukawa potential between two spin zero particles. Does the same hold for the spin particles?
Recall that the strategy to compute the potential is to take the non-relativistic limit of the scattering amplitude, and compare with the analogous result from quantum mechanics. Our new amplitude now also includes the spinor degrees of freedom and . In the non-relativistic limit, , and
(5.487) |
In this limit, the spinor contractions in the amplitude for scattering (5.483) become and the amplitude is
(5.488) |
The symbols tell us that spin is conserved in the non-relativistic limit, while the momentum dependence is the same as in the bosonic case, telling us that once again the particles feel an attractive Yukawa potential,
(5.489) |
Repeating the calculation for , there are two minus signs which cancel each other. The first is the extra overall minus sign in the scattering amplitude (5.484), due to the fermionic nature of the particles. The second minus sign comes from the non-relativistic limit of the spinor contraction for anti-particles in (5.484), which is . These two signs cancel, giving us once again an attractive Yukawa potential (5.489).
Rather than the standard Yukawa coupling, we could instead consider
(5.490) |
This still preserves parity if is a pseudoscalar, i.e.
(5.491) |
We can compute in this theory very simply: the Feynman rule for the interaction vertex is now changed to a factor of . For example, the Feynman diagrams for scattering are again given by Figure 25, with the amplitude now
We could again try to take the non-relativistic limit for this amplitude. But this time, things work a little differently. Using the expressions for the spinors (5.487), we have in the non-relativistic limit. To find the non-relativistic amplitude, we must go to next to leading order. One can easily check that . So, in the non-relativistic limit, the leading order amplitude arising from pseudoscalar exchange is given by a spin-spin coupling,
(5.492) |