The equation also implied the existence of a new form of matter,antimatter, previously unsuspected and unobserved. The existence of antimatter was experimentally confirmed several years later. It also provided atheoretical justification for the introduction of several component wave functions inPauli'sphenomenological theory ofspin. The wave functions in the Dirac theory are vectors of fourcomplex numbers (known asbispinors), two of which resemblethe Pauli wavefunction in the non-relativistic limit, in contrast to theSchrödinger equation, which described wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to theWeyl equation. In the context ofquantum field theory, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-1/2 particles.
Dirac did not fully appreciate the importance of his results; however, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity—and the eventual discovery of thepositron—represents one of the great triumphs oftheoretical physics. This accomplishment has been described as fully on par with the works ofIsaac Newton,James Clerk Maxwell, andAlbert Einstein before him.[4] The equation has been deemed by some physicists to be "the real seed of modern physics".[5] The Dirac equation has been described as the "centerpiece of relativistic quantum mechanics", with it also stated that "the equation is perhaps the most important one in all of quantum mechanics".[6]
The first phase in the development ofquantum mechanics, lasting between 1900 and 1925, focused on explaining individual phenomena that could not be explained throughclassical mechanics.[7] The second phase, starting in the mid-1920s, saw the development of two systematic frameworks governing quantum mechanics. The first, known asmatrix mechanics, usesmatrices to describephysical observables; it was developed in 1925 byWerner Heisenberg,Max Born, andPascual Jordan.[8]: 51 The second, known as wave mechanics, uses awave equation known as theSchrödinger equation to describe thestate of asystem; it was developed the next year byErwin Schrödinger. While these two frameworks were initially seen as competing approaches, they would later be shown to be equivalent.[9]: 21
Both these frameworks only formulated quantum mechanics in anon-relativistic setting.[9]: 22 This was seen as a deficiency right from the start, with Schrödinger originally attempting to formulate arelativistic version of Schrödinger equation, in the process discovering theKlein–Gordon equation.[8]: 679 However, after showing that thisequation did not correctly reproduce the relativistic corrections to thehydrogen atomspectrum for which an exact form was known due toArnold Sommerfeld, he abandoned hisrelativistic formulation.[10]: 1025 The Klein–Gordon equation was also found by at least six other authors in the same year.[11]: 6
During 1926 and 1927 there was a widespread effort to incorporate relativity into quantum mechanics, largely through two approaches. The first was to consider the Klein–Gordon as the correct relativistic generalization of the Schrödinger equation.[11]: 7 Such an approach was viewed unfavourably by many leadingtheorists since it failed to correctly predict numerousexperimental results, and more importantly it appeared difficult to reconcile with the principles of quantum mechanics as understood at the time.[12]: 35 These conceptual issues primarily arose due to the presence of asecond temporal derivative.[10]: 1031
The second approach introduced relativistic effects as corrections to the known non-relativistic formulas.[12]: 36 This provided many provisional answers that were expected to eventually be supplanted by some yet-unknown relativistic formulation of quantum mechanics. One notable result by Heisenberg and Jordan was the introduction of two terms forspin and relativity into the hydrogenHamiltonian, allowing them to derive thefirst-order approximation of the Sommerfeldfine structure formula.[13]
A parallel development during this time was the concept of spin, first introduced in 1925 bySamuel Goudsmit andGeorge Uhlenbeck.[14] Shortly after, it was conjectured by Schrödinger to be the missing link in acquiring the correct Sommerfeld formula.[12]: 44 In 1927Wolfgang Pauli used the ideas of spin to find aneffective theory for a nonrelativisticspin-1/2particle, thePauli equation.[15] He did this by taking the Schrödinger equation and rather than just assuming that the wave function depends on thephysical coordinate, he also assume that it depends on a spin coordinate that can take only two values. While this was still a non-relativistic formulation, he believed that a fully relativistic formulation possibly required a more complicated model for theelectron, one that moved beyond apoint particle.[12]: 47
By 1927, many physicists no longer considered the fine structure of hydrogen as a crucial puzzle that called for a completely new relativistic formulation since it could effectively be solved using the Pauli equation or by introducing a spin-1/2angular momentumquantum number in the Klein–Gordon equation.[12]: 51 At the fifthSolvay Conference held that year,Paul Dirac was primarily concerned with the logical development of quantum mechanics. However, he realized that many other physicists complacently accepted the Klein–Gordon equation as a satisfactory relativistic formulation, which demanded abandoning basic principles of quantum mechanics as understood at the time, to which Dirac strongly objected.[12]: 52 After his return fromBrussels, Dirac focused on finding a relativistic theory for electrons. Within two months he solved the problem and published his results on January 2, 1928.[1]
In hispaper, Dirac was guided by two principles fromtransformation theory, the first being that the equation should beinvariant under transformations of special relativity, and the second that it should transform under the transformation theory of quantum mechanics.[11]: 9 The latter demanded that the equation would have to belinear intemporal derivatives, so that it would admit aprobabilistic interpretation. His argument begins with the Klein–Gordon equation[16]: 32
describing a particle using the wave function. Here is the square of themomentum, is therest mass of the particle, is thespeed of light, and is thereduced Planck constant. The naive way to get an equation linear in the time derivative is to essentially consider thesquare root of both sides. This replaces with. However, such asquare root is mathematically problematic for the resulting theory, making it unfeasible.[9]: 379
Dirac's first insight was the concept of linearization. He looked for some sort ofvariables that are independent of momentum and spacetime coordinates for which the square root could be rewritten in a linear form[9]: 379
By squaring thisoperator and demanding that it reduces to the Klein–Gordon equation, Dirac found that the variables must satisfy and if. Dirac initially considered thePauli matrices as a candidate, but then showed these would not work since it is impossible to find a set of four matrices that allanticommute with each other.[8]: 687 His second insight was to instead considerfour-dimensional matrices. In that case the equation would be acting on afour-component wavefunction.[17]: 235 Such a proposal was much more bold than Pauli's original generalization to a two-component wavefunction in the Pauli equation.[12]: 57 This is because in Pauli's case, this was motivated by the demand to encode the two spin states of the particle. In contrast, Dirac had no physical argument for a four-component wavefunction, but instead introduced it as a matter of mathematical necessity. He thus arrived at the Dirac equation[1]
Dirac constructed the correct matrices without realizing that they form a mathematical structure known of since the early 1880s, theClifford algebra.[18]: 320 By recasting the equation in aLorentz invariant form, he also showed that it correctly combines special relativity with his principle of quantum mechanical transformation theory, making it a viable candidate for a relativistic theory of the electron.[8]: 685
To investigate the equation further, he examined how it behaves in the presence of anelectromagnetic field.[12]: 58 To his surprise, this showed that it described a particle with amagnetic moment arising due to the particle having spin1/2. Spin directly emerged from the equation, without Dirac having added it in by hand. Additionally, he focused on showing that the equation successfully reproduces the fine structure of the hydrogen atom, at least to first order. The equation therefore succeeds where all previous attempts have failed, in correctly describing relativistic phenomena of electrons fromfirst principles rather than through the ad hoc modification of existing formulas.[12]: 60
Except for his followup paper[19] deriving theZeeman effect and Paschen–Back effect from the equation in the presences of amagnetic fields, Dirac left the work of examining the consequences of his equation to others, and only came back to the subject in 1930.[12]: 65 Once the equation was published, it was recognized as the correct solution to the problem of spin, relativity, and quantum mechanics. At first the Dirac equation was considered the only valid relativistic equation for a particle withmass. Then in 1934 Pauli andVictor Weisskopf reinterpreted the Klein–Gordon equation as the equation for a relativisticspinless particle.[8]: 685
One of the first calculations was to reproduce the Sommerfeld fine structure formula exactly, which was performed independently byCharles Galton Darwin andWalter Gordon in 1928.[20][21][9]: 377 This is the first time that the full formula has been derived from first principles. Further work on the mathematics of the equation was undertaken byHermann Weyl in 1929.[22] In this work he showed that themassless Dirac equation can be decomposed into a pair ofWeyl equations.
A problem that gained more focus with time was the presence ofnegative energystates in the Dirac equation, which led to many efforts to try to eliminate such states. Dirac initially simply rejected the negative energy states as unphysical,[12]: 63 but the problem was made more clear when in 1929Oskar Klein showed that in static fields there exists inevitable mixing between the negative and positive energy states.[9]: 378 Dirac's initial response was to believe that his equation must have some sort of defect, and that it was only the first approximation of a future theory that would not have this problem.[9]: 378 However, he then suggested a solution to the problem in the form of theDirac sea. This is the idea that the universe is filled with an infinite sea of negative energy electrons states. Positive energy electron states then live in this sea and are prevented from decaying to the negative energy states through thePauli exclusion principle.[8]: 715
Additionally, Dirac postulated the existence of positively chargedholes in the Dirac sea, which he initially suggested could be theproton. However, Oppenheimer showed that in this case stableatoms could not exist[27] and Weyl further showed that the holes would have to have the same mass as the electrons.[28] Persuaded by Oppenheimer's and Weyl's argument, Dirac published a paper in 1931 that predicted the existence of an as-yet-unobserved particle that he called an "anti-electron" that would have the same mass and the opposite charge as an electron and that would mutually annihilate upon contact with an electron. He suggested that every particle may have an oppositely charged partner, a concept now calledantimatter[29][30]: 47
Significant work was done over the following decades to try to findspectroscopic discrepancies compared to the predictions made by the Dirac equation, however it was not until 1947 thatLamb shift was discovered, which the equation does not predict.[8]: 710 This led to the development ofquantum electrodynamics in 1950s, with the Dirac equation then being incorporated within the context of quantum field theory.[36]: 35 Since it describes the dynamics of Dirac spinors, it went on to play a fundamental role in theStandard Model as well as many other areas of physics. For example, within condensed matter physics, systems whose fermions have a near lineardispersion relation are described by the Dirac equation. Such systems are known asDirac matter and they includegraphene andtopological insulators, which have become a major area of research since the start of the 21st century.[35]: 522
The Dirac equation is inscribed upon a plaque on the floor ofWestminster Abbey. Unveiled on 13 November 1995, the plaque commemorates Dirac's life.[37] The equation, in itsnatural units formulation, is also prominently displayed in the auditorium at the ‘Paul A.M. Dirac’ Lecture Hall at the Patrick M.S. Blackett Institute (formerly The San Domenico Monastery) of theEttore Majorana Foundation and Centre for Scientific Culture inErice,Sicily.
where is theanticommutator, is the Minkowskimetric in a mainlynegative signature,[nb 2] and is theidentity matrix. The Dirac algebra is a special case of the more general mathematical structure known as aClifford algebra.[16]: 169 The Dirac algebra can also be seen as the real part of thespacetime algebra. There is no unique choice of matrices for the gamma matrices, with different choices known as differentrepresentations of the algebra.[32]: 323 One common choice, originally discovered by Dirac, is known as the Dirac representation. Here the matrices are given by[35]: 531
where are the threePauli matrices for. There are two other common representations for the gamma matrices. The first is thechiral representation, which is useful when decomposing the Dirac equation into a pair ofWeyl equations.[36]: 41 The second is the Majorana representation, for which all gamma matrices areimaginary, so theDirac operator ispurely real.[16]: 344 This representation is useful for studyingMajorana spinors, which are purely real four-componentspinor solutions of the Dirac equation.[35]: 536
The adjoint spinor is useful in forming Lorentz invariant quantities. For example, thebilinear is not Lorentz invariant, but is.[35]: 532 Here is shorthand notation for apartial derivative acting on the left. In regular notation, the adjoint Dirac equation is equivalent to[36]: 43
The Dirac equation can be rewritten in a non-covariant form similar to that of theSchrödinger equation[8]: 686
The right-hand side of the equation is theHamiltonian acting on the Dirac spinor. Here and are a set of fourHermitian matrices that all anticommute with each other and square to the identity. They are related to the gamma matrices through and.[9]: 375 This form is useful inquantum mechanics, where the Hamiltonian can be easily modified to solve a wide range of problems, such as by introducing apotential[8]: 702 or through aminimal coupling to theelectromagnetic field.[8]: 688
The Dirac equation can also be acquired from aLagrangian formulation of the field theory, where the Diracaction is given by[35]: 535
The equation then arises as theEuler–Lagrange equation of this action, found by varying the spinor.[36]: 43 Meanwhile, the adjoint Dirac equation is acquired by varying the adjoint spinor. The action formulation of the Dirac equation has the advantage of making thesymmetries of the Dirac equation more explicit, since they leave its action invariant.[16]: 166–168 Noether's theorem then allows for the direct calculation ofcurrents corresponding to these symmetries.[40]: 166 Additionally, the action is usually used to define the associatedquantum field theory, such as through thepath integral formulation.[16]: 251
In quantum mechanics, the Dirac spinor corresponds to a four-component spinorwave function describing thestate of aDirac fermion.[8]: 685 Its positionprobability density, the probability of finding the fermion in a region of space, is described by the zeroth component of its vector current,.[8]: 689 In the case of a large number ofparticles, it can also be interpreted as thecharge density.[18]: 137 An appropriatenormalization is required to ensure that the total probability across all of space is equal to one, withprobability conservation following directly from theconservation of the vector current. The Dirac equation is therelativistic analogue of the Schrödinger equation for the Dirac fermion wavefunction.
In thesecond quantization form of quantum field theory the Dirac spinor isquantized to be a operator-valued spinor field.[41] In contrast to quantum mechanics, it no longer represents the state in theHilbert space, but is rather theoperator that acts on states to create or destroy particles.[16]: 22 [42]: 92 Observables are formed usingexpectation values of these operators.[43]: 41 The Dirac equation then becomes an operator equation describing the state-independent evolution of the operator-valued spinor field[44]: 35
In thepath integral formulation of quantum field theory, the spinor field is an anti-commutingGrassmann-valued field that only acts as an integration variable.[38]: 29 The Dirac equation then emerges as the classicalsaddle point behaviour of the path integral. It also arises as an equation of the expectation value of the classical field variables[44]: 35
in the sense of theSchwinger–Dyson equations.[16]: 80 This version of the equation can also be acquired by taking the expectation value of the operator equation.
The Dirac equation also arises in describing thetime evolution of a spinor field inclassical field theory. Such a field theory would have thespecial linear group as itsspacetime symmetry group rather than the Lorentz group, since the latter does not admitspinor representations.[40]: 117 This is in contrast to the quantum theory which does admit spinor representations even when the spacetime symmetries are described by the Lorentz group. This is because the states in a Hilbert space are defined only up to a complex phase,[45]: 81 so particles belong toprojective representations rather than regular representations, with the projective representations of the Lorentz group being equivalent to regular representations of.[16]: 176 Classical spinor fields do not arise in our universe because thePauli exclusion principle prevents populating the field with a sufficient number of particles to reach theclassical limit.[40]: 347
Lie group elements can be generated using the correspondingLie algebra, which together with aLie bracket, describes thetangent space of the groupmanifold around its identity element.[50]: 64 Thebasis elements of this vector space are known asgenerators of the group. A particular group element is then acquired byexponentiating a corresponding tangent space vector.[51]: 295 The generators of the Lorentz Lie algebra must satisfy certainanticommutation relations, known as a Lie bracket. The six vectors can be packaged into an antisymmetric object indexed by, with the bracket for the Lorentz algebra given by[36]: 39
This algebra admits numerous representation, where each generator is represented by a matrix, with eachalgebra representation generating a correspondingrepresentation of the group.[51]: 662 For example, the representation acting on real vectors is given by the six matrices where[51]: 664
The Lorentz transformation matrix can then be acquired from these generators through an exponentiation[16]: 169
where is an antisymmetric matrix encoding the sixdegrees of freedom of the Lorentz group used to specify the particular group element. These correspond to the three boosts and threerotations.[36]: 40
Another representation for the Lorentz algebra is thespinor representation where the generators are given by[16]: 169
In this case the Lorentz group element, specified by, is given by
The mapping of is not one-to-one since there are two consistent choices for that give the same but a different.[50]: 45 This is a consequence of the spinor representation beingprojective representations of the Lorentz group.[45]: 81 Equivalently, they are regular representation of, which is adouble cover of the Lorentz group.[40]: 106
Under a Lorentz transformation, spacetime coordinates transform under the vector representation, while the spinors transform under the spinor representation
The Dirac equation is a Lorentz covariant equation, meaning that it takes the form in all inertial reference frames.[48]: 5 That is, it takes the same form when for a spinor withcoordinates, as well as for a Lorentz transformed spinor in the Lorentz transformed coordinates
where is thefour-gradient for the new coordinates.[nb 5] Meanwhile, the Dirac action is Lorentz invariant, meaning that it is the same in all reference frames.[16]: 166–168
This symmetry is known as the vector symmetry because its current transforms as avector under Lorentz transformations. Promoting this symmetry to agauge symmetry gives rise toquantum electrodynamics.
In themassless limit, the Dirac equation has a second inequivalent symmetry known as theaxial symmetry, which acts on the spinors as[52]: 99
where is the chiral matrix. This arises because in the massless limit the Dirac equation reduces to a pair ofWeyl equations.[16]: 168 Each of these is invariant under a phase symmetry. These two symmetries can then be grouped into the vector symmetry where both Weyl spinors transform by the same phase, and the axial symmetry where they transform under the opposite signs of the phase.[16]: 621 The current corresponding to the axial symmetry is given by[35]: 532
This transforms as apseudovector, meaning that its spatial part is odd under parity transformations.[49]: 469 Classically, the axial symmetry admits a well-formulated gauge theory, but at thequantum level it has achiral anomaly that provides an obstruction towards gauging.[16]: 616
where the last term is the Dirac Lagrangian, which vanisheson shell. Invariance under Lorentz transformations meanwhile yields a set of currents indexed by and, given by[52]: 99
where are the spinor representation generators of the Lorentz Lie algebra, used to define how spinors transform under Lorentz transformations.
Acting on the Dirac equation with the operator gives rise to theKlein–Gordon equation for each component of the spinor[53]: 93
As a result, any solution to the Dirac equation is also automatically a solution to the Klein–Gordon equation.[40]: 349 Its solutions can therefore be written as alinear combination ofplane waves.[54]: 59
The Dirac equation admits positivefrequency plane wave solutions[54]: 59
with a positive energy given by. It also admitsnegative frequency solutions taking the same form except with.[16]: 188 It is more convenient to rewritten these negative frequency solutions by flipping the sign of the momentum to ensure that they have a positiveenergy and so take the form[36]: 48
At the classical level these are positive and negative frequency solutions to a classical wave equation, but in the quantum theory they correspond to operators creating particles with spinorpolarization orannihilatingantiparticles with spinor polarization.[16]: 188 Both these spinor polarizations satisfy themomentum space Dirac equation[54]: 59
Since these are simple matrix equations, they can be solved directly once an explicit representation for the gamma matrices is chosen. In the chiral representation the general solution is given by[16]: 190
where and are arbitrary complex 2-vectors, describing the two spin degrees of freedom for the particle and two for the antiparticle.[36]: 45 In the massless limit, these spin states correspond to the possiblehelicity states that the massless fermions can have, either beingleft-handed or right-handed.[16]: 190
While the standard Dirac equation was originally derived in a dimensionalspacetime, it can be directly generalized to arbitrarydimension andmetric signatures, where it takes the same covariant form.[55]: 51 The crucial difference is that thegamma matrices must be changed to gamma matrices of theClifford algebra appropriate in those dimensions and metric signature, with the size of theDirac spinor corresponding to the dimensionality of the gamma matrices. While the Dirac equation always exists, since every dimension admits Dirac spinors, the properties of these spinors and their relation to otherspinor representations differs significantly across dimensions.[38]: 59 Other differences include the absence of achirality matrix in odd dimensions.[35]: 534
Ageometric reformulation of the Dirac equation is known as the Dirac–Hestenes equation.[59] In this formulation all the components of the Dirac equation have an explicit geometric interpretation. Another related geometric equation is theDirac–Kähler equation, which is a geometric analogue of the Dirac equation that can be defined on any generalpseudo-Riemannian manifold and which acts ondifferential forms.[60] In the case of aflat manifold, it reduces to four copies of the Dirac equation. However, on curved manifolds this decomposition breaks down and the equation fundamentally differs.[61] This equation is used inlattice field theory to describe the continuum limit ofstaggered fermions.
The Dirac spinor can be decomposed into a pair of Weyl spinors of opposite chirality.[40]: 340 UnderLorentz transformations, one transforms as a left-handed Weyl spinor and the other as a right-handed Weyl spinor. In the chiral representation of the gamma matrices, the Dirac equation reduces to the pair of equations for the Weyl spinors[54]: 57
In particular, in themassless limit the Weyl spinorsdecouple and the Dirac equation is equivalent to a pair ofWeyl equations.[16]: 168 .
This decomposition has been proposed as an intuitive explanation ofZitterbewegung, as these massless components would propagate at thespeed of light and move in opposite directions, since thehelicity is the projection of thespin onto the direction ofmotion.[62] Here the role of themass is not to make thevelocity less than the speed of light, but instead controls theaverage rate at which these reversals occur; specifically, the reversals can be modelled as aPoisson process.[63]
A closely related equation is theMajorana equation, with this formally taking the same form as the Dirac equation except that it acts on Majorana spinors.[16]: 193 These are spinors that satisfy a reality condition, where is thecharge-conjugation operator.[38]: 56 Inhigher dimensions, the Dirac equation has similar relations to the equations describing other spinor representations that arise in those dimensions.
The Pauli equation is often used inquantum mechanics to describe phenomena whererelativistic effects are negligible but the spin of the fermion is important.[8]: 711–715 It can also be recast in a form which directly shows that thegyromagnetic ratio of the fermion described by the Dirac equation is exactly.[54]: 74 Inquantum electrodynamics there are additional quantum corrections that modify this value, give rise to a non-zeroanomalous magnetic moment.[8]: 712
The vector andaxial symmetries of the Dirac action are both globalsymmetries, in that they act the same everywhere inspacetime.[16]: 34 Inclassical field theory, theLagrangian can always be modified in a way to elevate a global symmetry to a local symmetry, which can act differently at different spacetime locations.[40]: 166–167 In the case of the vector symmetry, which corresponding to a global change of thespinorfield by aphase,gauging would result in anaction invariant under a local symmetry where the phase can take different values at different points.[54]: 70 While symmetries can always be gauged in classical field theory, this may not always be possible in the fullquantum theory due to various obstructions such asanomalies, which signal that the full quantum theory is not invariant under the local symmetry despite its classical Lagrangian being invariant.[48]: 177 For example, the axial symmetry in themassless Dirac theory with onefermion is anomalous due to thechiral anomaly and cannot be gauged.[36]: 661
Elevating the vector symmetry to a local symmetry means that the original action is no longer invariant under the symmetry due to the appearance of a term arising from thekinetic term.[40]: 167 Instead, a new field, known as a gauge field, must be introduced. It must also transform under the local symmetry as[36]: 78
where plays the role of thecharge of theDirac spinor to the gauge field. The Dirac action can then be made invariant under the local symmetry by replacing the derivative term with a newgauge covariant derivative[16]: 173
This result can also be directly acquired through the Noether procedure, which is the general principle that a global symmetry can be gauged through the introduction of a term coupling the gauge field to the appropriate global symmetrycurrent.[40]: 161 Additionally introducing the kinetic term for the gauge field results in the action forquantum electrodynamics.[36]: 78
The symmetries that can be gauged can be greatly expanded by considering a theory withidentical Dirac spinors labelled by a newindex. Together these spinors can be considered as being part of a single object with components where labels the four spin components, and the different spinors.[16]: 491 The largest global symmetry of this action is then given by theunitary group.[36]: 490
Any continuoussubgroup of can then be gauged. In particular, if one wishes to gauge the symmetry acting on all components, then the symmetry being gauged must admit an-dimensionalunitary representation acting on the spinors.[49]: 420 That is, for every, there exists an dimensional matrixrepresentation such that
One symmetry that frequently gets gauged is thespecial unitary group symmetry. Spinors transforming in the fundamental representation transform as[49]: 416
where is aunitary matrix corresponding to a particular group element. The gauge field is a matrix valued gauge field which transforms in the adjoint representation as[16]: 492
The covariant derivative then takes the form
By also introducing the kinetic term for the gauge field, one constructs the action forquantum chromodynamics[36]: 489
The case of describes strong interactions of thequark sector of theStandard Model, with the gauge field corresponding togluons and the Dirac spinors to the quarks.[65]: 54 The case also plays a role in the Standard Model, describing theelectroweak sector. The gauge field in this case is theW-boson, while the Dirac spinors areleptons.[16]: 592
^This article uses theEinstein summation convention, where there is an implicit summation over spacetime coordinates for any pairs of repeated indices.
^In the mainly negative signature is adiagonal matrix with +1 for and -1 for the remaining components.
^Strictly speaking the, which is the part of the Lorentz group connected to theidentity. It excludesparity andtime-reversal transformations.
^A representation, mapping group elements to matrices, is faithful if the mapping between group elements and matrices isinjective.
^The proof relies on the fact thatgamma matrices satisfy the identity.
^Gauging a larger dimensional representation of with representation dimension requires instead considering identical spinors and gauging the subgroup.
^abKragh, H. (1984). "Equation with the many fathers. The Klein-Gordon equation in 1926".American Journal of Physics.52 (11):1024–1033.doi:10.1119/1.13782.
^abcTsutsui, T.T.; Silva, E.O.; M de Castro, A.S.; Andrade, F.M. (2025). "The Dirac equation: historical context, comparisons with the Schrödinger and Klein-Gordon equations, and elementary consequences".European Journal of Physics.46 (5): 053001.arXiv:2504.17797.doi:10.1088/1361-6404/adf9a5.
^abcLounesto, P. (1997).Clifford Algebras and Spinors. London Mathematical Society Lecture Note Series, Series Number 239. Cambridge University Press.ISBN978-0521599160.
^Keski-Vakkuri, E.; Montonen, C.; Panero, M. (2022).Mathematical Methods for Physics: An Introduction to Group Theory, Topology and Geometry. Cambridge University Press.ISBN978-1107191136.
^abcdSrednicki, M. (2007).Quantum Field Theory. Cambridge University Press.ISBN978-0521864497.
^abOsborn, H."PART I: Symmetries and particles"(PDF).Department of Applied Mathematics and Theoretical Physics. University of Cambridge. Retrieved16 November 2025.
^abcAldrovandi, R.; Pereira, J.G. (1995).Introduction To Geometrical Physics. World Scientific Publishing.ISBN978-9810222321.
^Rodrigues, W.A.; Oliveira, E.C. (2010).The Many Faces of Maxwell, Dirac and Einstein Equations: A Clifford Bundle Approach. Springer.ISBN978-3642090387.
^Penrose, Roger (2004).The Road to Reality (Sixth Printing ed.). Alfred A. Knopf. pp. 628–632.ISBN0-224-04447-8.
^Gaveau, B.; Jacobson, T.; Kac, M.; Schulman, L. S. (30 July 1984). "Relativistic Extension of the Analogy between Quantum Mechanics and Brownian Motion".Physical Review Letters.53 (5):419–422.Bibcode:1984PhRvL..53..419G.doi:10.1103/PhysRevLett.53.419.
^Nakahara, M. (2003).Geometry, Topology and Physics (2 ed.). CRC Press. pp. 377–380.ISBN978-0750306065.
^Burgess, C.; Moore, G. (2006).The Standard Model: A Primer. Cambridge University Press.ISBN978-0521860369.