Lagrangian field theory is a formalism inclassical field theory. It is the field-theoretic analogue ofLagrangian mechanics. Lagrangian mechanics is used to analyze the motion of a system of discrete particles each with a finite number ofdegrees of freedom. Lagrangian field theory applies to continua andfields, which have an infinite number of degrees of freedom.
One motivation for the development of the Lagrangian formalism on fields, and more generally, forclassical field theory, is to provide a clear mathematical foundation forquantum field theory, which is infamously beset by formal difficulties that make it unacceptable as a mathematical theory. The Lagrangians presented here are identical to their quantum equivalents, but, in treating the fields as classical fields, instead of being quantized, one can provide definitions and obtain solutions with properties compatible with the conventional formal approach to the mathematics ofpartial differential equations. This enables the formulation of solutions on spaces with well-characterized properties, such asSobolev spaces. It enables various theorems to be provided, ranging from proofs of existence to theuniform convergence of formal series to the general settings ofpotential theory. In addition, insight and clarity is obtained by generalizations toRiemannian manifolds andfiber bundles, allowing the geometric structure to be clearly discerned and disentangled from the corresponding equations of motion. A clearer view of the geometric structure has in turn allowed highly abstract theorems from geometry to be used to gain insight, ranging from theChern–Gauss–Bonnet theorem and theRiemann–Roch theorem to theAtiyah–Singer index theorem andChern–Simons theory.
In field theory, the independent variable is replaced by an event inspacetime(x,y,z,t), or more generally still by a points on aRiemannian manifold. The dependent variables are replaced by the value of a field at that point in spacetime so that theequations of motion are obtained by means of anaction principle, written as:where theaction,, is afunctional of the dependent variables, their derivatives ands itself
where the brackets denote;ands = {sα} denotes theset ofnindependent variables of the system, including the time variable, and is indexed byα = 1, 2, 3, ...,n. The calligraphic typeface,, is used to denote thedensity, and is thevolume form of the field function, i.e., the measure of the domain of the field function.
In mathematical formulations, it is common to express the Lagrangian as a function on afiber bundle, wherein the Euler–Lagrange equations can be interpreted as specifying thegeodesics on the fiber bundle, leading to topics liketangent manifolds,symplectic manifolds andcontact geometry.[1] The field theories of physics can be developed in terms of gauge invariant fiber bundles.[2]
In Lagrangian field theory, the Lagrangian as a function ofgeneralized coordinates is replaced by a Lagrangian density, a function of the fields in the system and their derivatives, and possibly the space and time coordinates themselves. In field theory, the independent variablet is replaced by an event in spacetime(x,y,z,t) or still more generally by a points on a manifold.
Often, a "Lagrangian density" is simply referred to as a "Lagrangian".
For one scalar field, the Lagrangian density will take the form:[nb 1][3]
For many scalar fields
In mathematical formulations, the scalar fields are understood to becoordinates on afiber bundle, and the derivatives of the field are understood to besections of thejet bundle.
The above can be generalized forvector fields,tensor fields, andspinor fields. In physics,fermions are described by spinor fields.Bosons are described by tensor fields, which include scalar and vector fields as special cases.
For example, if there arereal-valuedscalar fields,, then the field manifold is. If the field is a realvector field, then the field manifold isisomorphic to.
Thetime integral of the Lagrangian is called theaction denoted byS. In field theory, a distinction is occasionally made between theLagrangianL, of which the time integral is the actionand theLagrangian density, which one integrates over allspacetime to get the action:
The spatialvolume integral of the Lagrangian density is the Lagrangian; in 3D,
The action is often referred to as the "actionfunctional", in that it is a function of the fields (and their derivatives).
In the presence of gravity or when using general curvilinear coordinates, the Lagrangian density will include a factor of. This ensures that the action is invariant under general coordinate transformations. In mathematical literature, spacetime is taken to be aRiemannian manifold and the integral then becomes thevolume form
Here, the is thewedge product and is the square root of the determinant of themetric tensor on. For flat spacetime (e.g.,Minkowski spacetime), the unit volume is one, i.e. and so it is commonly omitted, when discussing field theory in flat spacetime. Likewise, the use of the wedge-product symbols offers no additional insight over the ordinary concept of a volume in multivariate calculus, and so these are likewise dropped. Some older textbooks, e.g., Landau and Lifschitz write for the volume form, since the minus sign is appropriate for metric tensors with signature (+−−−) or (−+++) (since the determinant is negative, in either case). When discussing field theory on general Riemannian manifolds, the volume form is usually written in the abbreviated notation where is theHodge star. That is,and so
Not infrequently, the notation above is considered to be entirely superfluous, andis frequently seen. Do not be misled: the volume form is implicitly present in the integral above, even if it is not explicitly written.
TheEuler–Lagrange equations describe thegeodesic flow of the field as a function of time. Taking thevariation with respect to, one obtains
Solving, with respect to theboundary conditions, one obtains theEuler–Lagrange equations:
Often the Lagrangian consists of a sum ofpolynomial terms, with thesymmetries of the theory and the fields involved dictating the types of terms that are allowed. For example, inrelativistic theories, each term must beLorentz invariant while in a theory with agauge field, they must be gauge invariant.
Terms that contain the product of two fields and noderivatives are known asmass terms, with these givingmass to the fields.[4] For example, a single real scalar field of mass has a mass term given by
The other terms that have two fields, those with at least one derivative, are known askinetic terms. They make fieldsdynamical, with most theories requiring a restriction of at most two derivatives in kinetic terms topreserve probabililties in aquantum theory. They are also usuallypositive-definite to ensure positive energies.[nb 2] For example, the kinetic term for a relativistic real scalar field is given by
Fields with no kinetic terms can also be found, playing the role ofauxiliary fields, background fields, orcurrents. Theories with only kinetic and mass terms, formfree field theories.
Any term with more than two fields per term is known as aninteraction term.[5] The presence of these gives rise to interacting theories where particles canscatter off each other. The coefficients in front of these terms are known ascoupling constants and they dictate the strength of the interaction. For example, aquartic interaction in a real scalar field theory is given by
where is its coupling constant. This term gives rise to scattering processes whereby two scalar fields can scatter off each other. Interacting terms can have any number of derivatives, with each derivative providing amomentum dependence to the scattering term as can be seen by going intomomentum space.
Terms with only one field are known astadpole terms since they give rise totadpole Feynman diagrams.[4]: 415 In theories withtranslational symmetries, such terms can usually be eliminated by redefining some of the fields though a shift.[nb 3]
Constant terms, those with no fields, have no physical consequences in non-gravitational theories.[6] In classical field theories, the equations of motion only depend on variations of the Lagrangian, so constant terms play no role. In quantum field theories they only provide an irrelevant overall multiplicative term to thepartition function, so again play no role. Physically this is because in these theories there is no absoluteenergy scale as thepotential energy can always be shifted by an arbitrary constant without altering the physics. However, ingravitational systems the constant terms are multiplied by the metric determinant, coupling them to the spacetime. They play the role of thecosmological constant, directly affecting the dynamics of the theory at both a classical and quantum level.
Polynomial terms are often expressed with certaincanonical normalizations, used to simplify theFeynman rules that are derived from them. Usually one divides by the product of thefactorial of themultipicity of the fields. For example, in a theory with two real scalar fields, a term of the form term would be divided by. Particles andantiparticles are distinguished in this counting, so that a complex scalar field term of the form is divided by rather than.
A large variety of physical systems have been formulated in terms of Lagrangians over fields. Below is a sampling of some of the most common ones found in physics textbooks on field theory.
The Lagrangian density for Newtonian gravity is:
whereΦ is thegravitational potential,ρ is the mass density, andG in m3·kg−1·s−2 is thegravitational constant. The density has units of J·m−3. Here the interaction term involves a continuous mass densityρ in kg·m−3. This is necessary because using a point source for a field would result in mathematical difficulties.
This Lagrangian can be written in the form of, with the providing a kinetic term, and the interaction the potential term. See alsoNordström's theory of gravitation for how this could be modified to deal with changes over time. This form is reprised in the next example of a scalar field theory.
The variation of the integral with respect toΦ is:
After integrating by parts, discarding the total integral, and dividing out byδΦ the formula becomes:which is equivalent to:which yieldsGauss's law for gravity.
The Lagrangian for a scalar field moving in a potential can be written asIt is not at all an accident that the scalar theory resembles the undergraduate textbook Lagrangian for thekinetic term of a free point particle written as. The scalar theory is the field-theory generalization of a particle moving in a potential. When the is theMexican hat potential, the resulting fields are termed theHiggs fields.
Thesigma model describes the motion of a scalar point particle constrained to move on aRiemannian manifold, such as a circle or a sphere. It generalizes the case of scalar and vector fields, that is, fields constrained to move on a flat manifold. The Lagrangian is commonly written in one of three equivalent forms:where the is thedifferential. An equivalent expression iswith theRiemannian metric on the manifold of the field; i.e. the fields are justlocal coordinates on thecoordinate chart of the manifold. A third common form iswithand, theLie groupSU(N). This group can be replaced by any Lie group, or, more generally, by asymmetric space. The trace is just theKilling form in hiding; the Killing form provides a quadratic form on the field manifold, the lagrangian is then just the pullback of this form. Alternately, the Lagrangian can also be seen as the pullback of theMaurer–Cartan form to the base spacetime.
In general, sigma models exhibittopological soliton solutions. The most famous and well-studied of these is theSkyrmion, which serves as a model of thenucleon that has withstood the test of time.
Consider a point particle, a charged particle, interacting with theelectromagnetic field. The interaction termsare replaced by terms involving a continuous charge density ρ in A·s·m−3 and current density in A·m−2. The resulting Lagrangian density for the electromagnetic field is:
Varying this with respect toϕ, we getwhich yieldsGauss' law.
Varying instead with respect to, we getwhich yieldsAmpère's law.
Usingtensor notation, we can write all this more compactly. The term is actually the inner product of twofour-vectors. We package the charge density into the current 4-vector and the potential into the potential 4-vector. These two new vectors areWe can then write the interaction term asAdditionally, we can package the E and B fields into what is known as theelectromagnetic tensor.We define this tensor asThe term we are looking out for turns out to beWe have made use of theMinkowski metric to raise the indices on the EMF tensor. In this notation, Maxwell's equations arewhere ε is theLevi-Civita tensor. So the Lagrange density for electromagnetism in special relativity written in terms of Lorentz vectors and tensors isIn this notation it is apparent that classical electromagnetism is a Lorentz-invariant theory. By theequivalence principle, it becomes simple to extend the notion of electromagnetism to curved spacetime.[7][8]
Usingdifferential forms, the electromagnetic actionS in vacuum on a (pseudo-) Riemannian manifold can be written (usingnatural units,c =ε0 = 1) asHere,A stands for the electromagnetic potential 1-form,J is the current 1-form,F is the field strength 2-form and the star denotes theHodge star operator. This is exactly the same Lagrangian as in the section above, except that the treatment here is coordinate-free; expanding the integrand into a basis yields the identical, lengthy expression. Note that with forms, an additional integration measure is not necessary because forms have coordinate differentials built in. Variation of the action leads toThese are Maxwell's equations for the electromagnetic potential. SubstitutingF = dA immediately yields the equation for the fields,becauseF is anexact form.
TheA field can be understood to be theaffine connection on aU(1)-fiber bundle. That is, classical electrodynamics, all of its effects and equations, can becompletely understood in terms of acircle bundle overMinkowski spacetime.
TheYang–Mills equations can be written in exactly the same form as above, by replacing theLie groupU(1) of electromagnetism by an arbitrary Lie group. In theStandard Model, it is conventionally taken to be although the general case is of general interest. In all cases, there is no need for any quantization to be performed. Although the Yang–Mills equations are historically rooted in quantum field theory, the above equations are purely classical.[2]
In the same vein as the above, one can consider the action in one dimension less, i.e. in acontact geometry setting. This gives theChern–Simons functional. It is written as
Chern–Simons theory was deeply explored in physics, as a toy model for a broad range of geometric phenomena that might be expected to be found in aGrand Unified Theory.
The Lagrangian density forGinzburg–Landau theory combines the Lagrangian for thescalar field theory with the Lagrangian for theYang–Mills action. It may be written as:[9]where is asection of avector bundle with fiber. The corresponds to the order parameter in asuperconductor; equivalently, it corresponds to theHiggs field, after noting that the second term is the famous"Sombrero hat" potential. The field is the (non-Abelian) gauge field, i.e. theYang–Mills field and is its field-strength. TheEuler–Lagrange equations for the Ginzburg–Landau functional are theYang–Mills equationsandwhere is theHodge star operator, i.e. the fully antisymmetric tensor. These equations are closely related to theYang–Mills–Higgs equations. Another closely related Lagrangian is found inSeiberg–Witten theory.
The Lagrangian density for aDirac field is:[10]: 143 where is aDirac spinor, is itsDirac adjoint, and isFeynman slash notation for.[dubious –discuss] There is no particular need to focus on Dirac spinors in the classical theory. TheWeyl spinors provide a more general foundation; they can be constructed directly from theClifford algebra of spacetime; the construction works in any number of dimensions,[11] and the Dirac spinors appear as a special case. Weyl spinors have the additional advantage that they can be used in avielbein for the metric on a Riemannian manifold; this enables the concept of aspin structure, which, roughly speaking, is a way of formulating spinors consistently in a curved spacetime.
The Lagrangian density forQED combines the Lagrangian for the Dirac field together with the Lagrangian for electrodynamics in a gauge-invariant way. It is:where is theelectromagnetic tensor,D is thegauge covariant derivative, and isFeynman notation for with where is theelectromagnetic four-potential. Although the word "quantum" appears in the above, this is a historical artifact. The definition of the Dirac field requires no quantization whatsoever, it can be written as a purely classical field of anti-commutingWeyl spinors constructed from first principles from aClifford algebra.[11] The full gauge-invariant classical formulation is given in Bleecker.[2]
The Lagrangian density forquantum chromodynamics combines the Lagrangian for one or more massiveDirac spinors with the Lagrangian for theYang–Mills action, which describes the dynamics of a gauge field; the combined Lagrangian is gauge invariant. It may be written as:[12]whereD is the QCDgauge covariant derivative,n = 1, 2, ...6 counts thequark types, and is thegluon field strength tensor. As for the electrodynamics case above, the appearance of the word "quantum" above only acknowledges its historical development. The Lagrangian and its gauge invariance can be formulated and treated in a purely classical fashion.[2][11]
The Lagrange density for general relativity in the presence of matter fields iswhere is thecosmological constant, is thecurvature scalar, which is theRicci tensor contracted with themetric tensor, and theRicci tensor is theRiemann tensor contracted with aKronecker delta. The integral of is known as theEinstein–Hilbert action. The Riemann tensor is thetidal force tensor, and is constructed out ofChristoffel symbols and derivatives of Christoffel symbols, which define themetric connection on spacetime. The gravitational field itself was historically ascribed to the metric tensor; the modern view is that the connection is "more fundamental". This is due to the understanding that one can write connections with non-zerotorsion. These alter the metric without altering the geometry one bit. As to the actual "direction in which gravity points" (e.g. on the surface of the Earth, it points down), this comes from the Riemann tensor: it is the thing that describes the "gravitational force field" that moving bodies feel and react to. (This last statement must be qualified: there is no "force field"per se; moving bodies followgeodesics on the manifold described by the connection. They move in a "straight line".)
The Lagrangian for general relativity can also be written in a form that makes it manifestly similar to the Yang–Mills equations. This is called the Einstein–Yang–Mills action principle. This is done by noting that most of differential geometry works "just fine" on bundles with anaffine connection and arbitrary Lie group. Then, plugging in SO(3,1) for that symmetry group, i.e. for theframe fields, one obtains the equations above.[2][11]
Substituting this Lagrangian into the Euler–Lagrange equation and taking the metric tensor as the field, we obtain theEinstein field equations is theenergy momentum tensor and is defined bywhere is the determinant of the metric tensor when regarded as a matrix. Generally, in general relativity, the integration measure of the action of Lagrange density is. This makes the integral coordinate independent, as the root of the metric determinant is equivalent to theJacobian determinant. The minus sign is a consequence of the metric signature (the determinant by itself is negative).[7] This is an example of thevolume form, previously discussed, becoming manifest in non-flat spacetime.
The Lagrange density of electromagnetism in general relativity also contains the Einstein–Hilbert action from above. The pure electromagnetic Lagrangian is precisely a matter Lagrangian. The Lagrangian is
This Lagrangian is obtained by simply replacing the Minkowski metric in the above flat Lagrangian with a more general (possibly curved) metric. We can generate the Einstein Field Equations in the presence of an EM field using this lagrangian. The energy-momentum tensor isIt can be shown that this energy momentum tensor is traceless, i.e. thatIf we take the trace of both sides of the Einstein Field Equations, we obtainSo the tracelessness of the energy momentum tensor implies that the curvature scalar in an electromagnetic field vanishes. The Einstein equations are thenAdditionally, Maxwell's equations arewhere is thecovariant derivative. For free space, we can set the current tensor equal to zero,. Solving both Einstein and Maxwell's equations around a spherically symmetric mass distribution in free space leads to theReissner–Nordström charged black hole, with the defining line element (written innatural units and with chargeQ):[7]
One possible way of unifying the electromagnetic and gravitational Lagrangians (by using a fifth dimension) is given byKaluza–Klein theory.[2] Effectively, one constructs an affine bundle, just as for the Yang–Mills equations given earlier, and then considers the action separately on the 4-dimensional and the 1-dimensional parts. Suchfactorizations, such as the fact that the 7-sphere can be written as a product of the 4-sphere and the 3-sphere, or that the 11-sphere is a product of the 4-sphere and the 7-sphere, accounted for much of the early excitement that atheory of everything had been found. Unfortunately, the 7-sphere proved not large enough to enclose all of theStandard Model, dashing these hopes.