In themathematical field ofdifferential geometry, ametric tensor (or simplymetric) is an additionalstructure on amanifoldM (such as asurface) that allows defining distances and angles, just as theinner product on aEuclidean space allows defining distances and angles there. More precisely, a metric tensor at a pointp ofM is abilinear form defined on thetangent space atp (that is, abilinear function that maps pairs oftangent vectors toreal numbers), and a metric field onM consists of a metric tensor at each pointp ofM that varies smoothly withp.
A metric tensorg ispositive-definite ifg(v,v) > 0 for every nonzero vectorv. A manifold equipped with a positive-definite metric tensor is known as aRiemannian manifold. Such a metric tensor can be thought of as specifyinginfinitesimal distance on the manifold. On a Riemannian manifoldM, the length of a smooth curve between two pointsp andq can be defined by integration, and thedistance betweenp andq can be defined as theinfimum of the lengths of all such curves; this makesM ametric space. Conversely, the metric tensor itself is thederivative of the distance function (taken in a suitable manner).[citation needed]
While the notion of a metric tensor was known in some sense to mathematicians such asGauss from the early 19th century, it was not until the early 20th century that its properties as atensor were understood by, in particular,Gregorio Ricci-Curbastro andTullio Levi-Civita, who first codified the notion of a tensor. The metric tensor is an example of atensor field.
The components of a metric tensor in acoordinate basis take on the form of asymmetric matrix whose entries transformcovariantly under changes to the coordinate system. Thus a metric tensor is a covariantsymmetric tensor. From thecoordinate-independent point of view, a metric tensor field is defined to be anondegeneratesymmetric bilinear form on each tangent space that variessmoothly from point to point.
Carl Friedrich Gauss in his 1827Disquisitiones generales circa superficies curvas (General investigations of curved surfaces) considered a surfaceparametrically, with theCartesian coordinatesx,y, andz of points on the surface depending on two auxiliary variablesu andv. Thus a parametric surface is (in today's terms) avector-valued function
depending on anordered pair of real variables(u,v), and defined in anopen setD in theuv-plane. One of the chief aims of Gauss's investigations was to deduce those features of the surface which could be described by a function which would remain unchanged if the surface underwent a transformation in space (such as bending the surface without stretching it), or a change in the particular parametric form of the same geometrical surface.
One natural such invariant quantity is thelength of a curve drawn along the surface. Another is theangle between a pair of curves drawn along the surface and meeting at a common point. A third such quantity is thearea of a piece of the surface. The study of these invariants of a surface led Gauss to introduce the predecessor of the modern notion of the metric tensor.
The metric tensor is in the description below; E, F, and G in the matrix can contain any number as long as the matrix is positive definite.
If the variablesu andv are taken to depend on a third variable,t, taking values in aninterval[a,b], thenr→(u(t),v(t)) will trace out aparametric curve in parametric surfaceM. Thearc length of that curve is given by theintegral
where represents theEuclidean norm. Here thechain rule has been applied, and the subscripts denotepartial derivatives:
The integrand is the restriction[1] to the curve of the square root of the (quadratic)differential
| 1 |
where
| 2 |
The quantityds in (1) is called theline element, whileds2 is called thefirst fundamental form ofM. Intuitively, it represents theprincipal part of the square of the displacement undergone byr→(u,v) whenu is increased bydu units, andv is increased bydv units.
Using matrix notation, the first fundamental form becomes
Suppose now that a different parameterization is selected, by allowingu andv to depend on another pair of variablesu′ andv′. Then the analog of (2) for the new variables is
| 2' |
Thechain rule relatesE′,F′, andG′ toE,F, andG via thematrix equation
| 3 |
where the superscript T denotes thematrix transpose. The matrix with the coefficientsE,F, andG arranged in this way therefore transforms by theJacobian matrix of the coordinate change
A matrix which transforms in this way is one kind of what is called atensor. The matrix
with the transformation law (3) is known as the metric tensor of the surface.
Ricci-Curbastro & Levi-Civita (1900) first observed the significance of a system of coefficientsE,F, andG, that transformed in this way on passing from one system of coordinates to another. The upshot is that the first fundamental form (1) isinvariant under changes in the coordinate system, and that this follows exclusively from the transformation properties ofE,F, andG. Indeed, by the chain rule,
so that
Another interpretation of the metric tensor, also considered by Gauss, is that it provides a way in which to compute the length oftangent vectors to the surface, as well as the angle between two tangent vectors. In contemporary terms, the metric tensor allows one to compute thedot product of tangent vectors in a manner independent of the parametric description of the surface. Any tangent vector at a point of the parametric surfaceM can be written in the form
for suitable real numbersp1 andp2. If two tangent vectors are given:
then using thebilinearity of the dot product,
This is plainly a function of the four variablesa1,b1,a2, andb2. It is more profitably viewed, however, as a function that takes a pair of argumentsa = [a1a2] andb = [b1b2] which are vectors in theuv-plane. That is, put
This is asymmetric function ina andb, meaning that
It is alsobilinear, meaning that it islinear in each variablea andb separately. That is,
for any vectorsa,a′,b, andb′ in theuv plane, and any real numbersμ andλ.
In particular, the length of a tangent vectora is given by
and the angleθ between two vectorsa andb is calculated by
Thesurface area is another numerical quantity which should depend only on the surface itself, and not on how it is parameterized. If the surfaceM is parameterized by the functionr→(u,v) over the domainD in theuv-plane, then the surface area ofM is given by the integral
where× denotes thecross product, and the absolute value denotes the length of a vector in Euclidean space. ByLagrange's identity for the cross product, the integral can be written
wheredet is thedeterminant.
LetM be asmooth manifold of dimensionn; for instance asurface (in the casen = 2) orhypersurface in theCartesian space. At each pointp ∈M there is avector spaceTpM, called thetangent space, consisting of all tangent vectors to the manifold at the pointp. A metric tensor atp is a functiongp(Xp,Yp) which takes as inputs a pair of tangent vectorsXp andYp atp, and produces as an output areal number (scalar), so that the following conditions are satisfied:
A metric tensor fieldg onM assigns to each pointp ofM a metric tensorgp in the tangent space atp in a way that variessmoothly withp. More precisely, given anyopen subsetU of manifoldM and any (smooth)vector fieldsX andY onU, the real functionis a smooth function ofp.
The components of the metric in anybasis ofvector fields, orframe,f = (X1, ...,Xn) are given by[3]
| 4 |
Then2 functionsgij[f] form the entries of ann ×nsymmetric matrix,G[f]. If
are two vectors atp ∈U, then the value of the metric applied tov andw is determined by the coefficients (4) by bilinearity:
Denoting thematrix(gij[f]) byG[f] and arranging the components of the vectorsv andw intocolumn vectorsv[f] andw[f],
wherev[f]T andw[f]T denote thetranspose of the vectorsv[f] andw[f], respectively. Under achange of basis of the form
for someinvertiblen ×n matrixA = (aij), the matrix of components of the metric changes byA as well. That is,
or, in terms of the entries of this matrix,
For this reason, the system of quantitiesgij[f] is said to transform covariantly with respect to changes in the framef.
A system ofn real-valued functions(x1, ...,xn), giving a localcoordinate system on anopen setU inM, determines a basis of vector fields onU
The metricg has components relative to this frame given by
Relative to a new system of local coordinates, say
the metric tensor will determine a different matrix of coefficients,
This new system of functions is related to the originalgij(f) by means of thechain rule
so that
Or, in terms of the matricesG[f] = (gij[f]) andG[f′] = (gij[f′]),
whereDy denotes theJacobian matrix of the coordinate change.
Associated to any metric tensor is thequadratic form defined in each tangent space by
Ifqm is positive for all non-zeroXm, then the metric ispositive-definite atm. If the metric is positive-definite at everym ∈M, theng is called aRiemannian metric. More generally, if the quadratic formsqm have constantsignature independent ofm, then the signature ofg is this signature, andg is called apseudo-Riemannian metric.[4] IfM isconnected, then the signature ofqm does not depend onm.[5]
BySylvester's law of inertia, a basis of tangent vectorsXi can be chosen locally so that the quadratic form diagonalizes in the following manner
for somep between 1 andn. Any two such expressions ofq (at the same pointm ofM) will have the same numberp of positive signs. The signature ofg is the pair of integers(p,n −p), signifying that there arep positive signs andn −p negative signs in any such expression. Equivalently, the metric has signature(p,n −p) if the matrixgij of the metric hasp positive andn −p negativeeigenvalues.
Certain metric signatures which arise frequently in applications are:
Letf = (X1, ...,Xn) be a basis of vector fields, and as above letG[f] be the matrix of coefficients
One can consider theinverse matrixG[f]−1, which is identified with theinverse metric (orconjugate ordual metric). The inverse metric satisfies a transformation law when the framef is changed by a matrixA via
| 5 |
The inverse metric transformscontravariantly, or with respect to the inverse of the change of basis matrixA. Whereas the metric itself provides a way to measure the length of (or angle between) vector fields, the inverse metric supplies a means of measuring the length of (or angle between)covector fields; that is, fields oflinear functionals.
To see this, suppose thatα is a covector field. To wit, for each pointp,α determines a functionαp defined on tangent vectors atp so that the followinglinearity condition holds for all tangent vectorsXp andYp, and all real numbersa andb:
Asp varies,α is assumed to be asmooth function in the sense that
is a smooth function ofp for any smooth vector fieldX.
Any covector fieldα has components in the basis of vector fieldsf. These are determined by
Denote therow vector of these components by
Under a change off by a matrixA,α[f] changes by the rule
That is, the row vector of componentsα[f] transforms as acovariant vector.
For a pairα andβ of covector fields, define the inverse metric applied to these two covectors by
| 6 |
The resulting definition, although it involves the choice of basisf, does not actually depend onf in an essential way. Indeed, changing basis tofA gives
So that the right-hand side of equation (6) is unaffected by changing the basisf to any other basisfA whatsoever. Consequently, the equation may be assigned a meaning independently of the choice of basis. The entries of the matrixG[f] are denoted bygij, where the indicesi andj have been raised to indicate the transformation law (5).
In a basis of vector fieldsf = (X1, ...,Xn), any smooth tangent vector fieldX can be written in the form
| 7 |
for some uniquely determined smooth functionsv1, ...,vn. Upon changing the basisf by a nonsingular matrixA, the coefficientsvi change in such a way that equation (7) remains true. That is,
Consequently,v[fA] =A−1v[f]. In other words, the components of a vector transformcontravariantly (that is, inversely or in the opposite way) under a change of basis by the nonsingular matrixA. The contravariance of the components ofv[f] is notationally designated by placing the indices ofvi[f] in the upper position.
A frame also allows covectors to be expressed in terms of their components. For the basis of vector fieldsf = (X1, ...,Xn) define thedual basis to be thelinear functionals(θ1[f], ...,θn[f]) such that
That is,θi[f](Xj) =δji, theKronecker delta. Let
Under a change of basisf ↦fA for a nonsingular matrixA,θ[f] transforms via
Any linear functionalα on tangent vectors can be expanded in terms of the dual basisθ
| 8 |
wherea[f] denotes therow vector[a1[f] ...an[f] ]. The componentsai transform when the basisf is replaced byfA in such a way that equation (8) continues to hold. That is,
whence, becauseθ[fA] =A−1θ[f], it follows thata[fA] =a[f]A. That is, the componentsa transformcovariantly (by the matrixA rather than its inverse). The covariance of the components ofa[f] is notationally designated by placing the indices ofai[f] in the lower position.
Now, the metric tensor gives a means to identify vectors and covectors as follows. HoldingXp fixed, the function
of tangent vectorYp defines alinear functional on the tangent space atp. This operation takes a vectorXp at a pointp and produces a covectorgp(Xp, −). In a basis of vector fieldsf, if a vector fieldX has componentsv[f], then the components of the covector fieldg(X, −) in the dual basis are given by the entries of the row vector
Under a change of basisf ↦fA, the right-hand side of this equation transforms via
so thata[fA] =a[f]A:a transforms covariantly. The operation of associating to the (contravariant) components of a vector fieldv[f] = [v1[f]v2[f] ...vn[f] ]T the (covariant) components of the covector fielda[f] = [a1[f]a2[f] …an[f] ], where
is calledlowering the index.
Toraise the index, one applies the same construction but with the inverse metric instead of the metric. Ifa[f] = [a1[f]a2[f] ...an[f] ] are the components of a covector in the dual basisθ[f], then the column vector
| 9 |
has components which transform contravariantly:
Consequently, the quantityX =fv[f] does not depend on the choice of basisf in an essential way, and thus defines a vector field onM. The operation (9) associating to the (covariant) components of a covectora[f] the (contravariant) components of a vectorv[f] given is calledraising the index. In components, (9) is
LetU be anopen set inℝn, and letφ be acontinuously differentiable function fromU into theEuclidean spaceℝm, wherem >n. The mappingφ is called animmersion if its differential isinjective at every point ofU. The image ofφ is called animmersed submanifold. More specifically, form = 3, which means that the ambient Euclidean space isℝ3, the induced metric tensor is called thefirst fundamental form.
Suppose thatφ is an immersion onto the submanifoldM ⊂Rm. The usual Euclideandot product inℝm is a metric which, when restricted to vectors tangent toM, gives a means for taking the dot product of these tangent vectors. This is called theinduced metric.
Suppose thatv is a tangent vector at a point ofU, say
whereei are the standard coordinate vectors inℝn. Whenφ is applied toU, the vectorv goes over to the vector tangent toM given by
(This is called thepushforward ofv alongφ.) Given two such vectors,v andw, the induced metric is defined by
It follows from a straightforward calculation that the matrix of the induced metric in the basis of coordinate vector fieldse is given by
whereDφ is the Jacobian matrix:
The notion of a metric can be definedintrinsically using the language offiber bundles andvector bundles. In these terms, ametric tensor is a function
| 10 |
from thefiber product of thetangent bundle ofM with itself toR such that the restriction ofg to each fiber is a nondegenerate bilinear mapping
The mapping (10) is required to becontinuous, and oftencontinuously differentiable,smooth, orreal analytic, depending on the case of interest, and whetherM can support such a structure.
By theuniversal property of the tensor product, any bilinear mapping (10) gives risenaturally to asectiong⊗ of thedual of thetensor product bundle ofTM with itself
The sectiong⊗ is defined on simple elements ofTM ⊗ TM by
and is defined on arbitrary elements ofTM ⊗ TM by extending linearly to linear combinations of simple elements. The original bilinear formg is symmetric if and only if
where
is thebraiding map.
SinceM is finite-dimensional, there is anatural isomorphism
so thatg⊗ is regarded also as a section of the bundleT*M ⊗ T*M of thecotangent bundleT*M with itself. Sinceg is symmetric as a bilinear mapping, it follows thatg⊗ is asymmetric tensor.
More generally, one may speak of a metric in avector bundle. IfE is a vector bundle over a manifoldM, then a metric is a mapping
from thefiber product ofE toR which is bilinear in each fiber:
Using duality as above, a metric is often identified with asection of thetensor product bundleE* ⊗E*.
The metric tensor gives anatural isomorphism from thetangent bundle to thecotangent bundle, sometimes called themusical isomorphism.[6] This isomorphism is obtained by setting, for each tangent vectorXp ∈ TpM,
thelinear functional onTpM which sends a tangent vectorYp atp togp(Xp,Yp). That is, in terms of the pairing[−, −] betweenTpM and itsdual spaceT∗
pM,
for all tangent vectorsXp andYp. The mappingSg is alinear transformation fromTpM toT∗
pM. It follows from the definition of non-degeneracy that thekernel ofSg is reduced to zero, and so by therank–nullity theorem,Sg is alinear isomorphism. Furthermore,Sg is asymmetric linear transformation in the sense that
for all tangent vectorsXp andYp.
Conversely, any linear isomorphismS : TpM → T∗
pM defines a non-degenerate bilinear form onTpM by means of
This bilinear form is symmetric if and only ifS is symmetric. There is thus a natural one-to-one correspondence between symmetric bilinear forms onTpM and symmetric linear isomorphisms ofTpM to the dualT∗
pM.
Asp varies overM,Sg defines a section of the bundleHom(TM, T*M) ofvector bundle isomorphisms of the tangent bundle to the cotangent bundle. This section has the same smoothness asg: it is continuous, differentiable, smooth, or real-analytic according asg. The mappingSg, which associates to every vector field onM a covector field onM gives an abstract formulation of "lowering the index" on a vector field. The inverse ofSg is a mappingT*M → TM which, analogously, gives an abstract formulation of "raising the index" on a covector field.
The inverseS−1
g defines a linear mapping
which is nonsingular and symmetric in the sense that
for all covectorsα,β. Such a nonsingular symmetric mapping gives rise (by thetensor-hom adjunction) to a map
or by thedouble dual isomorphism to a section of the tensor product
Suppose thatg is a Riemannian metric onM. In a local coordinate systemxi,i = 1, 2, …,n, the metric tensor appears as amatrix, denoted here byG, whose entries are the componentsgij of the metric tensor relative to the coordinate vector fields.
Letγ(t) be a piecewise-differentiableparametric curve inM, fora ≤t ≤b. Thearclength of the curve is defined by
In connection with this geometrical application, thequadraticdifferential form
is called thefirst fundamental form associated to the metric, whileds is theline element. Whends2 ispulled back to the image of a curve inM, it represents the square of the differential with respect to arclength.
For a pseudo-Riemannian metric, the length formula above is not always defined, because the term under the square root may become negative. We generally only define the length of a curve when the quantity under the square root is always of one sign or the other. In this case, define
While these formulas use coordinate expressions, they are in fact independent of the coordinates chosen; they depend only on the metric, and the curve along which the formula is integrated.
Given a segment of a curve, another frequently defined quantity is the (kinetic)energy of the curve:
This usage comes fromphysics, specifically,classical mechanics, where the integralE can be seen to directly correspond to thekinetic energy of apoint particle moving on the surface of a manifold. Thus, for example, in Jacobi's formulation ofMaupertuis' principle, the metric tensor can be seen to correspond to the mass tensor of a moving particle.
In many cases, whenever a calculation calls for the length to be used, a similar calculation using the energy may be done as well. This often leads to simpler formulas by avoiding the need for the square-root. Thus, for example, thegeodesic equations may be obtained by applyingvariational principles to either the length or the energy. In the latter case, the geodesic equations are seen to arise from theprinciple of least action: they describe the motion of a "free particle" (a particle feeling no forces) that is confined to move on the manifold, but otherwise moves freely, with constant momentum, within the manifold.[7]
In analogy with the case of surfaces, a metric tensor on ann-dimensional paracompact manifoldM gives rise to a natural way to measure then-dimensionalvolume of subsets of the manifold. The resulting natural positiveBorel measure allows one to develop a theory of integrating functions on the manifold by means of the associatedLebesgue integral.
A measure can be defined, by theRiesz representation theorem, by giving apositive linear functionalΛ on the spaceC0(M) ofcompactly supportedcontinuous functions onM. More precisely, ifM is a manifold with a (pseudo-)Riemannian metric tensorg, then there is a unique positiveBorel measureμg such that for anycoordinate chart(U,φ),for allf supported inU. Heredetg is thedeterminant of the matrix formed by the components of the metric tensor in the coordinate chart. ThatΛ is well-defined on functions supported in coordinate neighborhoods is justified byJacobian change of variables. It extends to a unique positive linear functional onC0(M) by means of apartition of unity.
IfM is alsooriented, then it is possible to define a naturalvolume form from the metric tensor. In apositively oriented coordinate system(x1, ...,xn) the volume form is represented aswhere thedxi are thecoordinate differentials and∧ denotes theexterior product in the algebra ofdifferential forms. The volume form also gives a way to integrate functions on the manifold, and this geometric integral agrees with the integral obtained by the canonical Borel measure.
The most familiar example is that of elementaryEuclidean geometry: the two-dimensionalEuclidean metric tensor. In the usualCartesian(x,y) coordinates, we can write
The length of a curve reduces to the formula:
The Euclidean metric in some other common coordinate systems can be written as follows.
Polar coordinates(r,θ):
So
In general, in aCartesian coordinate systemxi on aEuclidean space, the partial derivatives∂ / ∂xi areorthonormal with respect to the Euclidean metric. Thus the metric tensor is theKronecker delta δij in this coordinate system. The metric tensor with respect to arbitrary (possibly curvilinear) coordinatesqi is given by
The unit sphere inℝ3 comes equipped with a natural metric induced from the ambient Euclidean metric, through the process explained in theinduced metric section. In standard spherical coordinates(θ,φ), withθ thecolatitude, the angle measured from thez-axis, andφ the angle from thex-axis in thexy-plane, the metric takes the form
This is usually written in the form
In flatMinkowski space (special relativity), with coordinates
the metric is, depending on choice ofmetric signature,
For a curve with—for example—constant time coordinate, the length formula with this metric reduces to the usual length formula. For atimelike curve, the length formula gives theproper time along the curve.
In this case, thespacetime interval is written as
TheSchwarzschild metric describes the spacetime around a spherically symmetric body, such as a planet, or ablack hole. With coordinates
we can write the metric as
whereG (inside the matrix) is thegravitational constant andM represents the totalmass–energy content of the central object.