Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Covariant derivative

From Wikipedia, the free encyclopedia
Specification of a derivative along a tangent vector of a manifold

For directional tensor derivatives in continuum mechanics, seeTensor derivative (continuum mechanics). For the covariant derivative used in gauge theories, seeGauge covariant derivative.

Inmathematics, thecovariant derivative is a way of specifying aderivative alongtangent vectors of amanifold. Alternatively, the covariant derivative is a way of introducing and working with aconnection on a manifold by means of adifferential operator, to be contrasted with the approach given by aprincipal connection on theframe bundle – seeaffine connection. In the special case of a manifoldisometrically embedded into a higher-dimensionalEuclidean space, the covariant derivative can be viewed as theorthogonal projection of the Euclideandirectional derivative onto the manifold's tangent space. In this case the Euclidean derivative is broken into two parts, the extrinsic normal component (dependent on the embedding) and the intrinsic covariant derivative component.

The name is motivated by the importance ofchanges of coordinate inphysics: the covariant derivative transformscovariantly under a general coordinate transformation, that is, linearly via theJacobian matrix of the transformation.[1]

This article presents an introduction to the covariant derivative of avector field with respect to a vector field, both in a coordinate-free language and using a localcoordinate system and the traditional index notation. The covariant derivative of atensor field is presented as an extension of the same concept. The covariant derivative generalizes straightforwardly to a notion of differentiation associated to aconnection on a vector bundle, also known as aKoszul connection.

History

[edit]

Historically, at the turn of the 20th century, the covariant derivative was introduced byGregorio Ricci-Curbastro andTullio Levi-Civita in the theory ofRiemannian andpseudo-Riemannian geometry.[2] Ricci and Levi-Civita (following ideas ofElwin Bruno Christoffel) observed that theChristoffel symbols used to define thecurvature could also provide a notion ofdifferentiation which generalized the classicaldirectional derivative ofvector fields on a manifold.[3][4] This new derivative – theLevi-Civita connection – wascovariant in the sense that it satisfied Riemann's requirement that objects in geometry should be independent of their description in a particular coordinate system.

It was soon noted by other mathematicians, prominent among these beingHermann Weyl,Jan Arnoldus Schouten, andÉlie Cartan,[5] that a covariant derivative could be defined abstractly without the presence of ametric. The crucial feature was not a particular dependence on the metric, but that the Christoffel symbols satisfied a certain precise second-order transformation law. This transformation law could serve as a starting point for defining the derivative in a covariant manner. Thus the theory of covariant differentiation forked off from the strictly Riemannian context to include a wider range of possible geometries.

In the 1940s, practitioners ofdifferential geometry began introducing other notions of covariant differentiation in generalvector bundles which were, in contrast to the classical bundles of interest to geometers, not part of thetensor analysis of the manifold. By and large, these generalized covariant derivatives had to be specifiedad hoc by some version of the connection concept. In 1950,Jean-Louis Koszul unified these new ideas of covariant differentiation in a vector bundle by means of what is known today as aKoszul connection or a connection on a vector bundle.[6] Using ideas fromLie algebra cohomology, Koszul successfully converted many of the analytic features of covariant differentiation into algebraic ones. In particular, Koszul connections eliminated the need for awkward manipulations ofChristoffel symbols (and other analogous non-tensorial objects) in differential geometry. Thus they quickly supplanted the classical notion of covariant derivative in many post-1950 treatments of the subject.

Motivation

[edit]

Thecovariant derivative is a generalization of thedirectional derivative fromvector calculus. As with the directional derivative, the covariant derivative is a rule,uv{\displaystyle \nabla _{\mathbf {u} }{\mathbf {v} }}, which takes as its inputs: (1) a vector,u, defined at a pointP, and (2) avector fieldv defined in aneighborhood ofP.[7] The output is the vectoruv(P){\displaystyle \nabla _{\mathbf {u} }{\mathbf {v} }(P)}, also at the pointP. The primary difference from the usual directional derivative is thatuv{\displaystyle \nabla _{\mathbf {u} }{\mathbf {v} }} must, in a certain precise sense, beindependent of the manner in which it is expressed in acoordinate system.

A vector may bedescribed as a list of numbers in terms of abasis, but as a geometrical object the vector retains its identity regardless of how it is described. For a geometric vector written in components with respect to one basis, when the basis is changed the components transform according to achange of basis formula, with the coordinates undergoing acovariant transformation. The covariant derivative is required to transform, under a change in coordinates, by a covariant transformation in the same way as a basis does (hence the name).

In the case ofEuclidean space, one usually defines the directional derivative of a vector field in terms of the difference between two vectors at two nearby points.In such a system onetranslates one of the vectors to the origin of the other, keeping it parallel, then takes their difference within the same vector space. With a Cartesian (fixedorthonormal) coordinate system "keeping it parallel" amounts to keeping the components constant. This ordinary directional derivative on Euclidean space is the first example of a covariant derivative.

Next, one must take into account changes of the coordinate system. For example, if the Euclidean plane is described by polar coordinates, "keeping it parallel" doesnot amount to keeping the polar components constant under translation, since the coordinate grid itself "rotates". Thus, the same covariant derivative written inpolar coordinates contains extra terms that describe how the coordinate grid itself rotates, or how in more general coordinates the grid expands, contracts, twists, interweaves, etc.

Consider the example of a particle moving along a curveγ(t) in the Euclidean plane. In polar coordinates,γ may be written in terms of its radial and angular coordinates byγ(t) = (r(t),θ(t)). A vector at a particular timet[8] (for instance, a constant acceleration of the particle) is expressed in terms of(er,eθ){\displaystyle (\mathbf {e} _{r},\mathbf {e} _{\theta })}, whereer{\displaystyle \mathbf {e} _{r}} andeθ{\displaystyle \mathbf {e} _{\theta }} are unit tangent vectors for the polar coordinates, serving as a basis to decompose a vector in terms of radial andtangential components. At a slightly later time, the new basis in polar coordinates appears slightly rotated with respect to the first set. The covariant derivative of the basis vectors (theChristoffel symbols) serve to express this change.

In a curved space, such as the surface of the Earth (regarded as a sphere), thetranslation of tangent vectors between different points is not well defined, and its analog,parallel transport, depends on the path along which the vector is translated. A vector on a globe on the equator at pointQ is directed to the north. Suppose we transport the vector (keeping it parallel) first along the equator to the pointP, then drag it along a meridian to theN pole, and finally transport it along another meridian back toQ. Then we notice that the parallel-transported vector along a closed circuit does not return as the same vector; instead, it has another orientation. This would not happen in Euclidean space and is caused by thecurvature of the surface of the globe. The same effect occurs if we drag the vector along an infinitesimally small closed surface subsequently along two directions and then back. This infinitesimal change of the vector is a measure of thecurvature, and can be defined in terms of the covariant derivative.

Remarks

[edit]
  • The definition of the covariant derivative does not use the metric in space. However, for each metric there is a uniquetorsion-free covariant derivative called theLevi-Civita connection such that the covariant derivative of the metric is zero.
  • The properties of a derivative imply thatvu{\displaystyle \nabla _{\mathbf {v} }\mathbf {u} } depends on the values ofu in a neighborhood of a pointp in the same way as e.g. the derivative of a scalar functionf along a curve at a given pointp depends on the values off in a neighborhood ofp.
  • The information in a neighborhood of a pointp in the covariant derivative can be used to defineparallel transport of a vector. Also thecurvature,torsion, andgeodesics may be defined only in terms of the covariant derivative or other related variation on the idea of alinear connection.
  • Some equations involving covariant derivative can be locally solved using Chen's iterated integrals[9] of using approach based on linear homotopy operator[10].

Informal definition using an embedding into Euclidean space

[edit]

Suppose an open subsetU of ad-dimensionalRiemannian manifoldM is embedded into Euclidean space(Rn,,){\displaystyle (\mathbb {R} ^{n},\langle \cdot ,\cdot \rangle )} via atwice continuously-differentiable (C2) mappingΨ:RdURn{\displaystyle {\vec {\Psi }}:\mathbb {R} ^{d}\supset U\to \mathbb {R} ^{n}} such that the tangent space atΨ(p){\displaystyle {\vec {\Psi }}(p)} is spanned by the vectors{Ψxi|p:i{1,,d}}{\displaystyle \left\{\left.{\frac {\partial {\vec {\Psi }}}{\partial x^{i}}}\right|_{p}:i\in \{1,\dots ,d\}\right\}}and the scalar product,{\displaystyle \left\langle \cdot ,\cdot \right\rangle } onRn{\displaystyle \mathbb {R} ^{n}} is compatible with the metric onM:gij=Ψxi,Ψxj.{\displaystyle g_{ij}=\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{i}}},{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}\right\rangle .}

(Since the manifold metric is always assumed to be regular,[clarification needed] the compatibility condition implies linear independence of the partial derivative tangent vectors.)

For a tangent vector field,V=vjΨxj{\displaystyle {\vec {V}}=v^{j}{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}}, one hasVxi=xi(vjΨxj)=vjxiΨxj+vj2Ψxixj.{\displaystyle {\frac {\partial {\vec {V}}}{\partial x^{i}}}={\frac {\partial }{\partial x^{i}}}\left(v^{j}{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}\right)={\frac {\partial v^{j}}{\partial x^{i}}}{\frac {\partial {\vec {\Psi }}}{\partial x^{j}}}+v^{j}{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}.}

The last term is not tangential toM, but can be expressed as a linear combination of the tangent space base vectors using theChristoffel symbols as linear factors plus a vector orthogonal to the tangent space:vj2Ψxixj=vjΓkijΨxk+n.{\displaystyle v^{j}{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}=v^{j}{\Gamma ^{k}}_{ij}{\frac {\partial {\vec {\Psi }}}{\partial x^{k}}}+{\vec {n}}.}

In the case of theLevi-Civita connection, the covariant derivativeeiV{\displaystyle \nabla _{\mathbf {e} _{i}}{\vec {V}}}, also writteniV{\displaystyle \nabla _{i}{\vec {V}}}, is defined as the orthogonal projection of the usual derivative onto tangent space:eiV:=Vxin=(vkxi+vjΓkij)Ψxk.{\displaystyle \nabla _{\mathbf {e} _{i}}{\vec {V}}:={\frac {\partial {\vec {V}}}{\partial x^{i}}}-{\vec {n}}=\left({\frac {\partial v^{k}}{\partial x^{i}}}+v^{j}{\Gamma ^{k}}_{ij}\right){\frac {\partial {\vec {\Psi }}}{\partial x^{k}}}.}

From here it may be computationally convenient to obtain a relation between the Christoffel symbols for the Levi-Civita connection and the metric. To do this we first note that, since the vectorn{\displaystyle {\vec {n}}} in the previous equation is orthogonal to the tangent space,2Ψxixj,Ψxl=ΓkijΨxk+n,Ψxl=Ψxk,ΨxlΓkij=gklΓkij.{\displaystyle \left\langle {\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}},{\frac {\partial {\vec {\Psi }}}{\partial x^{l}}}\right\rangle =\left\langle {\Gamma ^{k}}_{ij}{\frac {\partial {\vec {\Psi }}}{\partial x^{k}}}+{\vec {n}},{\frac {\partial {\vec {\Psi }}}{\partial x^{l}}}\right\rangle =\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}},{\frac {\partial {\vec {\Psi }}}{\partial x^{l}}}\right\rangle {\Gamma ^{k}}_{ij}=g_{kl}\,{\Gamma ^{k}}_{ij}.}

Then, since the partial derivative of a componentgab{\displaystyle g_{ab}} of the metric with respect to a coordinatexc{\displaystyle x^{c}} isgabxc=xcΨxa,Ψxb=2Ψxcxa,Ψxb+Ψxa,2Ψxcxb,{\displaystyle {\frac {\partial g_{ab}}{\partial x^{c}}}={\frac {\partial }{\partial x^{c}}}\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{a}}},{\frac {\partial {\vec {\Psi }}}{\partial x^{b}}}\right\rangle =\left\langle {\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{c}\,\partial x^{a}}},{\frac {\partial {\vec {\Psi }}}{\partial x^{b}}}\right\rangle +\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{a}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{c}\,\partial x^{b}}}\right\rangle ,}any tripleti,j,k of indices yields a system of equations{gjkxi=Ψxj,2Ψxkxi+Ψxk,2Ψxixjgkixj=Ψxi,2Ψxjxk+Ψxk,2Ψxixjgijxk=Ψxi,2Ψxjxk+Ψxj,2Ψxkxi.{\displaystyle \left\{{\begin{alignedat}{2}{\frac {\partial g_{jk}}{\partial x^{i}}}=&&\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{j}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{k}\partial x^{i}}}\right\rangle &+\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\partial x^{j}}}\right\rangle \\{\frac {\partial g_{ki}}{\partial x^{j}}}=&\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{i}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{j}\partial x^{k}}}\right\rangle &&+\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\partial x^{j}}}\right\rangle \\{\frac {\partial g_{ij}}{\partial x^{k}}}=&\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{i}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{j}\partial x^{k}}}\right\rangle &+\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{j}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{k}\partial x^{i}}}\right\rangle &&.\end{alignedat}}\right.}(Here the symmetry of the scalar product has been used and the order of partial differentiations has been swapped.)

Adding the first two equations and subtracting the third, we obtaingjkxi+gkixjgijxk=2Ψxk,2Ψxixj.{\displaystyle {\frac {\partial g_{jk}}{\partial x^{i}}}+{\frac {\partial g_{ki}}{\partial x^{j}}}-{\frac {\partial g_{ij}}{\partial x^{k}}}=2\left\langle {\frac {\partial {\vec {\Psi }}}{\partial x^{k}}},{\frac {\partial ^{2}{\vec {\Psi }}}{\partial x^{i}\,\partial x^{j}}}\right\rangle .}

Thus the Christoffel symbols for the Levi-Civita connection are related to the metric bygklΓkij=12(gjlxi+glixjgijxl).{\displaystyle g_{kl}{\Gamma ^{k}}_{ij}={\frac {1}{2}}\left({\frac {\partial g_{jl}}{\partial x^{i}}}+{\frac {\partial g_{li}}{\partial x^{j}}}-{\frac {\partial g_{ij}}{\partial x^{l}}}\right).}

Ifg is nondegenerate thenΓkij{\displaystyle {\Gamma ^{k}}_{ij}} can be solved for directly asΓkij=12gkl(gjlxi+glixjgijxl).{\displaystyle {\Gamma ^{k}}_{ij}={\frac {1}{2}}g^{kl}\left({\frac {\partial g_{jl}}{\partial x^{i}}}+{\frac {\partial g_{li}}{\partial x^{j}}}-{\frac {\partial g_{ij}}{\partial x^{l}}}\right).}

For a very simple example that captures the essence of the description above, draw a circle on a flat sheet of paper. Travel around the circle at a constant speed. The derivative of your velocity, your acceleration vector, always points radially inward. Roll this sheet of paper into a cylinder. Now the (Euclidean) derivative of your velocity has a component that sometimes points inward toward the axis of the cylinder depending on whether you're near a solstice or an equinox. (At the point of the circle when you are moving parallel to the axis, there is no inward acceleration. Conversely, at a point (1/4 of a circle later) when the velocity is along the cylinder's bend, the inward acceleration is maximum.) This is the (Euclidean) normal component. The covariant derivative component is the component parallel to the cylinder's surface, and is the same as that before you rolled the sheet into a cylinder.

Formal definition

[edit]

A covariant derivative is a(Koszul) connection on thetangent bundle and othertensor bundles: it differentiates vector fields in a way analogous to the usual differential on functions. The definition extends to a differentiation on the dual of vector fields (i.e.covector fields) and to arbitrarytensor fields, in a unique way that ensures compatibility with the tensor product and trace operations (tensor contraction).

Functions

[edit]

Given a pointpM{\displaystyle p\in M} of the manifoldM, a real functionf:MR{\displaystyle f:M\to \mathbb {R} } on the manifold and a tangent vectorvTpM{\displaystyle \mathbf {v} \in T_{p}M}, the covariant derivative off atp alongv is the scalar atp, denoted(vf)p{\displaystyle \left(\nabla _{\mathbf {v} }f\right)_{p}}, that represents theprincipal part of the change in the value off when the argument off is changed by the infinitesimal displacement vectorv. (This is thedifferential off evaluated against the vectorv.) Formally, there is a differentiable curveϕ:[1,1]M{\displaystyle \phi :[-1,1]\to M} such thatϕ(0)=p{\displaystyle \phi (0)=p} andϕ(0)=v{\displaystyle \phi '(0)=\mathbf {v} }, and the covariant derivative off atp is defined by(vf)p=(fϕ)(0)=limt0f(ϕ(t))f(p)t.{\displaystyle \left(\nabla _{\mathbf {v} }f\right)_{p}=\left(f\circ \phi \right)^{\prime }\left(0\right)=\lim _{t\to 0}{\frac {f(\phi \left(t\right))-f(p)}{t}}.}

Whenv:MTpM{\displaystyle \mathbf {v} :M\to T_{p}M} is a vector field onM, the covariant derivativevf:MR{\displaystyle \nabla _{\mathbf {v} }f:M\to \mathbb {R} } is the function that associates with each pointp in the common domain off andv the scalar(vf)p{\displaystyle \left(\nabla _{\mathbf {v} }f\right)_{p}}.

For a scalar functionf and vector fieldv, the covariant derivativevf{\displaystyle \nabla _{\mathbf {v} }f} coincides with theLie derivativeLv(f){\displaystyle L_{\mathbf {v} }(f)}, and with theexterior derivativedf(v){\displaystyle df(\mathbf {v} )}.

Vector fields

[edit]

Given a pointp of the manifoldM, a vector fieldu:MTpM{\displaystyle \mathbf {u} :M\to T_{p}M} defined in a neighborhood ofp and a tangent vectorvTpM{\displaystyle \mathbf {v} \in T_{p}M}, the covariant derivative ofu atp alongv is the tangent vector atp, denoted(vu)p{\displaystyle (\nabla _{\mathbf {v} }\mathbf {u} )_{p}}, such that the following properties hold (for any tangent vectorsv,x andy atp, vector fieldsu andw defined in a neighborhood ofp, scalar valuesg andh atp, and scalar functionf defined in a neighborhood ofp):

  1. (vu)p{\displaystyle \left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}} is linear inv{\displaystyle \mathbf {v} } so(gx+hyu)p=g(p)(xu)p+h(p)(yu)p{\displaystyle \left(\nabla _{g\mathbf {x} +h\mathbf {y} }\mathbf {u} \right)_{p}=g(p)\left(\nabla _{\mathbf {x} }\mathbf {u} \right)_{p}+h(p)\left(\nabla _{\mathbf {y} }\mathbf {u} \right)_{p}}
  2. (vu)p{\displaystyle \left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}} is additive inu{\displaystyle \mathbf {u} } so:(v[u+w])p=(vu)p+(vw)p{\displaystyle \left(\nabla _{\mathbf {v} }\left[\mathbf {u} +\mathbf {w} \right]\right)_{p}=\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}+\left(\nabla _{\mathbf {v} }\mathbf {w} \right)_{p}}
  3. (vu)p{\displaystyle (\nabla _{\mathbf {v} }\mathbf {u} )_{p}} obeys theproduct rule; i.e., wherevf{\displaystyle \nabla _{\mathbf {v} }f} is defined above,(v[fu])p=f(p)(vu)p+(vf)pup.{\displaystyle \left(\nabla _{\mathbf {v} }\left[f\mathbf {u} \right]\right)_{p}=f(p)\left(\nabla _{\mathbf {v} }\mathbf {u} )_{p}+(\nabla _{\mathbf {v} }f\right)_{p}\mathbf {u} _{p}.}

Note that(vu)p{\displaystyle \left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}} depends not only on the value ofu atp but also on values ofu in a neighborhood ofp, because the last property, the product rule, involves the directional derivative off (by the vectorv).

Ifu andv are both vector fields defined over a common domain, thenvu{\displaystyle \nabla _{\mathbf {v} }\mathbf {u} } denotes the vector field whose value at each pointp of the domain is the tangent vector(vu)p{\displaystyle \left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}}.

Covector fields

[edit]

Given a field ofcovectors (orone-form)α{\displaystyle \alpha } defined in a neighborhood ofp, its covariant derivative(vα)p{\displaystyle (\nabla _{\mathbf {v} }\alpha )_{p}} is defined in a way to make the resulting operation compatible with tensor contraction and the product rule. That is,(vα)p{\displaystyle (\nabla _{\mathbf {v} }\alpha )_{p}} is defined as the unique one-form atp such that the following identity is satisfied for all vector fieldsu in a neighborhood ofp(vα)p(up)=v[α(u)]pαp[(vu)p].{\displaystyle \left(\nabla _{\mathbf {v} }\alpha \right)_{p}\left(\mathbf {u} _{p}\right)=\nabla _{\mathbf {v} }\left[\alpha \left(\mathbf {u} \right)\right]_{p}-\alpha _{p}\left[\left(\nabla _{\mathbf {v} }\mathbf {u} \right)_{p}\right].}

The covariant derivative of a covector field along a vector fieldv is again a covector field.

Tensor fields

[edit]

Once the covariant derivative is defined for fields of vectors and covectors it can be defined for arbitrarytensor fields by imposing the following identities for every pair of tensor fieldsφ{\displaystyle \varphi } andψ{\displaystyle \psi } in a neighborhood of the pointp:v(φψ)p=(vφ)pψ(p)+φ(p)(vψ)p,{\displaystyle \nabla _{\mathbf {v} }\left(\varphi \otimes \psi \right)_{p}=\left(\nabla _{\mathbf {v} }\varphi \right)_{p}\otimes \psi (p)+\varphi (p)\otimes \left(\nabla _{\mathbf {v} }\psi \right)_{p},}and forφ{\displaystyle \varphi } andψ{\displaystyle \psi } of the same valencev(φ+ψ)p=(vφ)p+(vψ)p.{\displaystyle \nabla _{\mathbf {v} }(\varphi +\psi )_{p}=(\nabla _{\mathbf {v} }\varphi )_{p}+(\nabla _{\mathbf {v} }\psi )_{p}.}The covariant derivative of a tensor field along a vector fieldv is again a tensor field of the same type.

Explicitly, letT be a tensor field of type(p,q). ConsiderT to be a differentiablemultilinear map ofsmoothsectionsα1,α2, ...,αq of the cotangent bundleTM and of sectionsX1,X2, ...,Xp of thetangent bundleTM, writtenT(α1,α2, ...,X1,X2, ...) intoR. The covariant derivative ofT alongY is given by the formula(YT)(α1,α2,,X1,X2,)=Y(T(α1,α2,,X1,X2,))T(Yα1,α2,,X1,X2,)T(α1,Yα2,,X1,X2,)T(α1,α2,,YX1,X2,)T(α1,α2,,X1,YX2,){\displaystyle {\begin{aligned}(\nabla _{Y}T)\left(\alpha _{1},\alpha _{2},\ldots ,X_{1},X_{2},\ldots \right)=&{}\nabla _{Y}\left(T\left(\alpha _{1},\alpha _{2},\ldots ,X_{1},X_{2},\ldots \right)\right)\\&{}-T\left(\nabla _{Y}\alpha _{1},\alpha _{2},\ldots ,X_{1},X_{2},\ldots \right)-T\left(\alpha _{1},\nabla _{Y}\alpha _{2},\ldots ,X_{1},X_{2},\ldots \right)-\cdots \\&{}-T\left(\alpha _{1},\alpha _{2},\ldots ,\nabla _{Y}X_{1},X_{2},\ldots \right)-T\left(\alpha _{1},\alpha _{2},\ldots ,X_{1},\nabla _{Y}X_{2},\ldots \right)-\cdots \end{aligned}}}

Coordinate description

[edit]
This section uses theEinstein summation convention.

Given coordinate functionsxi, i=0,1,2,,{\displaystyle x^{i},\ i=0,1,2,\dots ,} anytangent vector can be described by its components in the basisei=xi.{\displaystyle \mathbf {e} _{i}={\frac {\partial }{\partial x^{i}}}.}

The covariant derivative of a basis vector along a basis vector is again a vector and so can be expressed as a linear combinationΓkek{\displaystyle \Gamma ^{k}\mathbf {e} _{k}}.To specify the covariant derivative it is enough to specify the covariant derivative of each basis vector fieldei{\displaystyle \mathbf {e} _{i}} alongej{\displaystyle \mathbf {e} _{j}}.ejei=Γkijek,{\displaystyle \nabla _{\mathbf {e} _{j}}\mathbf {e} _{i}={\Gamma ^{k}}_{ij}\mathbf {e} _{k},}

the coefficientsΓkij{\displaystyle \Gamma ^{k}{}_{ij}} are the components of the connection with respect to a system of local coordinates. In the theory of Riemannian and pseudo-Riemannian manifolds, the components of the Levi-Civita connection with respect to a system of local coordinates are calledChristoffel symbols.

Then using the rules in the definition, we find that for general vector fieldsv=vjej{\displaystyle \mathbf {v} =v^{j}\mathbf {e} _{j}} andu=uiei{\displaystyle \mathbf {u} =u^{i}\mathbf {e} _{i}} we getvu=vjejuiei=vjejuiei=vjuiejei+vjeiejui=vjuiΓkijek+vjuixjei{\displaystyle {\begin{aligned}\nabla _{\mathbf {v} }\mathbf {u} &=\nabla _{v^{j}\mathbf {e} _{j}}u^{i}\mathbf {e} _{i}\\&=v^{j}\nabla _{\mathbf {e} _{j}}u^{i}\mathbf {e} _{i}\\&=v^{j}u^{i}\nabla _{\mathbf {e} _{j}}\mathbf {e} _{i}+v^{j}\mathbf {e} _{i}\nabla _{\mathbf {e} _{j}}u^{i}\\&=v^{j}u^{i}{\Gamma ^{k}}_{ij}\mathbf {e} _{k}+v^{j}{\partial u^{i} \over \partial x^{j}}\mathbf {e} _{i}\end{aligned}}}sovu=(vjuiΓkij+vjukxj)ek.{\displaystyle \nabla _{\mathbf {v} }\mathbf {u} =\left(v^{j}u^{i}{\Gamma ^{k}}_{ij}+v^{j}{\partial u^{k} \over \partial x^{j}}\right)\mathbf {e} _{k}.}

The first term in this formula is responsible for "twisting" the coordinate system with respect to the covariant derivative and the second for changes of components of the vector fieldu. In particulareju=ju=(uixj+ukΓikj)ei{\displaystyle \nabla _{\mathbf {e} _{j}}\mathbf {u} =\nabla _{j}\mathbf {u} =\left({\frac {\partial u^{i}}{\partial x^{j}}}+u^{k}{\Gamma ^{i}}_{kj}\right)\mathbf {e} _{i}}

In words: the covariant derivative is the usual derivative along the coordinates with correction terms which tell how the coordinates change.

For covectors similarly we haveejθ=(θixjθkΓkij)ei,{\displaystyle \nabla _{\mathbf {e} _{j}}{\mathbf {\theta } }=\left({\frac {\partial \theta _{i}}{\partial x^{j}}}-\theta _{k}{\Gamma ^{k}}_{ij}\right){\mathbf {e} ^{*}}^{i},}whereei(ej)=δij{\displaystyle {\mathbf {e} ^{*}}^{i}(\mathbf {e} _{j})={\delta ^{i}}_{j}}.

The covariant derivative of a type(r,s) tensor field alongec{\displaystyle e_{c}} is given by the expression:

(ecT)a1arb1bs=xcTa1arb1bs+Γa1dcTda2arb1bs++ΓardcTa1ar1db1bsΓdb1cTa1ardb2bsΓdbscTa1arb1bs1d.{\displaystyle {\begin{aligned}{(\nabla _{e_{c}}T)^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}={}&{\frac {\partial }{\partial x^{c}}}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}\\&+\,{\Gamma ^{a_{1}}}_{dc}{T^{da_{2}\ldots a_{r}}}_{b_{1}\ldots b_{s}}+\cdots +{\Gamma ^{a_{r}}}_{dc}{T^{a_{1}\ldots a_{r-1}d}}_{b_{1}\ldots b_{s}}\\&-\,{\Gamma ^{d}}_{b_{1}c}{T^{a_{1}\ldots a_{r}}}_{db_{2}\ldots b_{s}}-\cdots -{\Gamma ^{d}}_{b_{s}c}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s-1}d}.\end{aligned}}}Or, in words: take the partial derivative of the tensor and add:+Γaidc{\displaystyle +{\Gamma ^{a_{i}}}_{dc}} for every upper indexai{\displaystyle a_{i}}, andΓdbic{\displaystyle -{\Gamma ^{d}}_{b_{i}c}} for every lower indexbi{\displaystyle b_{i}}.

If instead of a tensor, one is trying to differentiate atensor density (of weight +1), then one also adds a termΓddcTa1arb1bs.{\displaystyle -{\Gamma ^{d}}_{dc}{T^{a_{1}\ldots a_{r}}}_{b_{1}\ldots b_{s}}.}If it is a tensor density of weightW, then multiply that term byW.For example,g{\textstyle {\sqrt {-g}}} is a scalar density (of weight +1), so we get:(g);c=(g),cgΓddc{\displaystyle \left({\sqrt {-g}}\right)_{;c}=\left({\sqrt {-g}}\right)_{,c}-{\sqrt {-g}}\,{\Gamma ^{d}}_{dc}}where the semicolon ";" indicates covariant differentiation and the comma "," indicates partial differentiation. Incidentally, this particular expression is equal to zero, because the covariant derivative of a function solely of the metric is always zero.

Notation

[edit]

In textbooks on physics, the covariant derivative is sometimes simply stated in terms of its components in this equation.

Often a notation is used in which the covariant derivative is given with asemicolon, while a normalpartial derivative is indicated by acomma. In this notation we write the same as:ejv =def vs;jesvi;j=vi,j+vkΓikj{\displaystyle \nabla _{e_{j}}\mathbf {v} \ {\stackrel {\mathrm {def} }{=}}\ {v^{s}}_{;j}\mathbf {e} _{s}\;\;\;\;\;\;{v^{i}}_{;j}={v^{i}}_{,j}+v^{k}{\Gamma ^{i}}_{kj}}In case two or more indexes appear after the semicolon, all of them must be understood as covariant derivatives:ek(ejv) =def vs;jkes{\displaystyle \nabla _{e_{k}}\left(\nabla _{e_{j}}\mathbf {v} \right)\ {\stackrel {\mathrm {def} }{=}}\ {v^{s}}_{;jk}\mathbf {e} _{s}}

In some older texts (notably Adler, Bazin & Schiffer,Introduction to General Relativity), the covariant derivative is denoted by a double pipe and the partial derivative by single pipe:ejv =def vi||j=vi|j+vkΓikj{\displaystyle \nabla _{e_{j}}\mathbf {v} \ {\stackrel {\mathrm {def} }{=}}\ {v^{i}}_{||j}={v^{i}}_{|j}+v^{k}{\Gamma ^{i}}_{kj}}

Covariant derivative by field type

[edit]

For a scalar fieldϕ{\displaystyle \phi \,}, covariant differentiation is simply partial differentiation:ϕ;aaϕ{\displaystyle \phi _{;a}\equiv \partial _{a}\phi }

For a contravariant vector fieldλa{\displaystyle \lambda ^{a}}, we have:λa;bbλa+Γabcλc{\displaystyle {\lambda ^{a}}_{;b}\equiv \partial _{b}\lambda ^{a}+{\Gamma ^{a}}_{bc}\lambda ^{c}}

For a covariant vector fieldλa{\displaystyle \lambda _{a}}, we have:λa;ccλaΓbcaλb{\displaystyle \lambda _{a;c}\equiv \partial _{c}\lambda _{a}-{\Gamma ^{b}}_{ca}\lambda _{b}}

For a type (2,0) tensor fieldτab{\displaystyle \tau ^{ab}}, we have:τab;ccτab+Γacdτdb+Γbcdτad{\displaystyle {\tau ^{ab}}_{;c}\equiv \partial _{c}\tau ^{ab}+{\Gamma ^{a}}_{cd}\tau ^{db}+{\Gamma ^{b}}_{cd}\tau ^{ad}}

For a type (0,2) tensor fieldτab{\displaystyle \tau _{ab}}, we have:τab;ccτabΓdcaτdbΓdcbτad{\displaystyle \tau _{ab;c}\equiv \partial _{c}\tau _{ab}-{\Gamma ^{d}}_{ca}\tau _{db}-{\Gamma ^{d}}_{cb}\tau _{ad}}

For a type (1,1) tensor fieldτab{\displaystyle {\tau ^{a}}_{b}}, we have:τab;ccτab+ΓacdτdbΓdcbτad{\displaystyle {\tau ^{a}}_{b;c}\equiv \partial _{c}{\tau ^{a}}_{b}+{\Gamma ^{a}}_{cd}{\tau ^{d}}_{b}-{\Gamma ^{d}}_{cb}{\tau ^{a}}_{d}}

The notation above is meant in the senseτab;c(ecτ)ab{\displaystyle {\tau ^{ab}}_{;c}\equiv \left(\nabla _{\mathbf {e} _{c}}\tau \right)^{ab}}

Properties

[edit]

In general, covariant derivatives do not commute. By example, the covariant derivatives of vector fieldλa;bcλa;cb{\displaystyle \lambda _{a;bc}\neq \lambda _{a;cb}}. TheRiemann tensorRdabc{\displaystyle {R^{d}}_{abc}} is defined such that:λa;bcλa;cb=Rdabcλd{\displaystyle \lambda _{a;bc}-\lambda _{a;cb}={R^{d}}_{abc}\lambda _{d}}or, equivalently,λa;bcλa;cb=Radbcλd{\displaystyle {\lambda ^{a}}_{;bc}-{\lambda ^{a}}_{;cb}=-{R^{a}}_{dbc}\lambda ^{d}}

The covariant derivative of a (2,0)-tensor field fulfills:τab;cdτab;dc=RaecdτebRbecdτae{\displaystyle {\tau ^{ab}}_{;cd}-{\tau ^{ab}}_{;dc}=-{R^{a}}_{ecd}\tau ^{eb}-{R^{b}}_{ecd}\tau ^{ae}}

The latter can be shown by taking (without loss of generality) thatτab=λaμb{\displaystyle \tau ^{ab}=\lambda ^{a}\mu ^{b}}.

Derivative along a curve

[edit]

Since the covariant derivativeXT{\displaystyle \nabla _{X}T} of a tensor fieldT at a pointp depends only on the value of the vector fieldX atp one can define the covariant derivative along a smooth curveγ(t){\displaystyle \gamma (t)} in a manifold:DtT=γ˙(t)T.{\displaystyle D_{t}T=\nabla _{{\dot {\gamma }}(t)}T.}Note that the tensor fieldT only needs to be defined on the curveγ(t){\displaystyle \gamma (t)} for this definition to make sense.

In particular,γ˙(t){\displaystyle {\dot {\gamma }}(t)} is a vector field along the curveγ{\displaystyle \gamma } itself. Ifγ˙(t)γ˙(t){\displaystyle \nabla _{{\dot {\gamma }}(t)}{\dot {\gamma }}(t)} vanishes then the curve is called a geodesic of the covariant derivative. If the covariant derivative is theLevi-Civita connection of apositive-definite metric then the geodesics for the connection are precisely thegeodesics of the metric that are parametrized byarc length.

The derivative along a curve is also used to define theparallel transport along the curve.

Sometimes the covariant derivative along a curve is calledabsolute orintrinsic derivative.

Relation to Lie derivative

[edit]

A covariant derivative introduces an extra geometric structure on a manifold that allows vectors in neighboring tangent spaces to be compared: there is no canonical way to compare vectors from different tangent spaces because there is no canonical coordinate system.

There is however another generalization of directional derivatives whichis canonical: theLie derivative, which evaluates the change of one vector field along the flow of another vector field. Thus, one must know both vector fields in a neighborhood, not merely at a single point. The covariant derivative on the other hand introduces its own change for vectors in a given direction, and it only depends on the vector direction at a single point, rather than a vector field in a neighborhood of a point. In other words, the covariant derivative is linear (overC(M)) in the direction argument, while the Lie derivative is linear in neither argument.

Note that the antisymmetrized covariant derivativeuv − ∇vu, and the Lie derivativeLuv differ by thetorsion of the connection, so that if a connection is torsion free, then its antisymmetrizationis the Lie derivative.

See also

[edit]

Notes

[edit]
  1. ^Einstein, Albert (1922). "The General Theory of Relativity".The Meaning of Relativity.
  2. ^Ricci, G.; Levi-Civita, T. (1901)."Méthodes de calcul différential absolu et leurs applications".Mathematische Annalen.54 (1–2):125–201.doi:10.1007/bf01454201.S2CID 120009332.
  3. ^Riemann, G. F. B. (1866). "Über die Hypothesen, welche der Geometrie zu Grunde liegen".Gesammelte Mathematische Werke.; reprint, ed. Weber, H. (1953), New York: Dover.
  4. ^Christoffel, E. B. (1869)."Über die Transformation der homogenen Differentialausdrücke zweiten Grades".Journal für die reine und angewandte Mathematik.70:46–70.
  5. ^cf. withCartan, É (1923)."Sur les variétés à connexion affine et la theorie de la relativité généralisée".Annales Scientifiques de l'École Normale Supérieure.40:325–412.doi:10.24033/asens.751.
  6. ^Koszul, J. L. (1950)."Homologie et cohomologie des algebres de Lie".Bulletin de la Société Mathématique de France.78:65–127.doi:10.24033/bsmf.1410.
  7. ^The covariant derivative is also denoted variously byvu,Dvu, or other notations.
  8. ^In many applications, it may be better not to think oft as corresponding to time, at least for applications ingeneral relativity. It is simply regarded as an abstract parameter varying smoothly and monotonically along the path.
  9. ^Chen, Kuo-Tsai (1973)."Iterated Integrals of Differential Forms and Loop Space Homology".Annals of Mathematics.97 (2):217–246.doi:10.2307/1970846.ISSN 0003-486X.
  10. ^Kycia, Radosław Antoni; Šilhan, Josef (2025-05-21)."Inverting covariant exterior derivative".Analysis and Mathematical Physics.15 (4): 82.doi:10.1007/s13324-025-01085-3.ISSN 1664-235X.

References

[edit]
Basic concepts
Main theorems(list)
Maps
Types of
manifolds
Tensors
Vectors
Covectors
Bundles
Connections
Related
Generalizations
Basic concepts
Types of manifolds
Main results
Generalizations
Applications
Scope
Mathematics
Notation
Tensor
definitions
Operations
Related
abstractions
Notable tensors
Mathematics
Physics
Mathematicians
Retrieved from "https://en.wikipedia.org/w/index.php?title=Covariant_derivative&oldid=1315356811"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp