Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Minkowski space

From Wikipedia, the free encyclopedia
Mathematical description of spacetime used in relativity
For the use in algebraic number theory, seeMinkowski space (number field). For the use in normed geometry, seeNormed vector space.
Hermann Minkowski (1864–1909) found that the theory of special relativity could be best understood as a four-dimensional space, since known as the Minkowski spacetime.

Inphysics,Minkowski space (orMinkowski spacetime) (/mɪŋˈkɔːfski,-ˈkɒf-/[1]) is the main mathematical description ofspacetime in the absence ofgravitation. It combinesinertialspace andtimemanifolds into afour-dimensional model.

The model helps show how aspacetime interval between any twoevents is independent of theinertial frame of reference in which they are recorded. MathematicianHermann Minkowski developed it from the work ofHendrik Lorentz,Henri Poincaré, and others, and said it "was grown on experimental physical grounds".

Minkowski space is closely associated withEinstein's theories ofspecial relativity andgeneral relativity and is the most common mathematical structure by which special relativity is formalized. While the individual components in Euclidean space and time might differ due tolength contraction andtime dilation, in Minkowski spacetime, all frames of reference will agree on the total interval in spacetime between events.[nb 1] Minkowski space differs fromfour-dimensional Euclidean space insofar as it treats time differently from the three spatial dimensions.

In 3-dimensionalEuclidean space, theisometry group (maps preserving the regularEuclidean distance) is theEuclidean group. It is generated byrotations,reflections andtranslations. When time is appended as a fourth dimension, the further transformations of translations in time andLorentz boosts are added, and the group of all these transformations is called thePoincaré group. Minkowski's model follows special relativity, where motion causestime dilation changing the scale applied to the frame in motion and shifts the phase of light.

Minkowski space is apseudo-Euclidean space equipped with anisotropic quadratic form called thespacetime interval or theMinkowski norm squared. An event in Minkowski space for which the spacetime interval is zero is on thenull cone of the origin, called thelight cone in Minkowski space. Using thepolarization identity the quadratic form is converted to asymmetric bilinear form called theMinkowski inner product, though it is not a geometricinner product. Another misnomer isMinkowski metric,[2] but Minkowski space is not ametric space.

Thegroup of transformations for Minkowski space that preserves the spacetime interval (as opposed to the spatial Euclidean distance) is theLorentz group (as opposed to theGalilean group).

History

[edit]
Part of a series on
Spacetime

Complex Minkowski spacetime

[edit]
See also:Four-dimensional space

In his second relativity paper in 1905,Henri Poincaré showed[3] how, by taking time to be an imaginary fourthspacetime coordinateict, wherec is thespeed of light andi is theimaginary unit,Lorentz transformations can be visualized as ordinary rotations of the four-dimensional Euclidean sphere. The four-dimensional spacetime can be visualized as a four-dimensional space, with each point representing an event in spacetime. TheLorentz transformations can then be thought of as rotations in this four-dimensional space, where the rotation axis corresponds to the direction of relative motion between the two observers and the rotation angle is related to their relative velocity.

To understand this concept, one should consider the coordinates of an event in spacetime represented as a four-vector(t,x,y,z). A Lorentz transformation is represented by amatrix that acts on the four-vector, changing its components. This matrix can be thought of as a rotation matrix in four-dimensional space, which rotates the four-vector around a particular axis.x2+y2+z2+(ict)2=constant.{\displaystyle x^{2}+y^{2}+z^{2}+(ict)^{2}={\text{constant}}.}

Rotations in planes spanned by two space unit vectors appear in coordinate space as well as in physical spacetime as Euclidean rotations and are interpreted in the ordinary sense. The "rotation" in a plane spanned by a spaceunit vector and a time unit vector, while formally still a rotation in coordinate space, is aLorentz boost in physical spacetime withreal inertial coordinates. The analogy with Euclidean rotations is only partial since the radius of the sphere is actually imaginary, which turns rotations into rotations in hyperbolic space (seehyperbolic rotation).

This idea, which was mentioned only briefly by Poincaré, was elaborated by Minkowski in a paper inGerman published in 1908 called "The Fundamental Equations for Electromagnetic Processes in Moving Bodies".[4] He reformulatedMaxwell equations as a symmetrical set of equations in the four variables(x,y,z,ict) combined with redefined vector variables for electromagnetic quantities, and he was able to show directly and very simply their invariance under Lorentz transformation. He also made other important contributions and used matrix notation for the first time in this context.From his reformulation, he concluded that time and space should be treated equally, and so arose his concept of events taking place in a unified four-dimensionalspacetime continuum.

Real Minkowski spacetime

[edit]

In a further development in his 1908 "Space and Time" lecture,[5] Minkowski gave an alternative formulation of this idea that used a real time coordinate instead of an imaginary one, representing the four variables(x,y,z,t) of space and time in the coordinate form in a four-dimensional realvector space. Points in this space correspond to events in spacetime. In this space, there is a definedlight-cone associated with each point, and events not on the light cone are classified by their relation to the apex asspacelike ortimelike. It is principally this view of spacetime that is current nowadays, although the older view involvingimaginary time has also influenced special relativity.

In the English translation of Minkowski's paper, the Minkowski metric, as defined below, is referred to as theline element. The Minkowski inner product below appears unnamed when referring toorthogonality (which he callsnormality) of certain vectors, and the Minkowski norm squared is referred to as "sum" (a word choice that might be attributable to language translation).

Minkowski's principal tool is theMinkowski diagram, and he uses it to define concepts and demonstrate properties of Lorentz transformations (e.g.,proper time andlength contraction) and to provide geometrical interpretation to the generalization of Newtonian mechanics torelativistic mechanics. For these special topics, see the referenced articles, as the presentation below will be principally confined to the mathematical structure (Minkowski metric and from it derived quantities and thePoincaré group as symmetry group of spacetime)following from the invariance of the spacetime interval on the spacetime manifold as consequences of the postulates of special relativity, not to specific application orderivation of the invariance of the spacetime interval. This structure provides the background setting of all present relativistic theories, barring general relativity for whichflat Minkowski spacetime still provides a springboard as curved spacetime is locally Lorentzian.

Minkowski, aware of the fundamental restatement of the theory which he had made, said

The views of space and time which I wish to lay before you have sprung from the soil of experimental physics, and therein lies their strength. They are radical. Henceforth, space by itself and time by itself are doomed to fade away into mere shadows, and only a kind of union of the two will preserve an independent reality.

— Hermann Minkowski, 1908, 1909[5]

Though Minkowski took an important step for physics,Albert Einstein saw its limitation:

At a time when Minkowski was giving the geometrical interpretation of special relativity by extending the Euclidean three-space to aquasi-Euclidean four-space that included time, Einstein was already aware that this is not valid, because it excludes the phenomenon ofgravitation. He was still far from the study ofcurvilinear coordinates andRiemannian geometry, and the heavy mathematical apparatus entailed.[6]

For further historical information see referencesGalison (1979),Corry (1997) andWalter (1999).

Causal structure

[edit]
Main article:Causal structure
Subdivision of Minkowski spacetime with respect to an event in four disjoint sets: thelight cone, thecausal future (also called theabsolute future), thecausal past (also called theabsolute past), andelsewhere. The terminology is fromSard (1970), and fromCausal structure.

Wherev is velocity,x,y, andz areCartesian coordinates in 3-dimensional space,c is the constant representing the universal speed limit, andt is time, the four-dimensional vectorv = (ct,x,y,z) = (ct,r) is classified according to the sign ofc2t2r2. A vector istimelike ifc2t2 >r2,spacelike ifc2t2 <r2, andnull orlightlike ifc2t2 =r2. This can be expressed in terms of the sign ofη(v,v), also calledscalar product, as well, which depends on the signature. The classification of any vector will be the same in all frames of reference that are related by a Lorentz transformation (but not by a general Poincaré transformation because the origin may then be displaced) because of the invariance of the spacetime interval under Lorentz transformation.

The set of allnull vectors at an event[nb 2] of Minkowski space constitutes thelight cone of that event. Given a timelike vectorv, there is aworldline of constant velocity associated with it, represented by a straight line in a Minkowski diagram.

Once a direction of time is chosen,[nb 3] timelike and null vectors can be further decomposed into various classes. For timelike vectors, one has

  1. future-directed timelike vectors whose first component is positive (tip of vector located in causal future (also called the absolute future) in the figure) and
  2. past-directed timelike vectors whose first component is negative (causal past (also called the absolute past)).

Null vectors fall into three classes:

  1. the zero vector, whose components in any basis are(0, 0, 0, 0) (origin),
  2. future-directed null vectors whose first component is positive (upper light cone), and
  3. past-directed null vectors whose first component is negative (lower light cone).

Together with spacelike vectors, there are 6 classes in all.

Anorthonormal basis for Minkowski space necessarily consists of one timelike and three spacelike unit vectors. If one wishes to work with non-orthonormal bases, it is possible to have other combinations of vectors. For example, one can easily construct a (non-orthonormal) basis consisting entirely of null vectors, called anull basis.

Vector fields are called timelike, spacelike, or null if the associated vectors are timelike, spacelike, or null at each point where the field is defined.

Properties of time-like vectors

[edit]

Time-like vectors have special importance in the theory of relativity as they correspond to events that are accessible to the observer at (0, 0, 0, 0) with a speed less than that of light. Of most interest are time-like vectors that aresimilarly directed, i.e. all either in the forward or in the backward cones. Such vectors have several properties not shared by space-like vectors. These arise because both forward and backward cones are convex, whereas the space-like region is not convex.

Scalar product

[edit]

Thescalar product of two time-like vectorsu1 = (t1,x1,y1,z1) andu2 = (t2,x2,y2,z2) isη(u1,u2)=u1u2=c2t1t2x1x2y1y2z1z2.{\displaystyle \eta (u_{1},u_{2})=u_{1}\cdot u_{2}=c^{2}t_{1}t_{2}-x_{1}x_{2}-y_{1}y_{2}-z_{1}z_{2}.}

Positivity of scalar product: An important property is that the scalar product of two similarly directed time-like vectors is always positive. This can be seen from the reversedCauchy–Schwarz inequality below. It follows that if the scalar product of two vectors is zero, then one of these, at least, must be space-like. The scalar product of two space-like vectors can be positive or negative as can be seen by considering the product of two space-like vectors having orthogonal spatial components and times either of different or the same signs.

Using the positivity property of time-like vectors, it is easy to verify that a linear sum with positive coefficients of similarly directed time-like vectors is also similarly directed time-like (the sum remains within the light cone because of convexity).

Norm and reversed Cauchy inequality

[edit]

The norm of a time-like vectoru = (ct,x,y,z) is defined asu=η(u,u)=c2t2x2y2z2{\displaystyle \left\|u\right\|={\sqrt {\eta (u,u)}}={\sqrt {c^{2}t^{2}-x^{2}-y^{2}-z^{2}}}}

The reversed Cauchy inequality is another consequence of the convexity of either light cone.[7] For two distinct similarly directed time-like vectorsu1 andu2 this inequality isη(u1,u2)>u1u2{\displaystyle \eta (u_{1},u_{2})>\left\|u_{1}\right\|\left\|u_{2}\right\|}or algebraically,c2t1t2x1x2y1y2z1z2>(c2t12x12y12z12)(c2t22x22y22z22){\displaystyle c^{2}t_{1}t_{2}-x_{1}x_{2}-y_{1}y_{2}-z_{1}z_{2}>{\sqrt {\left(c^{2}t_{1}^{2}-x_{1}^{2}-y_{1}^{2}-z_{1}^{2}\right)\left(c^{2}t_{2}^{2}-x_{2}^{2}-y_{2}^{2}-z_{2}^{2}\right)}}}

From this, the positive property of the scalar product can be seen.

Reversed triangle inequality

[edit]

For two similarly directed time-like vectorsu andw, the inequality is[8]u+wu+w,{\displaystyle \left\|u+w\right\|\geq \left\|u\right\|+\left\|w\right\|,}where the equality holds when the vectors arelinearly dependent.

The proof uses the algebraic definition with the reversed Cauchy inequality:[9]u+w2=u2+2(u,w)+w2u2+2uw+w2=(u+w)2.{\displaystyle {\begin{aligned}\left\|u+w\right\|^{2}&=\left\|u\right\|^{2}+2\left(u,w\right)+\left\|w\right\|^{2}\\[5mu]&\geq \left\|u\right\|^{2}+2\left\|u\right\|\left\|w\right\|+\left\|w\right\|^{2}=\left(\left\|u\right\|+\left\|w\right\|\right)^{2}.\end{aligned}}}

The result now follows by taking the square root on both sides.

Mathematical structure

[edit]

It is assumed below that spacetime is endowed with a coordinate system corresponding to aninertial frame. This provides anorigin, which is necessary for spacetime to be modeled as a vector space. This addition is not required, and more complex treatments analogous to anaffine space can remove the extra structure. However, this is not the introductory convention and is not covered here.

For an overview, Minkowski space is a4-dimensionalrealvector space equipped with a non-degenerate,symmetric bilinear form on thetangent space at each point in spacetime, here simply called theMinkowski inner product, withmetric signature either(+ − − −) or(− + + +). The tangent space at each event is a vector space of the same dimension as spacetime,4.

Tangent vectors

[edit]
A pictorial representation of the tangent space at a point,x, on asphere. This vector space can be thought of as a subspace ofR3 itself. Then vectors in it would be calledgeometrical tangent vectors. By the same principle, the tangent space at a point in flat spacetime can be thought of as a subspace of spacetime, which happens to beall of spacetime.

In practice, one need not be concerned with the tangent spaces. The vector space structure of Minkowski space allows for the canonical identification of vectors in tangent spaces at points (events) with vectors (points, events) in Minkowski space itself. See e.g.Lee (2003, Proposition 3.8.) orLee (2012, Proposition 3.13.) These identifications are routinely done in mathematics. They can be expressed formally in Cartesian coordinates as[10](x0,x1,x2,x3)  x0e0|p+x1e1|p+x2e2|p+x3e3|p x0e0|q+x1e1|q+x2e2|q+x3e3|q{\displaystyle {\begin{aligned}\left(x^{0},\,x^{1},\,x^{2},\,x^{3}\right)\ &\leftrightarrow \ \left.x^{0}\mathbf {e} _{0}\right|_{p}+\left.x^{1}\mathbf {e} _{1}\right|_{p}+\left.x^{2}\mathbf {e} _{2}\right|_{p}+\left.x^{3}\mathbf {e} _{3}\right|_{p}\\&\leftrightarrow \ \left.x^{0}\mathbf {e} _{0}\right|_{q}+\left.x^{1}\mathbf {e} _{1}\right|_{q}+\left.x^{2}\mathbf {e} _{2}\right|_{q}+\left.x^{3}\mathbf {e} _{3}\right|_{q}\end{aligned}}}with basis vectors in the tangent spaces defined byeμ|p=xμ|p or e0|p=(1000), etc.{\displaystyle \left.\mathbf {e} _{\mu }\right|_{p}=\left.{\frac {\partial }{\partial x^{\mu }}}\right|_{p}{\text{ or }}\mathbf {e} _{0}|_{p}=\left({\begin{matrix}1\\0\\0\\0\end{matrix}}\right){\text{, etc}}.}

Here,p andq are any two events, and the second basis vector identification is referred to asparallel transport. The first identification is the canonical identification of vectors in the tangent space at any point with vectors in the space itself. The appearance of basis vectors in tangent spaces as first-order differential operators is due to this identification. It is motivated by the observation that a geometrical tangent vector can be associated in a one-to-one manner with adirectional derivative operator on the set of smooth functions. This is promoted to adefinition of tangent vectors in manifoldsnot necessarily being embedded inRn. This definition of tangent vectors is not the only possible one, as ordinaryn-tuples can be used as well.

Definitions of tangent vectors as ordinary vectors

A tangent vector at a pointp may be defined, here specialized to Cartesian coordinates in Lorentz frames, as4 × 1 column vectorsv associated toeach Lorentz frame related by Lorentz transformationΛ such that the vectorv in a frame related to some frame byΛ transforms according tov → Λv. This is thesame way in which the coordinatesxμ transform. Explicitly,xμ=Λμνxν,vμ=Λμνvν.{\displaystyle {\begin{aligned}x'^{\mu }&={\Lambda ^{\mu }}_{\nu }x^{\nu },\\v'^{\mu }&={\Lambda ^{\mu }}_{\nu }v^{\nu }.\end{aligned}}}

This definition is equivalent to the definition given above under a canonical isomorphism.

For some purposes, it is desirable to identify tangent vectors at a pointp withdisplacement vectors atp, which is, of course, admissible by essentially the same canonical identification.[11] The identifications of vectors referred to above in the mathematical setting can correspondingly be found in a more physical and explicitly geometrical setting inMisner, Thorne & Wheeler (1973). They offer various degrees of sophistication (and rigor) depending on which part of the material one chooses to read.

Metric signature

[edit]

The metric signature refers to which sign the Minkowski inner product yields when given space (spacelike to be specific, defined further down) and time basis vectors (timelike) as arguments. Further discussion about this theoretically inconsequential but practically necessary choice for purposes of internal consistency and convenience is deferred to the hide box below. See also the page treating sign convention in Relativity.

The choice of metric signature

In general, but with several exceptions, mathematicians and general relativists prefer spacelike vectors to yield a positive sign,(− + + +), while particle physicists tend to prefer timelike vectors to yield a positive sign,(+ − − −). Authors covering several areas of physics, e.g.Steven Weinberg andLandau and Lifshitz((− + + +) and(+ − − −), respectively) stick to one choice regardless of topic. Arguments for the former convention include "continuity" from the Euclidean case corresponding to the non-relativistic limitc → ∞. Arguments for the latter include that minus signs, otherwise ubiquitous in particle physics, go away. Yet other authors, especially of introductory texts, e.g.Kleppner & Kolenkow (1978), donot choose a signature at all, but instead, opt to coordinatize spacetime such that the timecoordinate (but not time itself!) is imaginary. This removes the need for theexplicit introduction of ametric tensor (which may seem like an extra burden in an introductory course), and one neednot be concerned withcovariant vectors andcontravariant vectors (or raising and lowering indices) to be described below. The inner product is instead effected by a straightforward extension of thedot product fromR3{\displaystyle \mathbb {R} ^{3}} over toC×R3.{\displaystyle \mathbb {C} \times \mathbb {R} ^{3}.} This works in the flat spacetime of special relativity, but not in the curved spacetime of general relativity, seeMisner, Thorne & Wheeler (1973, Box 2.1, "Farewell toi c t ")(who, by the way use(− + + +)). MTW also argues that it hides the trueindefinite nature of the metric and the true nature of Lorentz boosts, which are not rotations. It also needlessly complicates the use of tools ofdifferential geometry that are otherwise immediately available and useful for geometrical description and calculation – even in the flat spacetime of special relativity, e.g. of the electromagnetic field.

Terminology

[edit]

Mathematically associated with the bilinear form is atensor of type(0,2) at each point in spacetime, called theMinkowski metric.[nb 4] The Minkowski metric, the bilinear form, and the Minkowski inner product are all the same object; it is a bilinear function that accepts two (contravariant) vectors and returns a real number. In coordinates, this is the4×4 matrix representing the bilinear form.

For comparison, ingeneral relativity, aLorentzian manifoldL is likewise equipped with ametric tensorg, which is a nondegenerate symmetric bilinear form on the tangent spaceTpL at each pointp ofL. In coordinates, it may be represented by a4×4 matrixdepending on spacetime position. Minkowski space is thus a comparatively simple special case of aLorentzian manifold. Its metric tensor is in coordinates with the samesymmetric matrix at every point ofM, and its arguments can, per above, be taken as vectors in spacetime itself.

Introducing more terminology (but not more structure), Minkowski space is thus apseudo-Euclidean space with total dimensionn = 4 andsignature(1, 3) or(3, 1). Elements of Minkowski space are calledevents. Minkowski space is often denotedR1,3 orR3,1 to emphasize the chosen signature, or justM. It is an example of apseudo-Riemannian manifold.

Then mathematically, the metric is a bilinear form on an abstract four-dimensional real vector spaceV, that is,η:V×VR{\displaystyle \eta :V\times V\rightarrow \mathbf {R} }whereη has signature(+, -, -, -), and signature is a coordinate-invariant property ofη. The space of bilinear maps forms a vector space which can be identified withMM{\displaystyle M^{*}\otimes M^{*}}, andη may be equivalently viewed as an element of this space. By making a choice of orthonormal basis{eμ}{\displaystyle \{e_{\mu }\}},M:=(V,η){\displaystyle M:=(V,\eta )} can be identified with the spaceR1,3:=(R4,ημν){\displaystyle \mathbf {R} ^{1,3}:=(\mathbf {R} ^{4},\eta _{\mu \nu })}. The notation is meant to emphasize the fact thatM andR1,3{\displaystyle \mathbf {R} ^{1,3}} are not just vector spaces but have added structure.ημν=diag(+1,1,1,1){\displaystyle \eta _{\mu \nu }={\text{diag}}(+1,-1,-1,-1)}.

An interesting example of non-inertial coordinates for (part of) Minkowski spacetime is theBorn coordinates. Another useful set of coordinates is thelight-cone coordinates.

Pseudo-Euclidean metrics

[edit]
Main articles:Pseudo-Euclidean space andLorentzian manifolds

The Minkowski inner product is not aninner product, since it has non-zeronull vectors. Since it is not adefinite bilinear form it is calledindefinite.

The Minkowski metricη is the metric tensor of Minkowski space. It is a pseudo-Euclidean metric, or more generally, aconstant pseudo-Riemannian metric in Cartesian coordinates. As such, it is a nondegenerate symmetric bilinear form, a type(0, 2) tensor. It accepts two argumentsup,vp, vectors inTpM,pM, the tangent space atp inM. Due to the above-mentioned canonical identification ofTpM withM itself, it accepts argumentsu,v with bothu andv inM.

As a notational convention, vectorsv inM, called4-vectors, are denoted in italics, and not, as is common in the Euclidean setting, with boldfacev. The latter is generally reserved for the3-vector part (to be introduced below) of a4-vector.

The definition[12]uv=η(u,v){\displaystyle u\cdot v=\eta (u,\,v)}yields an inner product-like structure onM, previously and also henceforth, called theMinkowski inner product, similar to the Euclideaninner product, but it describes a different geometry. It is also called therelativistic dot product. If the two arguments are the same,uu=η(u,u)u2u2,{\displaystyle u\cdot u=\eta (u,u)\equiv \|u\|^{2}\equiv u^{2},}the resulting quantity will be called theMinkowski norm squared. The Minkowski inner product satisfies the following properties.

Linearity in the first argument
η(au+v,w)=aη(u,w)+η(v,w),u,vM,aR{\displaystyle \eta (au+v,\,w)=a\eta (u,\,w)+\eta (v,\,w),\quad \forall u,\,v\in M,\;\forall a\in \mathbb {R} }
Symmetry
η(u,v)=η(v,u){\displaystyle \eta (u,\,v)=\eta (v,\,u)}
Non-degeneracy
η(u,v)=0,vM  u=0{\displaystyle \eta (u,\,v)=0,\;\forall v\in M\ \Rightarrow \ u=0}

The first two conditions imply bilinearity.

The most important feature of the inner product and norm squared is thatthese are quantities unaffected by Lorentz transformations. In fact, it can be taken as the defining property of a Lorentz transformation in that it preserves the inner product (i.e. the value of the corresponding bilinear form on two vectors). This approach is taken more generally forall classical groups definable this way inclassical group. There, the matrixΦ is identical in the caseO(3, 1) (the Lorentz group) to the matrixη to be displayed below.

Orthogonality

[edit]
The blue axes are hyperbolically-orthogonal; light speed is constant over all such pairs

Minkowski space is constructed so that thespeed of light will be the same constant regardless of the reference frame in which it is measured. This property results from the relation of the time axis to a space axis. Two eventsu andv areorthogonal when the bilinear form is zero for them:η(v,w) = 0.

When bothu andv are both space-like, then they areperpendicular, but if one is time-like and the other space-like, then the relation ishyperbolic orthogonality. The relation is preserved in a change of reference frames and consequently the computation of light speed yields a constant result. The change of reference frame is called aLorentz boost and in mathematics it is ahyperbolic rotation. Each reference frame is associated with ahyperbolic angle, which is zero for the rest frame in Minkowski space. Such a hyperbolic angle has been labelledrapidity since it is associated with the speed of the frame.

Minkowski metric

[edit]
Not to be confused withMinkowski distance which is also called Minkowski metric.

From thesecond postulate of special relativity, together with homogeneity of spacetime and isotropy of space, it follows that thespacetime interval between two arbitrary events called1 and2 is:[13]c2(t1t2)2(x1x2)2(y1y2)2(z1z2)2.{\displaystyle c^{2}\left(t_{1}-t_{2}\right)^{2}-\left(x_{1}-x_{2}\right)^{2}-\left(y_{1}-y_{2}\right)^{2}-\left(z_{1}-z_{2}\right)^{2}.}This quantity is not consistently named in the literature. The interval is sometimes referred to as the square root of the interval as defined here.[14][15]

The invariance of the interval under coordinate transformations between inertial frames follows from the invariance ofc2t2x2y2z2{\displaystyle c^{2}t^{2}-x^{2}-y^{2}-z^{2}}provided the transformations are linear. Thisquadratic form can be used to define a bilinear formuv=c2t1t2x1x2y1y2z1z2{\displaystyle u\cdot v=c^{2}t_{1}t_{2}-x_{1}x_{2}-y_{1}y_{2}-z_{1}z_{2}}via thepolarization identity. This bilinear form can in turn be written asuv=uT[η]v,{\displaystyle u\cdot v=u^{\textsf {T}}\,[\eta ]\,v,}where[η] is a4×4{\displaystyle 4\times 4} matrix associated withη. While possibly confusing, it is common practice to denote[η] with justη. The matrix is read off from the explicit bilinear form asη=(1000010000100001),{\displaystyle \eta =\left({\begin{array}{r}1&0&0&0\\0&-1&0&0\\0&0&-1&0\\0&0&0&-1\end{array}}\right)\!,}and the bilinear formuv=η(u,v),{\displaystyle u\cdot v=\eta (u,v),}with which this section started by assuming its existence, is now identified.

For definiteness and shorter presentation, the signature(− + + +) is adopted below. This choice (or the other possible choice) has no (known) physical implications. The symmetry group preserving the bilinear form with one choice of signature is isomorphic (under the map givenhere) with the symmetry group preserving the other choice of signature. This means that both choices are in accord with the two postulates of relativity. Switching between the two conventions is straightforward. If the metric tensorη has been used in a derivation, go back to the earliest point where it was used, substituteη forη, and retrace forward to the desired formula with the desired metric signature.

Standard basis

[edit]

A standard or orthonormal basis for Minkowski space is a set of four mutually orthogonal vectors{e0,e1,e2,e3} such thatη(e0,e0)=η(e1,e1)=η(e2,e2)=η(e3,e3)=1{\displaystyle \eta (e_{0},e_{0})=-\eta (e_{1},e_{1})=-\eta (e_{2},e_{2})=-\eta (e_{3},e_{3})=1}and for whichη(eμ,eν)=0{\displaystyle \eta (e_{\mu },e_{\nu })=0} whenμν.{\textstyle \mu \neq \nu \,.}

These conditions can be written compactly in the formη(eμ,eν)=ημν.{\displaystyle \eta (e_{\mu },e_{\nu })=\eta _{\mu \nu }.}

Relative to a standard basis, the components of a vectorv are written(v0,v1,v2,v3) where theEinstein notation is used to writev =vμeμ. The componentv0 is called thetimelike component ofv while the other three components are called thespatial components. The spatial components of a4-vectorv may be identified with a3-vectorv = (v1,v2,v3).

In terms of components, the Minkowski inner product between two vectorsv andw is given by

η(v,w)=ημνvμwν=v0w0+v1w1+v2w2+v3w3=vμwμ=vμwμ,{\displaystyle \eta (v,w)=\eta _{\mu \nu }v^{\mu }w^{\nu }=v^{0}w_{0}+v^{1}w_{1}+v^{2}w_{2}+v^{3}w_{3}=v^{\mu }w_{\mu }=v_{\mu }w^{\mu },}andη(v,v)=ημνvμvν=v0v0+v1v1+v2v2+v3v3=vμvμ.{\displaystyle \eta (v,v)=\eta _{\mu \nu }v^{\mu }v^{\nu }=v^{0}v_{0}+v^{1}v_{1}+v^{2}v_{2}+v^{3}v_{3}=v^{\mu }v_{\mu }.}

Herelowering of an index with the metric was used.

There are many possible choices of standard basis obeying the conditionη(eμ,eν)=ημν.{\displaystyle \eta (e_{\mu },e_{\nu })=\eta _{\mu \nu }.} Any two such bases are related in some sense by a Lorentz transformation, either by a change-of-basis matrixΛνμ{\displaystyle \Lambda _{\nu }^{\mu }}, a real4 × 4 matrix satisfyingΛρμημνΛσν=ηρσ.{\displaystyle \Lambda _{\rho }^{\mu }\eta _{\mu \nu }\Lambda _{\sigma }^{\nu }=\eta _{\rho \sigma }.}orΛ, a linear map on the abstract vector space satisfying, for any pair of vectorsu,v,η(Λu,Λv)=η(u,v).{\displaystyle \eta (\Lambda u,\Lambda v)=\eta (u,v).}

Then if two different bases exist,{e0,e1,e2,e3} and{e0,e1,e2,e3},eμ=eνΛμν{\displaystyle e_{\mu }'=e_{\nu }\Lambda _{\mu }^{\nu }} can be represented aseμ=eνΛμν{\displaystyle e_{\mu }'=e_{\nu }\Lambda _{\mu }^{\nu }} oreμ=Λeμ{\displaystyle e_{\mu }'=\Lambda e_{\mu }}. While it might be tempting to think ofΛνμ{\displaystyle \Lambda _{\nu }^{\mu }} andΛ as the same thing, mathematically, they are elements of different spaces, and act on the space of standard bases from different sides.

Raising and lowering of indices

[edit]
Main articles:Raising and lowering indices andtensor contraction
Linear functionals (1-forms)α,β and their sumσ and vectorsu,v,w, in3dEuclidean space. The number of (1-form)hyperplanes intersected by a vector equals theinner product.[16]

Technically, a non-degenerate bilinear form provides a map between a vector space and its dual; in this context, the map is between the tangent spaces ofM and thecotangent spaces ofM. At a point inM, the tangent and cotangent spaces aredual vector spaces (so the dimension of the cotangent space at an event is also4). Just as an authentic inner product on a vector space with one argument fixed, byRiesz representation theorem, may be expressed as the action of alinear functional on the vector space, the same holds for the Minkowski inner product of Minkowski space.[17]

Thus ifvμ are the components of a vector in tangent space, thenημνvμ =vν are the components of a vector in the cotangent space (a linear functional). Due to the identification of vectors in tangent spaces with vectors inM itself, this is mostly ignored, and vectors with lower indices are referred to ascovariant vectors. In this latter interpretation, the covariant vectors are (almost always implicitly) identified with vectors (linear functionals) in the dual of Minkowski space. The ones with upper indices arecontravariant vectors. In the same fashion, the inverse of the map from tangent to cotangent spaces, explicitly given by the inverse ofη in matrix representation, can be used to defineraising of an index. The components of this inverse are denotedημν. It happens thatημν =ημν. These maps between a vector space and its dual can be denotedη (eta-flat) andη (eta-sharp) by the musical analogy.[18]

Contravariant and covariant vectors are geometrically very different objects. The first can and should be thought of as arrows. A linear function can be characterized by two objects: itskernel, which is ahyperplane passing through the origin, and its norm. Geometrically thus, covariant vectors should be viewed as a set of hyperplanes, with spacing depending on the norm (bigger = smaller spacing), with one of them (the kernel) passing through the origin. The mathematical term for a covariant vector is 1-covector or1-form (though the latter is usually reserved for covectorfields).

One quantum mechanical analogy explored in the literature is that of ade Broglie wave (scaled by a factor of Planck's reduced constant) associated with amomentum four-vector to illustrate how one could imagine a covariant version of a contravariant vector. The inner product of two contravariant vectors could equally well be thought of as the action of the covariant version of one of them on the contravariant version of the other. The inner product is then how many times the arrow pierces the planes.[16] The mathematical reference,Lee (2003), offers the same geometrical view of these objects (but mentions no piercing).

Theelectromagnetic field tensor is adifferential 2-form, which geometrical description can as well be found in MTW.

One may, of course, ignore geometrical views altogether (as is the style in e.g.Weinberg (2002) andLandau & Lifshitz 2002) and proceed algebraically in a purely formal fashion. The time-proven robustness of the formalism itself, sometimes referred to asindex gymnastics, ensures that moving vectors around and changing from contravariant to covariant vectors and vice versa (as well as higher order tensors) is mathematically sound. Incorrect expressions tend to reveal themselves quickly.

Coordinate free raising and lowering

[edit]

Given a bilinear form η:M×MR ,{\displaystyle \ \eta :M\times M\rightarrow \mathbb {R} \ ,} the lowered version of a vector can be thought of as the partial evaluation of η ,{\displaystyle \ \eta \ ,} that is, there is an associated partial evaluation mapη(,):MM  ,vη(v,) .{\displaystyle \eta (\cdot ,-):M\rightarrow M^{*}\ ~,\quad v\mapsto \eta (v,\cdot )~.}

The lowered vector η(v,)M {\displaystyle \ \eta (v,\cdot )\in M^{*}\ } is then the dual map uη(v,u) .{\displaystyle \ u\mapsto \eta (v,u)~.} Note it does not matter which argument is partially evaluated due to the symmetry of η .{\displaystyle \ \eta ~.}

Non-degeneracy is then equivalent to injectivity of the partial evaluation map, or equivalently non-degeneracy indicates that the kernel of the map is trivial. In finite dimension, as is the case here, and noting that the dimension of a finite-dimensional space is equal to the dimension of the dual, this is enough to conclude the partial evaluation map is a linear isomorphism from M {\displaystyle \ M\ } to M .{\displaystyle \ M^{*}~.} This then allows the definition of the inverse partial evaluation map,η1:MM ,{\displaystyle \eta ^{-1}:M^{*}\rightarrow M\ ,}which allows the inverse metric to be defined asη1:M×MR  ,η1(α,β) = η( η1(α), η1(β) ) {\displaystyle \eta ^{-1}:M^{*}\times M^{*}\rightarrow \mathbb {R} \ ~,\quad \eta ^{-1}\!(\alpha ,\beta )\ =\ \eta {\bigl (}\ \eta ^{-1}\!(\alpha ),\ \eta ^{-1}\!(\beta )\ {\bigr )}\ }where the two different usages ofη1 {\displaystyle \;\eta ^{-1}\ } can be told apart by the argument each is evaluated on. This can then be used to raise indices. If a coordinate basis is used, the metricη−1 is indeed the matrix inverse toη .

Formalism of the Minkowski metric

[edit]

The present purpose is to show semi-rigorously howformally one may apply the Minkowski metric to two vectors and obtain a real number, i.e. to display the role of the differentials and how they disappear in a calculation. The setting is that of smooth manifold theory, and concepts such as convector fields and exterior derivatives are introduced.

A formal approach to the Minkowski metric

A full-blown version of the Minkowski metric in coordinates as a tensor field on spacetime has the appearanceημνdxμdxν=ημνdxμdxν=ημνdxμdxν .{\displaystyle \eta _{\mu \nu }\operatorname {d} x^{\mu }\otimes \operatorname {d} x^{\nu }=\eta _{\mu \nu }\operatorname {d} x^{\mu }\odot \operatorname {d} x^{\nu }=\eta _{\mu \nu }\operatorname {d} x^{\mu }\operatorname {d} x^{\nu }~.}

Explanation: The coordinate differentials are 1-form fields. They are defined as theexterior derivative of the coordinate functionsxμ. These quantities evaluated at a pointp provide a basis for the cotangent space atp. Thetensor product (denoted by the symbol) yields a tensor field of type(0, 2), i.e. the type that expects two contravariant vectors as arguments. On the right-hand side, thesymmetric product (denoted by the symbol or by juxtaposition) has been taken. The equality holds since, by definition, the Minkowski metric is symmetric.[19] The notation on the far right is also sometimes used for the related, but different,line element. It isnot a tensor. For elaboration on the differences and similarities, seeMisner, Thorne & Wheeler (1973, Box 3.2 and section 13.2.)

Tangent vectors are, in this formalism, given in terms of a basis of differential operators of the first order,xμ|p ,{\displaystyle \left.{\frac {\partial }{\partial x^{\mu }}}\right|_{p}\ ,}wherep is an event. This operator applied to a functionf gives thedirectional derivative off atp in the direction of increasingxμ withxν,νμ fixed. They provide a basis for the tangent space atp.

The exterior derivativedf of a functionf is acovector field, i.e. an assignment of a cotangent vector to each pointp, by definition such thatdf(X)=X f,{\displaystyle \operatorname {d} f(X)=X\ f,}for eachvector fieldX. A vector field is an assignment of a tangent vector to each pointp. In coordinatesX can be expanded at each pointp in the basis given by the/xν|p . Applying this withf =xμ, the coordinate function itself, andX =/xν , called acoordinate vector field, one obtainsdxμ(xν)=xμxν=δνμ .{\displaystyle \operatorname {d} x^{\mu }\left({\frac {\partial }{\partial x^{\nu }}}\right)={\frac {\partial x^{\mu }}{\partial x^{\nu }}}=\delta _{\nu }^{\mu }~.}

Since this relation holds at each pointp, thedxμ|p provide a basis for the cotangent space at eachp and the basesdxμ|p and/xν|p aredual to each other,dxμ|p(xν|p)=δνμ .{\displaystyle {\Bigl .}\operatorname {d} x^{\mu }{\Bigr |}_{p}\left(\left.{\frac {\partial }{\partial x^{\nu }}}\right|_{p}\right)=\delta _{\nu }^{\mu }~.}at eachp. Furthermore, one hasα  β(a,b) = α(a) β(b) {\displaystyle \alpha \ \otimes \ \beta (a,b)\ =\ \alpha (a)\ \beta (b)\ }for general one-forms on a tangent spaceα,β and general tangent vectorsa,b. (This can be taken as a definition, but may also be proved in a more general setting.)

Thus when the metric tensor is fed two vectors fieldsa,b, both expanded in terms of the basis coordinate vector fields, the result isημν dxμdxν(a,b) = ημν aμ bν ,{\displaystyle \eta _{\mu \nu }\ \operatorname {d} x^{\mu }\otimes \operatorname {d} x^{\nu }(a,b)\ =\ \eta _{\mu \nu }\ a^{\mu }\ b^{\nu }\ ,}whereaμ,bν are thecomponent functions of the vector fields. The above equation holds at each pointp, and the relation may as well be interpreted as the Minkowski metric atp applied to two tangent vectors atp.

As mentioned, in a vector space, such as modeling the spacetime of special relativity, tangent vectors can be canonically identified with vectors in the space itself, and vice versa. This means that the tangent spaces at each point are canonically identified with each other and with the vector space itself. This explains how the right-hand side of the above equation can be employed directly, without regard to the spacetime point the metric is to be evaluated and from where (which tangent space) the vectors come from.

This situation changes ingeneral relativity. There one hasgμν(p) dxμ|p dxν|p(a,b) = gμν(p) aμ bν ,{\displaystyle g_{\mu \nu }\!(p)\ {\Bigl .}\operatorname {d} x^{\mu }{\Bigr |}_{p}\ \left.\operatorname {d} x^{\nu }\right|_{p}(a,b)\ =\ g_{\mu \nu }\!(p)\ a^{\mu }\ b^{\nu }\ ,}where nowηg(p), i.e.,g is still a metric tensor but now depending on spacetime and is a solution ofEinstein's field equations. Moreover,a,bmust be tangent vectors at spacetime pointp and can no longer be moved around freely.

Chronological and causality relations

[edit]

Letx,yM. Here,

  1. xchronologically precedesy ifyx is future-directed timelike. This relation has thetransitive property and so can be writtenx <y.
  2. xcausally precedesy ifyx is future-directed null or future-directed timelike. It gives apartial ordering of spacetime and so can be writtenxy.

SupposexM is timelike. Then thesimultaneous hyperplane forx is{y :η(x,y) = 0}. Since thishyperplane varies asx varies, there is arelativity of simultaneity in Minkowski space.

Generalizations

[edit]
Main articles:Lorentzian manifold andSuper Minkowski space

A Lorentzian manifold is a generalization of Minkowski space in two ways. The total number of spacetime dimensions is not restricted to be4 (2 or more) and a Lorentzian manifold need not be flat, i.e. it allows for curvature.

Complexified Minkowski space

[edit]

Complexified Minkowski space is defined asMc =MiM.[20] Its real part is the Minkowski space offour-vectors, such as thefour-velocity and thefour-momentum, which are independent of the choice oforientation of the space. The imaginary part, on the other hand, may consist of four pseudovectors, such asangular velocity andmagnetic moment, which change their direction with a change of orientation. Apseudoscalari is introduced, which also changes sign with a change of orientation. Thus, elements ofMc are independent of the choice of the orientation.

Theinner product-like structure onMc is defined asuv =η(u,v) for anyu,vMc. A relativistic purespin of anelectron or any half spin particle is described byρ Mc asρ =u +is, whereu is the four-velocity of the particle, satisfyingu2 = 1 ands is the 4D spin vector,[21] which is also thePauli–Lubanski pseudovector satisfyings2 = −1 andus = 0.

Generalized Minkowski space

[edit]

Minkowski space refers to a mathematical formulation in four dimensions. However, the mathematics can easily be extended or simplified to create an analogous generalized Minkowski space in any number of dimensions. Ifn ≥ 2,n-dimensional Minkowski space is a vector space of real dimensionn on which there is a constant Minkowski metric of signature(n − 1, 1) or(1,n − 1). These generalizations are used in theories where spacetime is assumed to have more or less than4 dimensions.String theory andM-theory are two examples wheren > 4. In string theory there appearconformal field theories with1 + 1 spacetime dimensions.

de Sitter space can be formulated as a submanifold of generalized Minkowski space as can the model spaces ofhyperbolic geometry (see below).

Curvature

[edit]
Main article:Curvature

As aflat spacetime, the three spatial components of Minkowski spacetime always obey thePythagorean Theorem. Minkowski space is a suitable basis for special relativity, a good description of physical systems over finite distances in systems without significantgravitation. However, in order to take gravity into account, physicists use the theory ofgeneral relativity, which is formulated in the mathematics ofdifferential geometry ofdifferential manifolds. When this geometry is used as a model of spacetime, it is known ascurved spacetime.

Even in curved spacetime, Minkowski space is still a good description in aninfinitesimal region surrounding any point (barring gravitational singularities).[nb 5] More abstractly, it can be said that in the presence of gravity spacetime is described by a curved 4-dimensionalmanifold for which thetangent space to any point is a 4-dimensional Minkowski space. Thus, the structure of Minkowski space is still essential in the description of general relativity.

Geometry

[edit]
Main article:Hyperboloid model

The meaning of the termgeometry for the Minkowski space depends heavily on the context. Minkowski space is not endowed with Euclidean geometry, and not with any of the generalized Riemannian geometries with intrinsic curvature, those exposed by themodel spaces inhyperbolic geometry (negative curvature) and the geometry modeled by thesphere (positive curvature). The reason is the indefiniteness of the Minkowski metric. Minkowski space is, in particular, not ametric space and not a Riemannian manifold with a Riemannian metric. However, Minkowski space containssubmanifolds endowed with a Riemannian metric yielding hyperbolic geometry.

Model spaces of hyperbolic geometry of low dimension, say 2 or 3,cannot be isometrically embedded in Euclidean space with one more dimension, i.e.R3{\displaystyle \mathbb {R} ^{3}} orR4{\displaystyle \mathbb {R} ^{4}} respectively, with the Euclidean metricg¯{\displaystyle {\overline {g}}}, preventing easy visualization.[nb 6][22] By comparison, model spaces with positive curvature are just spheres in Euclidean space of one higher dimension.[23] Hyperbolic spacescan be isometrically embedded in spaces of one more dimension when the embedding space is endowed with the Minkowski metricη{\displaystyle \eta }.

DefineHR1(n)Mn+1{\displaystyle \mathbf {H} _{R}^{1(n)}\subset \mathbf {M} ^{n+1}}to be the upper sheet (ct>0{\displaystyle ct>0}) of thehyperboloidHR1(n)={(ct,x1,,xn)Mn:c2t2(x1)2(xn)2=R2,ct>0}{\displaystyle \mathbf {H} _{R}^{1(n)}=\left\{\left(ct,x^{1},\ldots ,x^{n}\right)\in \mathbf {M} ^{n}:c^{2}t^{2}-\left(x^{1}\right)^{2}-\cdots -\left(x^{n}\right)^{2}=R^{2},ct>0\right\}}in generalized Minkowski spaceMn+1{\displaystyle \mathbf {M} ^{n+1}} of spacetime dimensionn+1.{\displaystyle n+1.} This is one of thesurfaces of transitivity of the generalized Lorentz group. Theinduced metric on this submanifold,hR1(n)=ιη,{\displaystyle h_{R}^{1(n)}=\iota ^{*}\eta ,}thepullback of the Minkowski metricη{\displaystyle \eta } under inclusion, is aRiemannian metric. With this metricHR1(n){\displaystyle \mathbf {H} _{R}^{1(n)}} is aRiemannian manifold. It is one of the model spaces of Riemannian geometry, thehyperboloid model ofhyperbolic space. It is a space of constant negative curvature1/R2{\displaystyle -1/R^{2}}.[24] The 1 in the upper index refers to an enumeration of the different model spaces of hyperbolic geometry, and then for its dimension. A2(2){\displaystyle 2(2)} corresponds to thePoincaré disk model, while3(n){\displaystyle 3(n)} corresponds to thePoincaré half-space model of dimensionn.{\displaystyle n.}

Preliminaries

[edit]

In the definition aboveι:HR1(n)Mn+1{\displaystyle \iota :\mathbf {H} _{R}^{1(n)}\rightarrow \mathbf {M} ^{n+1}} is theinclusion map and the superscript star denotes thepullback. The present purpose is to describe this and similar operations as a preparation for the actual demonstration thatHR1(n){\displaystyle \mathbf {H} _{R}^{1(n)}} actually is a hyperbolic space.

Behavior of tensors under inclusion, pullback of covariant tensors under general maps and pushforward of vectors under general maps

Behavior of tensors under inclusion:
For inclusion maps from a submanifoldS intoM and a covariant tensorα of orderk onM it holds thatια(X1,X2,,Xk)=α(ιX1,ιX2,,ιXk)=α(X1,X2,,Xk),{\displaystyle \iota ^{*}\alpha \left(X_{1},\,X_{2},\,\ldots ,\,X_{k}\right)=\alpha \left(\iota _{*}X_{1},\,\iota _{*}X_{2},\,\ldots ,\,\iota _{*}X_{k}\right)=\alpha \left(X_{1},\,X_{2},\,\ldots ,\,X_{k}\right),}whereX1,X1, ...,Xk are vector fields onS. The subscript star denotes the pushforward (to be introduced later), and it is in this special case simply the identity map (as is the inclusion map). The latter equality holds because a tangent space to a submanifold at a point is in a canonical way a subspace of the tangent space of the manifold itself at the point in question. One may simply writeια=α|S,{\displaystyle \iota ^{*}\alpha =\alpha |_{S},}meaning (with slightabuse of notation) the restriction ofα to accept as input vectors tangent to somesS only.

Pullback of tensors under general maps:
The pullback of a covariantk-tensorα (one taking only contravariant vectors as arguments) under a mapF:MN is a linear mapF:TF(p)kNTpkM,{\displaystyle F^{*}\colon T_{F(p)}^{k}N\rightarrow T_{p}^{k}M,}where for any vector spaceV,TkV=VVVk times.{\displaystyle T^{k}V=\underbrace {V^{*}\otimes V^{*}\otimes \cdots \otimes V^{*}} _{k{\text{ times}}}.}

It is defined byF(α)(X1,X2,,Xk)=α(FX1,FX2,,FXk),{\displaystyle F^{*}(\alpha )\left(X_{1},\,X_{2},\,\ldots ,\,X_{k}\right)=\alpha \left(F_{*}X_{1},\,F_{*}X_{2},\,\ldots ,\,F_{*}X_{k}\right),}where the subscript star denotes thepushforward of the mapF, andX1,X2, ..., Xk are vectors inTpM. (This is in accord with what was detailed about the pullback of the inclusion map. In the general case here, one cannot proceed as simply becauseFX1X1 in general.)

The pushforward of vectors under general maps:
Heuristically, pulling back a tensor topM fromF(p) ∈N feeding it vectors residing atpM is by definition the same as pushing forward the vectors frompM toF(p) ∈N feeding them to the tensor residing atF(p) ∈N.

Further unwinding the definitions, the pushforwardF:TMpTNF(p) of a vector field under a mapF:MN between manifolds is defined byF(X)f=X(fF),{\displaystyle F_{*}(X)f=X(f\circ F),}wheref is a function onN. WhenM =Rm,N=Rn the pushforward ofF reduces toDF:RmRn, the ordinarydifferential, which is given by theJacobian matrix of partial derivatives of the component functions. The differential is the best linear approximation of a functionF fromRm toRn. The pushforward is the smooth manifold version of this. It acts between tangent spaces, and is in coordinates represented by the Jacobian matrix of thecoordinate representation of the function.

The corresponding pullback is thedual map from the dual of the range tangent space to the dual of the domain tangent space, i.e. it is a linear map,F:TF(p)NTpM.{\displaystyle F^{*}\colon T_{F(p)}^{*}N\rightarrow T_{p}^{*}M.}

Hyperbolic stereographic projection

[edit]
Red circular arc is geodesic inPoincaré disk model; it projects to the brown geodesic on the green hyperboloid.

In order to exhibit the metric, it is necessary to pull it back via a suitableparametrization. A parametrization of a submanifoldS of a manifoldM is a mapURmM whose range is an open subset ofS. IfS has the same dimension asM, a parametrization is just the inverse of a coordinate mapφ:MURm. The parametrization to be used is the inverse ofhyperbolic stereographic projection. This is illustrated in the figure to the right forn = 2. It is instructive to compare tostereographic projection for spheres.

Stereographic projectionσ:Hn
R
Rn
and its inverseσ−1:RnHn
R
are given byσ(τ,x)=u=RxR+τ,σ1(u)=(τ,x)=(RR2+|u|2R2|u|2,2R2uR2|u|2),{\displaystyle {\begin{aligned}\sigma (\tau ,\mathbf {x} )=\mathbf {u} &={\frac {R\mathbf {x} }{R+\tau }},\\\sigma ^{-1}(\mathbf {u} )=(\tau ,\mathbf {x} )&=\left(R{\frac {R^{2}+|u|^{2}}{R^{2}-|u|^{2}}},{\frac {2R^{2}\mathbf {u} }{R^{2}-|u|^{2}}}\right),\end{aligned}}}where, for simplicity,τct. The(τ,x) are coordinates onMn+1 and theu are coordinates onRn.

Detailed derivation

LetHRn={(τ,x1,,xn)M:τ2+(x1)2++(xn)2=R2,τ>0}{\displaystyle \mathbf {H} _{R}^{n}=\left\{\left(\tau ,x^{1},\ldots ,x^{n}\right)\subset \mathbf {M} :-\tau ^{2}+\left(x^{1}\right)^{2}+\cdots +\left(x^{n}\right)^{2}=-R^{2},\tau >0\right\}}and letS=(R,0,,0).{\displaystyle S=(-R,0,\ldots ,0).}

IfP=(τ,x1,,xn)HRn,{\displaystyle P=\left(\tau ,x^{1},\ldots ,x^{n}\right)\in \mathbf {H} _{R}^{n},}then it is geometrically clear that the vectorPS{\displaystyle {\overrightarrow {PS}}}intersects the hyperplane{(τ,x1,,xn)M:τ=0}{\displaystyle \left\{\left(\tau ,x^{1},\ldots ,x^{n}\right)\in M:\tau =0\right\}}once in point denotedU=(0,u1(P),,un(P))(0,u).{\displaystyle U=\left(0,u^{1}(P),\ldots ,u^{n}(P)\right)\equiv (0,\mathbf {u} ).}

One hasS+SU=USU=US,S+SP=PSP=PS{\displaystyle {\begin{aligned}S+{\overrightarrow {SU}}&=U\Rightarrow {\overrightarrow {SU}}=U-S,\\S+{\overrightarrow {SP}}&=P\Rightarrow {\overrightarrow {SP}}=P-S\end{aligned}}}orSU=(0,u)(R,0)=(R,u),SP=(τ,x)(R,0)=(τ+R,x)..{\displaystyle {\begin{aligned}{\overrightarrow {SU}}&=(0,\mathbf {u} )-(-R,\mathbf {0} )=(R,\mathbf {u} ),\\{\overrightarrow {SP}}&=(\tau ,\mathbf {x} )-(-R,\mathbf {0} )=(\tau +R,\mathbf {x} ).\end{aligned}}.}

By construction of stereographic projection one hasSU=λ(τ)SP.{\displaystyle {\overrightarrow {SU}}=\lambda (\tau ){\overrightarrow {SP}}.}

This leads to the system of equationsR=λ(τ+R),u=λx.{\displaystyle {\begin{aligned}R&=\lambda (\tau +R),\\\mathbf {u} &=\lambda \mathbf {x} .\end{aligned}}}

The first of these is solved forλ and one obtains for stereographic projectionσ(τ,x)=u=RxR+τ.{\displaystyle \sigma (\tau ,\mathbf {x} )=\mathbf {u} ={\frac {R\mathbf {x} }{R+\tau }}.}

Next, the inverseσ−1(u) = (τ,x) must be calculated. Use the same considerations as before, but now withU=(0,u)P=(τ(u),x(u)).,{\displaystyle {\begin{aligned}U&=(0,\mathbf {u} )\\P&=(\tau (\mathbf {u} ),\mathbf {x} (\mathbf {u} )).\end{aligned}},}one getsτ=R(1λ)λ,x=uλ,{\displaystyle {\begin{aligned}\tau &={\frac {R(1-\lambda )}{\lambda }},\\\mathbf {x} &={\frac {\mathbf {u} }{\lambda }},\end{aligned}}}but now withλ depending onu. The condition forP lying in the hyperboloid isτ2+|x|2=R2,{\displaystyle -\tau ^{2}+|\mathbf {x} |^{2}=-R^{2},}orR2(1λ)2λ2+|u|2λ2=R2,{\displaystyle -{\frac {R^{2}(1-\lambda )^{2}}{\lambda ^{2}}}+{\frac {|\mathbf {u} |^{2}}{\lambda ^{2}}}=-R^{2},}leading toλ=R2|u|22R2.{\displaystyle \lambda ={\frac {R^{2}-|u|^{2}}{2R^{2}}}.}

With thisλ, one obtainsσ1(u)=(τ,x)=(RR2+|u|2R2|u|2,2R2uR2|u|2).{\displaystyle \sigma ^{-1}(\mathbf {u} )=(\tau ,\mathbf {x} )=\left(R{\frac {R^{2}+|u|^{2}}{R^{2}-|u|^{2}}},{\frac {2R^{2}\mathbf {u} }{R^{2}-|u|^{2}}}\right).}

Pulling back the metric

[edit]

One hashR1(n)=η|HR1(n)=(dx1)2++(dxn)2dτ2{\displaystyle h_{R}^{1(n)}=\eta |_{\mathbf {H} _{R}^{1(n)}}=\left(dx^{1}\right)^{2}+\cdots +\left(dx^{n}\right)^{2}-d\tau ^{2}}and the mapσ1:RnHR1(n);σ1(u)=(τ(u),x(u))=(RR2+|u|2R2|u|2,2R2uR2|u|2).{\displaystyle \sigma ^{-1}:\mathbf {R} ^{n}\rightarrow \mathbf {H} _{R}^{1(n)};\quad \sigma ^{-1}(\mathbf {u} )=(\tau (\mathbf {u} ),\,\mathbf {x} (\mathbf {u} ))=\left(R{\frac {R^{2}+|u|^{2}}{R^{2}-|u|^{2}}},\,{\frac {2R^{2}\mathbf {u} }{R^{2}-|u|^{2}}}\right).}

The pulled back metric can be obtained by straightforward methods of calculus;(σ1)η|HR1(n)=(dx1(u))2++(dxn(u))2(dτ(u))2.{\displaystyle \left.\left(\sigma ^{-1}\right)^{*}\eta \right|_{\mathbf {H} _{R}^{1(n)}}=\left(dx^{1}(\mathbf {u} )\right)^{2}+\cdots +\left(dx^{n}(\mathbf {u} )\right)^{2}-\left(d\tau (\mathbf {u} )\right)^{2}.}

One computes according to the standard rules for computing differentials (though one is really computing the rigorously defined exterior derivatives),dx1(u)=d(2R2u1R2|u|2)=u12R2u1R2|u|2du1++un2R2u1R2|u|2dun+τ2R2u1R2|u|2dτ,  dxn(u)=d(2R2unR2|u|2)=,dτ(u)=d(RR2+|u|2R2|u|2)=,{\displaystyle {\begin{aligned}dx^{1}(\mathbf {u} )&=d\left({\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}\right)={\frac {\partial }{\partial u^{1}}}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}du^{1}+\cdots +{\frac {\partial }{\partial u^{n}}}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}du^{n}+{\frac {\partial }{\partial \tau }}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}d\tau ,\\&\ \ \vdots \\dx^{n}(\mathbf {u} )&=d\left({\frac {2R^{2}u^{n}}{R^{2}-|u|^{2}}}\right)=\cdots ,\\d\tau (\mathbf {u} )&=d\left(R{\frac {R^{2}+|u|^{2}}{R^{2}-|u|^{2}}}\right)=\cdots ,\end{aligned}}}and substitutes the results into the right hand side. This yields(σ1)hR1(n)=4R2[(du1)2++(dun)2](R2|u|2)2hR2(n).{\displaystyle \left(\sigma ^{-1}\right)^{*}h_{R}^{1(n)}={\frac {4R^{2}\left[\left(du^{1}\right)^{2}+\cdots +\left(du^{n}\right)^{2}\right]}{\left(R^{2}-|u|^{2}\right)^{2}}}\equiv h_{R}^{2(n)}.}

Detailed outline of computation

One hasu12R2u1R2|u|2du1=2(R2|u|2)+4R2(u1)2(R2|u|2)2du1,u22R2u1R2|u|2du2=4R2u1u2(R2|u|2)2du2,{\displaystyle {\begin{aligned}{\frac {\partial }{\partial u^{1}}}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}du^{1}&={\frac {2\left(R^{2}-|u|^{2}\right)+4R^{2}\left(u^{1}\right)^{2}}{\left(R^{2}-|u|^{2}\right)^{2}}}du^{1},\\{\frac {\partial }{\partial u^{2}}}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}du^{2}&={\frac {4R^{2}u^{1}u^{2}}{\left(R^{2}-|u|^{2}\right)^{2}}}du^{2},\end{aligned}}}andτ2R2u1R2|u|2dτ2=0.{\displaystyle {\frac {\partial }{\partial \tau }}{\frac {2R^{2}u^{1}}{R^{2}-|u|^{2}}}d\tau ^{2}=0.}

With this one may writedx1(u)=2R2(R2|u|2)du1+4R2u1(udu)(R2|u|2)2,{\displaystyle dx^{1}(\mathbf {u} )={\frac {2R^{2}\left(R^{2}-|u|^{2}\right)du^{1}+4R^{2}u^{1}(\mathbf {u} \cdot d\mathbf {u} )}{\left(R^{2}-|u|^{2}\right)^{2}}},}from which(dx1(u))2=4R2(r2|u|2)2(du1)2+16R4(R2|u|2)(udu)u1du1+16R4(u1)2(udu)2(R2|u|2)4.{\displaystyle \left(dx^{1}(\mathbf {u} )\right)^{2}={\frac {4R^{2}\left(r^{2}-|u|^{2}\right)^{2}\left(du^{1}\right)^{2}+16R^{4}\left(R^{2}-|u|^{2}\right)\left(\mathbf {u} \cdot d\mathbf {u} \right)u^{1}du^{1}+16R^{4}\left(u^{1}\right)^{2}\left(\mathbf {u} \cdot d\mathbf {u} \right)^{2}}{\left(R^{2}-|u|^{2}\right)^{4}}}.}

Summing this formula one obtains(dx1(u))2++(dxn(u))2=4R2(R2|u|2)2[(du1)2++(dun)2]+16R4(R2|u|2)(udu)(udu)+16R4|u|2(udu)2(R2|u|2)4=4R2(R2|u|2)2[(du1)2++(dun)2](R2|u|2)4+R216R4(udu)(R2|u|2)4.{\displaystyle {\begin{aligned}&\left(dx^{1}(\mathbf {u} )\right)^{2}+\cdots +\left(dx^{n}(\mathbf {u} )\right)^{2}\\={}&{\frac {4R^{2}\left(R^{2}-|u|^{2}\right)^{2}\left[\left(du^{1}\right)^{2}+\cdots +\left(du^{n}\right)^{2}\right]+16R^{4}\left(R^{2}-|u|^{2}\right)(\mathbf {u} \cdot d\mathbf {u} )(\mathbf {u} \cdot d\mathbf {u} )+16R^{4}|u|^{2}(\mathbf {u} \cdot d\mathbf {u} )^{2}}{\left(R^{2}-|u|^{2}\right)^{4}}}\\={}&{\frac {4R^{2}\left(R^{2}-|u|^{2}\right)^{2}\left[\left(du^{1}\right)^{2}+\cdots +\left(du^{n}\right)^{2}\right]}{\left(R^{2}-|u|^{2}\right)^{4}}}+R^{2}{\frac {16R^{4}(\mathbf {u} \cdot d\mathbf {u} )}{\left(R^{2}-|u|^{2}\right)^{4}}}.\end{aligned}}}

Similarly, forτ one getsdτ=i=1nuiRR2+|u|2R2+|u|2dui+τRR2+|u|2R2+|u|2dτ=i=1nR44R2uidui(R2|u|2),{\displaystyle d\tau =\sum _{i=1}^{n}{\frac {\partial }{\partial u^{i}}}R{\frac {R^{2}+|u|^{2}}{R^{2}+|u|^{2}}}du^{i}+{\frac {\partial }{\partial \tau }}R{\frac {R^{2}+|u|^{2}}{R^{2}+|u|^{2}}}d\tau =\sum _{i=1}^{n}R^{4}{\frac {4R^{2}u^{i}du^{i}}{\left(R^{2}-|u|^{2}\right)}},}yieldingdτ2=(R4R4(udu)(R2|u|2)2)2=R216R4(udu)2(R2|u|2)4.{\displaystyle -d\tau ^{2}=-\left(R{\frac {4R^{4}\left(\mathbf {u} \cdot d\mathbf {u} \right)}{\left(R^{2}-|u|^{2}\right)^{2}}}\right)^{2}=-R^{2}{\frac {16R^{4}(\mathbf {u} \cdot d\mathbf {u} )^{2}}{\left(R^{2}-|u|^{2}\right)^{4}}}.}

Now add this contribution to finally get(σ1)hR1(n)=4R2[(du1)2++(dun)2](R2|u|2)2hR2(n).{\displaystyle \left(\sigma ^{-1}\right)^{*}h_{R}^{1(n)}={\frac {4R^{2}\left[\left(du^{1}\right)^{2}+\cdots +\left(du^{n}\right)^{2}\right]}{\left(R^{2}-|u|^{2}\right)^{2}}}\equiv h_{R}^{2(n)}.}

This last equation shows that the metric on the ball is identical to the Riemannian metrich2(n)
R
in thePoincaré ball model, another standard model of hyperbolic geometry.

Alternative calculation using the pushforward

The pullback can be computed in a different fashion. By definition,(σ1)hR1(n)(V,V)=hR1(n)((σ1)V,(σ1)V)=η|HR1(n)((σ1)V,(σ1)V).{\displaystyle \left(\sigma ^{-1}\right)^{*}h_{R}^{1(n)}(V,\,V)=h_{R}^{1(n)}\left(\left(\sigma ^{-1}\right)_{*}V,\,\left(\sigma ^{-1}\right)_{*}V\right)=\eta |_{\mathbf {H} _{R}^{1(n)}}\left(\left(\sigma ^{-1}\right)_{*}V,\,\left(\sigma ^{-1}\right)_{*}V\right).}

In coordinates,(σ1)V=(σ1)Viui=Vixjuixj+Viτuiτ=Vixjuixj+Viτuiτ=Vxjxj+Vττ.{\displaystyle \left(\sigma ^{-1}\right)_{*}V=\left(\sigma ^{-1}\right)_{*}V^{i}{\frac {\partial }{\partial u^{i}}}=V^{i}{\frac {\partial x^{j}}{\partial u^{i}}}{\frac {\partial }{\partial x^{j}}}+V^{i}{\frac {\partial \tau }{\partial u^{i}}}{\frac {\partial }{\partial \tau }}=V^{i}{\frac {\partial }{x}}^{j}{\partial u^{i}}{\frac {\partial }{\partial x^{j}}}+V^{i}{\frac {\partial }{\tau }}{\partial u^{i}}{\frac {\partial }{\partial \tau }}=Vx^{j}{\frac {\partial }{\partial x^{j}}}+V\tau {\frac {\partial }{\partial \tau }}.}

One has from the formula forσ–1Vxj=Viui(2R2ujR2|u|2)=2R2VjR2|u|24R2ujV,u(R2|u|2)2,(here V|u|2=2k=1nVkuk2V,u)Vτ=V(RR2+|u|2R2|u|2)=4R3V,u(R2|u|2)2.{\displaystyle {\begin{aligned}Vx^{j}&=V^{i}{\frac {\partial }{\partial u^{i}}}\left({\frac {2R^{2}u^{j}}{R^{2}-|u|^{2}}}\right)={\frac {2R^{2}V^{j}}{R^{2}-|u|^{2}}}-{\frac {4R^{2}u^{j}\langle \mathbf {V} ,\,\mathbf {u} \rangle }{\left(R^{2}-|u|^{2}\right)^{2}}},\quad \left({\text{here }}V|u|^{2}=2\sum _{k=1}^{n}V^{k}u^{k}\equiv 2\langle \mathbf {V} ,\,\mathbf {u} \rangle \right)\\V\tau &=V\left(R{\frac {R^{2}+|u|^{2}}{R^{2}-|u|^{2}}}\right)={\frac {4R^{3}\langle \mathbf {V} ,\,\mathbf {u} \rangle }{\left(R^{2}-|u|^{2}\right)^{2}}}.\end{aligned}}}

Lastly,η(σ1V,σ1V)=j=1n(Vxj)2(Vτ)2=4R4|V|2(R2|u|2)2=hR2(n)(V,z,V),{\displaystyle \eta \left(\sigma _{*}^{-1}V,\,\sigma _{*}^{-1}V\right)=\sum _{j=1}^{n}\left(Vx^{j}\right)^{2}-(V\tau )^{2}={\frac {4R^{4}|V|^{2}}{\left(R^{2}-|u|^{2}\right)^{2}}}=h_{R}^{2(n)}(V,z,V),}and the same conclusion is reached.

See also

[edit]

Remarks

[edit]
  1. ^This makes spacetime distance aninvariant.
  2. ^Translate the coordinate system so that the event is the new origin.
  3. ^This corresponds to the time coordinate either increasing or decreasing when the proper time for any particle increases. An application ofT flips this direction.
  4. ^For comparison and motivation of terminology, take aRiemannian metric, which provides a positive definite symmetric bilinear form, i. e. aninner product proper at each point on a manifold.
  5. ^This similarity betweenflat space and curved space at infinitesimally small distance scales is foundational to the definition of amanifold in general.
  6. ^Thereis an isometric embedding intoRn{\displaystyle \mathbb {R} ^{n}} according to theNash embedding theorem (Nash (1956)), but the embedding dimension is much higher,n = (m/2)(m + 1)(3m + 11) for a Riemannian manifold of dimensionm.

Notes

[edit]
  1. ^"Minkowski"Archived 2019-06-22 at theWayback Machine.Random House Webster's Unabridged Dictionary.
  2. ^Lee 1997, p. 31
  3. ^Poincaré 1905–1906, pp. 129–176 Wikisource translation:On the Dynamics of the Electron
  4. ^Minkowski 1907–1908, pp. 53–111 *Wikisource translation:s:Translation:The Fundamental Equations for Electromagnetic Processes in Moving Bodies
  5. ^abMinkowski 1908–1909, pp. 75–88 Various English translations on Wikisource: "Space and Time"
  6. ^Cornelius Lanczos (1972) "Einstein's Path from Special to General Relativity", pages 5–19 ofGeneral Relativity: Papers in Honour of J. L. Synge, L. O'Raifeartaigh editor,Clarendon Press, see page 11
  7. ^See Schutz's proof p 148, also Naber p. 48
  8. ^Schutz p. 148, Naber p. 49
  9. ^Schutz p. 148
  10. ^Lee 1997, p. 15
  11. ^Lee 2003, See Lee's discussion on geometric tangent vectors early in chapter 3.
  12. ^Giulini 2008 pp. 5, 6
  13. ^Sean M. Carroll (2019).Spacetime and Geometry (illustrated, herdruk ed.). Cambridge University Press. p. 7.ISBN 978-1-108-48839-6.
  14. ^Sard 1970, p. 71
  15. ^Minkowski,Landau & Lifshitz 2002, p. 4
  16. ^abMisner, Thorne & Wheeler 1973
  17. ^Lee 2003. One point in Lee's proof of the existence of this map needs modification (Lee deals withRiemannian metrics.). Where Lee refers to positive definiteness to show the injectivity of the map, one needs instead appeal to non-degeneracy.
  18. ^Lee 2003, The tangent-cotangent isomorphism p. 282
  19. ^Lee 2003
  20. ^Y. Friedman, A Physically Meaningful Relativistic Description of the Spin State of an Electron, Symmetry 2021, 13(10), 1853;https://doi.org/10.3390/sym13101853Archived 2023-08-13 at theWayback Machine
  21. ^Jackson, J.D., Classical Electrodynamics, 3rd ed.; John Wiley \& Sons: Hoboken, NJ, US, 1998
  22. ^Lee 1997, p. 66
  23. ^Lee 1997, p. 33
  24. ^Lee 1997

References

[edit]

External links

[edit]

Media related toMinkowski diagrams at Wikimedia Commons

Special
relativity
Background
Fundamental
concepts
Formulation
Phenomena
Spacetime
General
relativity
Background
Fundamental
concepts
Formulation
Phenomena
Advanced
theories
Solutions
Scientists
International
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Minkowski_space&oldid=1317870941"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp