Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Lorentz transformation

From Wikipedia, the free encyclopedia
Family of linear transformations
Hendrik Lorentz in 1916.
Part of a series on
Spacetime

Inphysics, theLorentz transformations are a six-parameter family oflineartransformations from acoordinate frame inspacetime to another frame that moves at a constantvelocity relative to the former. The respective inverse transformation is then parameterized by the negative of this velocity. The transformations are named after the DutchphysicistHendrik Lorentz.

The most common form of the transformation, parametrized by the real constantv,{\displaystyle v,} representing a velocity confined to thex-direction, is expressed as[1][2]t=γ(tvxc2)x=γ(xvt)y=yz=z{\displaystyle {\begin{aligned}t'&=\gamma \left(t-{\frac {vx}{c^{2}}}\right)\\x'&=\gamma \left(x-vt\right)\\y'&=y\\z'&=z\end{aligned}}}where(t,x,y,z) and(t′,x′,y′,z′) are the coordinates of an event in two frames with the spatial origins coinciding att =t′ = 0, where the primed frame is seen from the unprimed frame as moving with speedv along thex-axis, wherec is thespeed of light, andγ=11v2/c2{\displaystyle \gamma ={\frac {1}{\sqrt {1-v^{2}/c^{2}}}}}is theLorentz factor. When speedv is much smaller thanc, the Lorentz factor is negligibly different from 1, but asv approachesc,γ{\displaystyle \gamma } grows without bound. The value ofv must be smaller thanc for the transformation to make sense.

Expressing the speed as a fraction of the speed of light,β=v/c,{\textstyle \beta =v/c,} an equivalent form of the transformation is[3]ct=γ(ctβx)x=γ(xβct)y=yz=z.{\displaystyle {\begin{aligned}ct'&=\gamma \left(ct-\beta x\right)\\x'&=\gamma \left(x-\beta ct\right)\\y'&=y\\z'&=z.\end{aligned}}}

Frames of reference can be divided into two groups:inertial (relative motion with constant velocity) andnon-inertial (accelerating, moving in curved paths, rotational motion with constantangular velocity, etc.). The term "Lorentz transformations" only refers to transformations betweeninertial frames, usually in the context of special relativity.

In eachreference frame, an observer can use a local coordinate system (usuallyCartesian coordinates in this context) to measure lengths, and a clock to measure time intervals. Anevent is something that happens at a point in space at an instant of time, or more formally a point inspacetime. The transformations connect the space and time coordinates of anevent as measured by an observer in each frame.[nb 1]

They supersede theGalilean transformation ofNewtonian physics, which assumes anabsolute space and time (seeGalilean relativity). The Galilean transformation is a good approximation only at relative speeds much less than the speed of light. Lorentz transformations have a number of unintuitive features that do not appear in Galilean transformations. For example, they reflect the fact that observers moving at differentvelocities may measure differentdistances,elapsed times, and even differentorderings of events, but always such that thespeed of light is the same in all inertial reference frames. The invariance of light speed is one of thepostulates of special relativity.

Historically, the transformations were the result of attempts by Lorentz and others to explain how the speed oflight was observed to be independent of thereference frame, and to understand the symmetries of the laws ofelectromagnetism. The transformations later became a cornerstone forspecial relativity.

The Lorentz transformation is alinear transformation. It may include a rotation of space; a rotation-free Lorentz transformation is called aLorentz boost. InMinkowski space—the mathematical model of spacetime in special relativity—the Lorentz transformations preserve thespacetime interval between any two events. They describe only the transformations in which the spacetime event at the origin is left fixed. They can be considered as ahyperbolic rotation of Minkowski space. The more general set of transformations that also includes translations is known as thePoincaré group.

History

[edit]
Main article:History of Lorentz transformations

Many physicists—includingWoldemar Voigt,George FitzGerald,Joseph Larmor, andHendrik Lorentz[4] himself—had been discussing the physics implied by these equations since 1887.[5] Early in 1889,Oliver Heaviside had shown fromMaxwell's equations that theelectric field surrounding a spherical distribution of charge should cease to havespherical symmetry once the charge is in motion relative to theluminiferous aether. FitzGerald then conjectured that Heaviside's distortion result might be applied to a theory of intermolecular forces. Some months later, FitzGerald published the conjecture that bodies in motion are being contracted, in order to explain the baffling outcome of the1887 aether-wind experiment of Michelson and Morley. In 1892, Lorentz independently presented the same idea in a more detailed manner, which was subsequently calledFitzGerald–Lorentz contraction hypothesis.[6] Their explanation was widely known before 1905.[7]

Lorentz (1892–1904) and Larmor (1897–1900), who believed the luminiferous aether hypothesis, also looked for the transformation under whichMaxwell's equations are invariant when transformed from the aether to a moving frame. They extended theFitzGerald–Lorentz contraction hypothesis and found out that the time coordinate has to be modified as well ("local time").Henri Poincaré gave a physical interpretation to local time (to first order inv/c, the relative velocity of the two reference frames normalized to the speed of light) as the consequence of clock synchronization, under the assumption that the speed of light is constant in moving frames.[8] Larmor is credited to have been the first to understand the crucialtime dilation property inherent in his equations.[9]

In 1905, Poincaré was the first to recognize that the transformation has the properties of amathematical group,and he named it after Lorentz.[10]Later in the same yearAlbert Einstein published what is now calledspecial relativity, by deriving the Lorentz transformation under the assumptions of theprinciple of relativity and the constancy of the speed of light in anyinertial reference frame, and by abandoning the mechanistic aether as unnecessary.[11]

Derivation of the group of Lorentz transformations

[edit]
Main articles:Derivations of the Lorentz transformations andLorentz group

Anevent is something that happens at a certain point in spacetime, or more generally, the point in spacetime itself. In any inertial frame an event is specified by a time coordinatect and a set ofCartesian coordinatesx,y,z to specify position in space in that frame. Subscripts label individual events.

From Einstein'ssecond postulate of relativity (invariance ofc) it follows that:

c2(t2t1)2(x2x1)2(y2y1)2(z2z1)2=0(lightlike separated events 1, 2){\displaystyle c^{2}(t_{2}-t_{1})^{2}-(x_{2}-x_{1})^{2}-(y_{2}-y_{1})^{2}-(z_{2}-z_{1})^{2}=0\quad {\text{(lightlike separated events 1, 2)}}}D1

in all inertial frames for events connected bylight signals. The quantity on the left is called thespacetime interval between eventsa1 = (t1,x1,y1,z1) anda2 = (t2,x2,y2,z2). The interval betweenany two events, not necessarily separated by light signals, is in fact invariant, i.e., independent of the state of relative motion of observers in different inertial frames, as isshown using homogeneity and isotropy of space. The transformation sought after thus must possess the property that:

c2(t2t1)2(x2x1)2(y2y1)2(z2z1)2=c2(t2t1)2(x2x1)2(y2y1)2(z2z1)2(all events 1, 2).{\displaystyle {\begin{aligned}c^{2}(t_{2}-t_{1})^{2}-(x_{2}-x_{1})^{2}-(y_{2}-y_{1})^{2}-(z_{2}-z_{1})^{2}&=&\\[6pt]c^{2}(t_{2}'-t_{1}')^{2}-(x_{2}'-x_{1}')^{2}-(y_{2}'-y_{1}')^{2}-(z_{2}'-z_{1}')^{2}&&\quad {\text{(all events 1, 2)}}.\end{aligned}}}D2

where(t,x,y,z) are the spacetime coordinates used to define events in one frame, and(t′,x′,y′,z′) are the coordinates in another frame. First one observes that (D2) is satisfied if an arbitrary4-tupleb of numbers are added to eventsa1 anda2. Such transformations are calledspacetime translations and are not dealt with further here. Then one observes that alinear solution preserving the origin of the simpler problem solves the general problem too:

c2t2x2y2z2=c2t2x2y2z2orc2t1t2x1x2y1y2z1z2=c2t1t2x1x2y1y2z1z2{\displaystyle {\begin{aligned}c^{2}t^{2}-x^{2}-y^{2}-z^{2}&=c^{2}t'^{2}-x'^{2}-y'^{2}-z'^{2}&\quad {\text{or}}\\[6pt]c^{2}t_{1}t_{2}-x_{1}x_{2}-y_{1}y_{2}-z_{1}z_{2}&=c^{2}t'_{1}t'_{2}-x'_{1}x'_{2}-y'_{1}y'_{2}-z'_{1}z'_{2}&\end{aligned}}}D3

(a solution satisfying the first formula automatically satisfies the second one as well; seePolarization identity). Finding the solution to the simpler problem is just a matter of look-up in the theory ofclassical groups that preservebilinear forms of various signature.[nb 2] First equation in (D3) can be written more compactly as:

(a,a)=(a,a)oraa=aa,{\displaystyle (a,a)=(a',a')\quad {\text{or}}\quad a\cdot a=a'\cdot a',}D4

where(·, ·) refers to the bilinear form ofsignature(1, 3) onR4 exposed by the right hand side formula in (D3). The alternative notation defined on the right is referred to as therelativistic dot product. Spacetime mathematically viewed asR4 endowed with this bilinear form is known asMinkowski spaceM. The Lorentz transformation is thus an element of the groupO(1, 3), theLorentz group or, for those that prefer the othermetric signature,O(3, 1) (also called the Lorentz group).[nb 3] One has:

(a,a)=(Λa,Λa)=(a,a),ΛO(1,3),a,aM,{\displaystyle (a,a)=(\Lambda a,\Lambda a)=(a',a'),\quad \Lambda \in \mathrm {O} (1,3),\quad a,a'\in M,}D5

which is precisely preservation of the bilinear form (D3) which implies (by linearity ofΛ and bilinearity of the form) that (D2) is satisfied. The elements of the Lorentz group arerotations andboosts and mixes thereof. If the spacetime translations are included, then one obtains theinhomogeneous Lorentz group or thePoincaré group.

Generalities

[edit]

The relations between the primed and unprimed spacetime coordinates are theLorentz transformations, each coordinate in one frame is alinear function of all the coordinates in the other frame, and theinverse functions are the inverse transformation. Depending on how the frames move relative to each other, and how they are oriented in space relative to each other, other parameters that describe direction, speed, and orientation enter the transformation equations.

Transformations describing relative motion with constant (uniform) velocity and without rotation of the space coordinate axes are calledLorentz boosts or simplyboosts, and the relative velocity between the frames is the parameter of the transformation. The other basic type of Lorentz transformation is only a rotation in the spatial coordinates. Unlike boosts, these are inertial transformations since there is no relative motion, the frames are simply tilted (and not continuously rotating), and in this case quantities defining the rotation are the parameters of the transformation (e.g.,axis–angle representation, orEuler angles, etc.). A combination of a rotation and a boost is ahomogeneous transformation, which transforms the origin back to the origin.

The full Lorentz groupO(3, 1) also contains special transformations that are neither rotations nor boosts, but ratherreflections in a plane through the origin. Two of these can be singled out;spatial inversion in which the spatial coordinates of all events are reversed in sign andtemporal inversion in which the time coordinate for each event gets its sign reversed.

Boosts should not be conflated with mere displacements in spacetime; in this case, the coordinate systems are simply shifted and there is no relative motion. However, these also count as symmetries forced by special relativity since they leave the spacetime interval invariant. A combination of a rotation with a boost, followed by a shift in spacetime, is aninhomogeneous Lorentz transformation, an element of the Poincaré group, which is also called the inhomogeneous Lorentz group.

Physical formulation of Lorentz boosts

[edit]
Further information:Derivations of the Lorentz transformations

Coordinate transformation

[edit]

The spacetime coordinates of an event, as measured by each observer in their inertial reference frame (in standard configuration) are shown in the speech bubbles.
Top: frameF moves at velocityv along thex-axis of frameF.
Bottom: frameF moves at velocity −v along thex-axis of frameF.[12]

A "stationary" observer in frameF defines events with coordinatest,x,y,z. Another frameF moves with velocityv relative toF, and an observer in this "moving" frameF defines events using the coordinatest,x,y,z.

The coordinate axes in each frame are parallel (thex andx axes are parallel, they andy axes are parallel, and thez andz axes are parallel), remain mutually perpendicular, and relative motion is along the coincidentxx axes. Att =t′ = 0, the origins of both coordinate systems are the same,(x,y,z) = (x′,y′,z′) = (0, 0, 0). In other words, the times and positions are coincident at this event. If all these hold, then the coordinate systems are said to be instandard configuration, orsynchronized.

If an observer inF records an eventt,x,y,z, then an observer inF records thesame event with coordinates[13]

Lorentz boost (x direction)

t=γ(tvxc2)x=γ(xvt)y=yz=z{\displaystyle {\begin{aligned}t'&=\gamma \left(t-{\frac {vx}{c^{2}}}\right)\\x'&=\gamma \left(x-vt\right)\\y'&=y\\z'&=z\end{aligned}}}

wherev is the relative velocity between frames in thex-direction,c is thespeed of light, andγ=11v2c2{\displaystyle \gamma ={\frac {1}{\sqrt {1-{\frac {v^{2}}{c^{2}}}}}}}(lowercasegamma) is theLorentz factor.

Here,v is theparameter of the transformation, for a given boost it is a constant number, but can take a continuous range of values. In the setup used here, positive relative velocityv > 0 is motion along the positive directions of thexx axes, zero relative velocityv = 0 is no relative motion, while negative relative velocityv < 0 is relative motion along the negative directions of thexx axes. The magnitude of relative velocityv cannot equal or exceedc, so only subluminal speedsc <v <c are allowed. The corresponding range ofγ is1 ≤γ < ∞.

The transformations are not defined ifv is outside these limits. At the speed of light (v =c)γ is infinite, andfaster than light (v >c)γ is acomplex number, each of which make the transformations unphysical. The space and time coordinates are measurable quantities and numerically must be real numbers.

As anactive transformation, an observer inF notices the coordinates of the event to be "boosted" in the negative directions of thexx axes, because of thev in the transformations. This has the equivalent effect of thecoordinate systemF boosted in the positive directions of thexx axes, while the event does not change and is simply represented in another coordinate system, apassive transformation.

The inverse relations (t,x,y,z in terms oft,x,y,z) can be found by algebraically solving the original set of equations. A more efficient way is to use physical principles. HereF is the "stationary" frame whileF is the "moving" frame. According to the principle of relativity, there is no privileged frame of reference, so the transformations fromF toF must take exactly the same form as the transformations fromF toF. The only difference isF moves with velocityv relative toF (i.e., the relative velocity has the same magnitude but is oppositely directed). Thus if an observer inF notes an eventt,x,y,z, then an observer inF notes thesame event with coordinates

Inverse Lorentz boost (x direction)

t=γ(t+vxc2)x=γ(x+vt)y=yz=z,{\displaystyle {\begin{aligned}t&=\gamma \left(t'+{\frac {vx'}{c^{2}}}\right)\\x&=\gamma \left(x'+vt'\right)\\y&=y'\\z&=z',\end{aligned}}}

and the value ofγ remains unchanged. This "trick" of simply reversing the direction of relative velocity while preserving its magnitude, and exchanging primed and unprimed variables, always applies to finding the inverse transformation of every boost in any direction.[14][15]

Sometimes it is more convenient to useβ =v/c (lowercasebeta) instead ofv, so thatct=γ(ctβx),x=γ(xβct),{\displaystyle {\begin{aligned}ct'&=\gamma \left(ct-\beta x\right)\,,\\x'&=\gamma \left(x-\beta ct\right)\,,\\\end{aligned}}}which shows much more clearly the symmetry in the transformation. From the allowed ranges ofv and the definition ofβ, it follows−1 <β < 1. The use ofβ andγ is standard throughout the literature. In the case of three spatial dimensions[ct,x,y,z], where the boostβ{\displaystyle \beta } is in thex direction, theeigenstates of the transformation are[1, 1, 0, 0] with eigenvalue(1β)/(1+β){\displaystyle {\sqrt {(1-\beta )/(1+\beta )}}},[1, −1, 0, 0] with eigenvalue(1+β)/(1β){\displaystyle {\sqrt {(1+\beta )/(1-\beta )}}}, and[0, 0, 1, 0] and[0, 0, 0, 1], the latter two with eigenvalue 1. 

When the boost velocityv{\displaystyle {\boldsymbol {v}}} is in an arbitrary vector direction with the boost vectorβ=v/c{\displaystyle {\boldsymbol {\beta }}={\boldsymbol {v}}/c}, then the transformation from an unprimed spacetime coordinate system to a primed coordinate system is given by[16][17]

[ctγβxx1+γ21+γβx2yγ21+γβxβyzγ21+γβyβz]=[γγβxγβyγβzγβx1+γ21+γβx2γ21+γβxβyγ21+γβxβzγβyγ21+γβxβy1+γ21+γβy2γ21+γβyβzγβzγ21+γβxβzγ21+γβyβz1+γ21+γβz2][ctγβxx1+γ21+γβx2yγ21+γβxβyzγ21+γβyβz],{\displaystyle {\begin{bmatrix}ct'{\vphantom {-\gamma \beta _{\text{x}}}}\\x'{\vphantom {1+{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}^{2}}}\\y'{\vphantom {{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{y}}}}\\z'{\vphantom {{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{y}}\beta _{\text{z}}}}\end{bmatrix}}={\begin{bmatrix}\gamma &-\gamma \beta _{\text{x}}&-\gamma \beta _{\text{y}}&-\gamma \beta _{\text{z}}\\-\gamma \beta _{\text{x}}&1+{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}^{2}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{y}}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{z}}\\-\gamma \beta _{\text{y}}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{y}}&1+{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{y}}^{2}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{y}}\beta _{\text{z}}\\-\gamma \beta _{\text{z}}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{z}}&{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{y}}\beta _{\text{z}}&1+{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{z}}^{2}\\\end{bmatrix}}{\begin{bmatrix}ct{\vphantom {-\gamma \beta _{\text{x}}}}\\x{\vphantom {1+{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}^{2}}}\\y{\vphantom {{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{x}}\beta _{\text{y}}}}\\z{\vphantom {{\frac {\gamma ^{2}}{1+\gamma }}\beta _{\text{y}}\beta _{\text{z}}}}\end{bmatrix}},}

where theLorentz factor isγ=1/1β2{\displaystyle \gamma =1/{\sqrt {1-{\boldsymbol {\beta }}^{2}}}}. Thedeterminant of the transformation matrix is +1 and itstrace is2(1+γ){\displaystyle 2(1+\gamma )}. The inverse of the transformation is given by reversing the sign ofβ{\displaystyle {\boldsymbol {\beta }}}. The quantityc2t2x2y2z2{\displaystyle c^{2}t^{2}-x^{2}-y^{2}-z^{2}} is invariant under the transformation: namely(ct2x2y2z2)=(ct2x2y2z2){\displaystyle (ct'^{2}-x'^{2}-y'^{2}-z'^{2})=(ct^{2}-x^{2}-y^{2}-z^{2})}.

The Lorentz transformations can also be derived in a way that resembles circular rotations in 3-dimensional space using thehyperbolic functions. For the boost in thex direction, the results are

Lorentz boost (x direction with rapidityζ)

ct=ctcoshζxsinhζx=xcoshζctsinhζy=yz=z{\displaystyle {\begin{aligned}ct'&=ct\cosh \zeta -x\sinh \zeta \\x'&=x\cosh \zeta -ct\sinh \zeta \\y'&=y\\z'&=z\end{aligned}}}

whereζ (lowercasezeta) is a parameter calledrapidity (many other symbols are used, includingθ,ϕ,φ,η,ψ,ξ). Given the strong resemblance to rotations of spatial coordinates in 3-dimensional space in the Cartesianxy,yz, andzx planes, a Lorentz boost can be thought of as ahyperbolic rotation of spacetime coordinates in the xt, yt, and zt Cartesian-time planes of 4-dimensionalMinkowski space. The parameterζ is thehyperbolic angle of rotation, analogous to the ordinary angle for circular rotations. This transformation can be illustrated with aMinkowski diagram.

The hyperbolic functions arise from thedifference between the squares of the time and spatial coordinates in the spacetime interval, rather than a sum. The geometric significance of the hyperbolic functions can be visualized by takingx = 0 orct = 0 in the transformations. Squaring and subtracting the results, one can derive hyperbolic curves of constant coordinate values but varyingζ, which parametrizes the curves according to the identitycosh2ζsinh2ζ=1.{\displaystyle \cosh ^{2}\zeta -\sinh ^{2}\zeta =1\,.}

Conversely thect andx axes can be constructed for varying coordinates but constantζ. The definitiontanhζ=sinhζcoshζ,{\displaystyle \tanh \zeta ={\frac {\sinh \zeta }{\cosh \zeta }}\,,}provides the link between a constant value of rapidity, and theslope of thect axis in spacetime. A consequence these two hyperbolic formulae is an identity that matches the Lorentz factorcoshζ=11tanh2ζ.{\displaystyle \cosh \zeta ={\frac {1}{\sqrt {1-\tanh ^{2}\zeta }}}\,.}

Comparing the Lorentz transformations in terms of the relative velocity and rapidity, or using the above formulae, the connections betweenβ,γ, andζ areβ=tanhζ,γ=coshζ,βγ=sinhζ.{\displaystyle {\begin{aligned}\beta &=\tanh \zeta \,,\\\gamma &=\cosh \zeta \,,\\\beta \gamma &=\sinh \zeta \,.\end{aligned}}}

Taking the inverse hyperbolic tangent gives the rapidityζ=tanh1β.{\displaystyle \zeta =\tanh ^{-1}\beta \,.}

Since−1 <β < 1, it follows−∞ <ζ < ∞. From the relation betweenζ andβ, positive rapidityζ > 0 is motion along the positive directions of thexx axes, zero rapidityζ = 0 is no relative motion, while negative rapidityζ < 0 is relative motion along the negative directions of thexx axes.

The inverse transformations are obtained by exchanging primed and unprimed quantities to switch the coordinate frames, and negating rapidityζ → −ζ since this is equivalent to negating the relative velocity. Therefore,

Inverse Lorentz boost (x direction with rapidityζ)

ct=ctcoshζ+xsinhζx=xcoshζ+ctsinhζy=yz=z{\displaystyle {\begin{aligned}ct&=ct'\cosh \zeta +x'\sinh \zeta \\x&=x'\cosh \zeta +ct'\sinh \zeta \\y&=y'\\z&=z'\end{aligned}}}

The inverse transformations can be similarly visualized by considering the cases whenx′ = 0 andct′ = 0.

So far the Lorentz transformations have been applied toone event. If there are two events, there is a spatial separation and time interval between them. It follows from thelinearity of the Lorentz transformations that two values of space and time coordinates can be chosen, the Lorentz transformations can be applied to each, then subtracted to get the Lorentz transformations of the differences:Δt=γ(ΔtvΔxc2),Δx=γ(ΔxvΔt),{\displaystyle {\begin{aligned}\Delta t'&=\gamma \left(\Delta t-{\frac {v\,\Delta x}{c^{2}}}\right)\,,\\\Delta x'&=\gamma \left(\Delta x-v\,\Delta t\right)\,,\end{aligned}}}with inverse relationsΔt=γ(Δt+vΔxc2),Δx=γ(Δx+vΔt).{\displaystyle {\begin{aligned}\Delta t&=\gamma \left(\Delta t'+{\frac {v\,\Delta x'}{c^{2}}}\right)\,,\\\Delta x&=\gamma \left(\Delta x'+v\,\Delta t'\right)\,.\end{aligned}}}whereΔ (uppercasedelta) indicates a difference of quantities; e.g.,Δx =x2x1 for two values ofx coordinates, and so on.

These transformations ondifferences rather than spatial points or instants of time are useful for a number of reasons:

  • in calculations and experiments, it is lengths between two points or time intervals that are measured or of interest (e.g., the length of a moving vehicle, or time duration it takes to travel from one place to another),
  • the transformations of velocity can be readily derived by making the difference infinitesimally small and dividing the equations, and the process repeated for the transformation of acceleration,
  • if the coordinate systems are never coincident (i.e., not in standard configuration), and if both observers can agree on an eventt0,x0,y0,z0 inF andt0,x0,y0,z0 inF, then they can use that event as the origin, and the spacetime coordinate differences are the differences between their coordinates and this origin, e.g.,Δx =xx0,Δx′ =x′ −x0, etc.

Physical implications

[edit]

A critical requirement of the Lorentz transformations is the invariance of the speed of light, a fact used in their derivation, and contained in the transformations themselves. If inF the equation for a pulse of light along thex direction isx =ct, then inF′ the Lorentz transformations givex′ =ct, and vice versa, for anyc <v <c.

For relative speeds much less than the speed of light, the Lorentz transformations reduce to theGalilean transformation:[18][19]ttxxvt{\displaystyle {\begin{aligned}t'&\approx t\\x'&\approx x-vt\end{aligned}}}in accordance with thecorrespondence principle. It is sometimes said that nonrelativistic physics is a physics of "instantaneous action at a distance".[20]

Three counterintuitive, but correct, predictions of the transformations are:

Relativity of simultaneity
Suppose two events occur along the x axis simultaneously (Δt = 0) inF, but separated by a nonzero displacementΔx. Then inF′, we find thatΔt=γvΔx/c2{\displaystyle \Delta t'=-\gamma {v\,\Delta x}/{c^{2}}}, so the events are no longer simultaneous according to a moving observer.
Time dilation
Suppose there is a clock at rest inF. If a time interval is measured at the same point in that frame, so thatΔx = 0, then the transformations give this interval inF byΔt′ =γΔt. Conversely, suppose there is a clock at rest inF. If an interval is measured at the same point in that frame, so thatΔx′ = 0, then the transformations give this interval inF byΔt =γΔt. Either way, each observer measures the time interval between ticks of a moving clock to be longer by a factorγ than the time interval between ticks of his own clock.
Length contraction
Suppose there is a rod at rest inF aligned along thex axis, with lengthΔx. InF, the rod moves with velocityv, so its length must be measured by taking two simultaneous (Δt′ = 0) measurements at opposite ends. Under these conditions, the inverse Lorentz transform shows thatΔx =γΔx. InF the two measurements are no longer simultaneous, but this does not matter because the rod is at rest inF. So each observer measures the distance between the end points of a moving rod to be shorter by a factor1/γ than the end points of an identical rod at rest in his own frame. Length contraction affects any geometric quantity related to lengths, so from the perspective of a moving observer, areas and volumes will also appear to shrink along the direction of motion.

Vector transformations

[edit]
Further information:Euclidean vector andvector projection
An observer in frameF observesF to move with velocityv, whileF observesF to move with velocityv.The coordinate axes of each frame are still parallel[according to whom?] and orthogonal. The position vector as measured in each frame is split into components parallel and perpendicular to the relative velocity vectorv.
Left: Standard configuration.Right: Inverse configuration.

The use of vectors allows positions and velocities to be expressed in arbitrary directions compactly. A single boost in any direction depends on the full relativevelocity vectorv with a magnitude|v| =v that cannot equal or exceedc, so that0 ≤v <c.

Only time and the coordinates parallel to the direction of relative motion change, while those coordinates perpendicular do not. With this in mind, split the spatialposition vectorr as measured inF, andr as measured inF, each into components perpendicular () and parallel ( ||) tov,r=r+r,r=r+r,{\displaystyle \mathbf {r} =\mathbf {r} _{\perp }+\mathbf {r} _{\|}\,,\quad \mathbf {r} '=\mathbf {r} _{\perp }'+\mathbf {r} _{\|}'\,,}then the transformations aret=γ(trvc2)r=γ(rvt)r=r{\displaystyle {\begin{aligned}t'&=\gamma \left(t-{\frac {\mathbf {r} _{\parallel }\cdot \mathbf {v} }{c^{2}}}\right)\\\mathbf {r} _{\|}'&=\gamma (\mathbf {r} _{\|}-\mathbf {v} t)\\\mathbf {r} _{\perp }'&=\mathbf {r} _{\perp }\end{aligned}}}where· is thedot product. The Lorentz factorγ retains its definition for a boost in any direction, since it depends only on the magnitude of the relative velocity. The definitionβ =v/c with magnitude0 ≤β < 1 is also used by some authors.

Introducing aunit vectorn =v/v =β/β in the direction of relative motion, the relative velocity isv =vn with magnitudev and directionn, andvector projection and rejection give respectivelyr=(rn)n,r=r(rn)n{\displaystyle \mathbf {r} _{\parallel }=(\mathbf {r} \cdot \mathbf {n} )\mathbf {n} \,,\quad \mathbf {r} _{\perp }=\mathbf {r} -(\mathbf {r} \cdot \mathbf {n} )\mathbf {n} }

Accumulating the results gives the full transformations,

Lorentz boost (in directionn with magnitudev)

t=γ(tvnrc2),r=r+(γ1)(rn)nγtvn.{\displaystyle {\begin{aligned}t'&=\gamma \left(t-{\frac {v\mathbf {n} \cdot \mathbf {r} }{c^{2}}}\right)\,,\\\mathbf {r} '&=\mathbf {r} +(\gamma -1)(\mathbf {r} \cdot \mathbf {n} )\mathbf {n} -\gamma tv\mathbf {n} \,.\end{aligned}}}

The projection and rejection also applies tor. For the inverse transformations, exchanger andr to switch observed coordinates, and negate the relative velocityv → −v (or simply the unit vectorn → −n since the magnitudev is always positive) to obtain

Inverse Lorentz boost (in directionn with magnitudev)

t=γ(t+rvnc2),r=r+(γ1)(rn)n+γtvn,{\displaystyle {\begin{aligned}t&=\gamma \left(t'+{\frac {\mathbf {r} '\cdot v\mathbf {n} }{c^{2}}}\right)\,,\\\mathbf {r} &=\mathbf {r} '+(\gamma -1)(\mathbf {r} '\cdot \mathbf {n} )\mathbf {n} +\gamma t'v\mathbf {n} \,,\end{aligned}}}

The unit vector has the advantage of simplifying equations for a single boost, allows eitherv orβ to be reinstated when convenient, and the rapidity parametrization is immediately obtained by replacingβ andβγ. It is not convenient for multiple boosts.

The vectorial relation between relative velocity and rapidity is[21]β=βn=ntanhζ,{\displaystyle {\boldsymbol {\beta }}=\beta \mathbf {n} =\mathbf {n} \tanh \zeta \,,}and the "rapidity vector" can be defined asζ=ζn=ntanh1β,{\displaystyle {\boldsymbol {\zeta }}=\zeta \mathbf {n} =\mathbf {n} \tanh ^{-1}\beta \,,}each of which serves as a useful abbreviation in some contexts. The magnitude ofζ is the absolute value of the rapidity scalar confined to0 ≤ζ < ∞, which agrees with the range0 ≤β < 1.

Transformation of velocities

[edit]
Further information:differential of a function andvelocity addition formula
The transformation of velocities provides the definitionrelativistic velocity addition, the ordering of vectors is chosen to reflect the ordering of the addition of velocities; firstv (the velocity ofF relative toF) thenu (the velocity ofX relative toF) to obtainu =vu (the velocity ofX relative toF).

Defining the coordinate velocities and Lorentz factor byu=drdt,u=drdt,γv=11vvc2{\displaystyle \mathbf {u} ={\frac {d\mathbf {r} }{dt}}\,,\quad \mathbf {u} '={\frac {d\mathbf {r} '}{dt'}}\,,\quad \gamma _{\mathbf {v} }={\frac {1}{\sqrt {1-{\dfrac {\mathbf {v} \cdot \mathbf {v} }{c^{2}}}}}}}taking the differentials in the coordinates and time of the vector transformations, then dividing equations, leads tou=11vuc2[uγvv+1c2γvγv+1(uv)v]{\displaystyle \mathbf {u} '={\frac {1}{1-{\frac {\mathbf {v} \cdot \mathbf {u} }{c^{2}}}}}\left[{\frac {\mathbf {u} }{\gamma _{\mathbf {v} }}}-\mathbf {v} +{\frac {1}{c^{2}}}{\frac {\gamma _{\mathbf {v} }}{\gamma _{\mathbf {v} }+1}}\left(\mathbf {u} \cdot \mathbf {v} \right)\mathbf {v} \right]}

The velocitiesu andu are the velocity of some massive object. They can also be for a third inertial frame (sayF′′), in which case they must beconstant. Denote either entity byX. ThenX moves with velocityu relative toF, or equivalently with velocityu relative toF′, in turnF′ moves with velocityv relative toF. The inverse transformations can be obtained in a similar way, or as with position coordinates exchangeu andu, and changev tov.

The transformation of velocity is useful instellar aberration, theFizeau experiment, and therelativistic Doppler effect.

TheLorentz transformations of acceleration can be similarly obtained by taking differentials in the velocity vectors, and dividing these by the time differential.

Transformation of other quantities

[edit]

In general, given four quantitiesA andZ = (Zx,Zy,Zz) and their Lorentz-boosted counterpartsA′ andZ′ = (Zx,Zy,Zz), a relation of the formA2ZZ=A2ZZ{\displaystyle A^{2}-\mathbf {Z} \cdot \mathbf {Z} ={A'}^{2}-\mathbf {Z} '\cdot \mathbf {Z} '}implies the quantities transform under Lorentz transformations similar to the transformation of spacetime coordinates;A=γ(AvnZc),Z=Z+(γ1)(Zn)nγAvnc.{\displaystyle {\begin{aligned}A'&=\gamma \left(A-{\frac {v\mathbf {n} \cdot \mathbf {Z} }{c}}\right)\,,\\\mathbf {Z} '&=\mathbf {Z} +(\gamma -1)(\mathbf {Z} \cdot \mathbf {n} )\mathbf {n} -{\frac {\gamma Av\mathbf {n} }{c}}\,.\end{aligned}}}

The decomposition ofZ (andZ) into components perpendicular and parallel tov is exactly the same as for the position vector, as is the process of obtaining the inverse transformations (exchange(A,Z) and(A′,Z′) to switch observed quantities, and reverse the direction of relative motion by the substitutionn ↦ −n).

The quantities(A,Z) collectively make up afour-vector, whereA is the "timelike component", andZ the "spacelike component". Examples ofA andZ are the following:

Four-vectorAZ
positionfour-vectortime (multiplied byc),ctposition vector,r
four-momentumenergy (divided byc),E/cmomentum,p
Four-wave vectorangular frequency (divided byc),ω/cwave vector,k
four-spin(No name),stspin,s
four-currentcharge density (multiplied byc),ρccurrent density,j
electromagnetic four-potentialelectric potential (divided byc),φ/cmagnetic vector potential,A

For a given object (e.g., particle, fluid, field, material), ifA orZ correspond to properties specific to the object like itscharge density,mass density,spin, etc., its properties can be fixed in the rest frame of that object. Then the Lorentz transformations give the corresponding properties in a frame moving relative to the object with constant velocity. This breaks some notions taken for granted in non-relativistic physics. For example, the energyE of an object is a scalar in non-relativistic mechanics, but not in relativistic mechanics because energy changes under Lorentz transformations; its value is different for various inertial frames. In the rest frame of an object, it has arest energy and zero momentum. In a boosted frame its energy is different and it appears to have a momentum. Similarly, in non-relativistic quantum mechanics the spin of a particle is a constant vector, but inrelativistic quantum mechanics spins depends on relative motion. In the rest frame of the particle, the spin pseudovector can be fixed to be its ordinary non-relativistic spin with a zero timelike quantityst, however a boosted observer will perceive a nonzero timelike component and an altered spin.[22]

Not all quantities are invariant in the form as shown above, for example orbitalangular momentumL does not have a timelike quantity, and neither does theelectric fieldE nor themagnetic fieldB. The definition of angular momentum isL =r ×p, and in a boosted frame the altered angular momentum isL′ =r′ ×p. Applying this definition using the transformations of coordinates and momentum leads to the transformation of angular momentum. It turns outL transforms with another vector quantityN = (E/c2)rtp related to boosts, seeRelativistic angular momentum for details. For the case of theE andB fields, the transformations cannot be obtained as directly using vector algebra. TheLorentz force is the definition of these fields, and inF it isF =q(E +v ×B) while inF′ it isF′ =q(E′ +v′ ×B′). A method of deriving the EM field transformations in an efficient way which also illustrates the unit of the electromagnetic field uses tensor algebra,given below.

Mathematical formulation

[edit]
Main article:Lorentz group
Further information:Matrix (mathematics),matrix product,linear algebra, androtation formalisms in three dimensions

Throughout, italic non-bold capital letters are4 × 4 matrices, while non-italic bold letters are3 × 3 matrices.

Homogeneous Lorentz group

[edit]

Writing the coordinates in column vectors and theMinkowski metricη as a square matrixX=[ctxyz],η=[1000010000100001],X=[ctxyz]{\displaystyle X'={\begin{bmatrix}c\,t'\\x'\\y'\\z'\end{bmatrix}}\,,\quad \eta ={\begin{bmatrix}-1&0&0&0\\0&1&0&0\\0&0&1&0\\0&0&0&1\end{bmatrix}}\,,\quad X={\begin{bmatrix}c\,t\\x\\y\\z\end{bmatrix}}}the spacetime interval takes the form (superscriptT denotestranspose)XX=XTηX=XTηX{\displaystyle X\cdot X=X^{\mathrm {T} }\eta X={X'}^{\mathrm {T} }\eta {X'}}and isinvariant under a Lorentz transformationX=ΛX{\displaystyle X'=\Lambda X}whereΛ is a square matrix which can depend on parameters.

Theset of all Lorentz transformationsΛ{\displaystyle \Lambda } in this article is denotedL{\displaystyle {\mathcal {L}}}. This set together with matrix multiplication forms agroup, in this context known as theLorentz group. Also, the above expressionX·X is aquadratic form of signature (3,1) on spacetime, and the group of transformations which leaves this quadratic form invariant is theindefinite orthogonal group O(3,1), aLie group. In other words, the Lorentz group is O(3,1). As presented in this article, any Lie groups mentioned arematrix Lie groups. In this context the operation of composition amounts tomatrix multiplication.

From the invariance of the spacetime interval it followsη=ΛTηΛ{\displaystyle \eta =\Lambda ^{\mathrm {T} }\eta \Lambda }and this matrix equation contains the general conditions on the Lorentz transformation to ensure invariance of the spacetime interval. Taking thedeterminant of the equation using the product rule[nb 4] gives immediately[det(Λ)]2=1det(Λ)=±1{\displaystyle \left[\det(\Lambda )\right]^{2}=1\quad \Rightarrow \quad \det(\Lambda )=\pm 1}

Writing the Minkowski metric as a block matrix, and the Lorentz transformation in the most general form,η=[100I],Λ=[ΓaTbM],{\displaystyle \eta ={\begin{bmatrix}-1&0\\0&\mathbf {I} \end{bmatrix}}\,,\quad \Lambda ={\begin{bmatrix}\Gamma &-\mathbf {a} ^{\mathrm {T} }\\-\mathbf {b} &\mathbf {M} \end{bmatrix}}\,,}carrying out the block matrix multiplications obtains general conditions onΓ,a,b,M to ensure relativistic invariance. Not much information can be directly extracted from all the conditions, however one of the resultsΓ2=1+bTb{\displaystyle \Gamma ^{2}=1+\mathbf {b} ^{\mathrm {T} }\mathbf {b} }is useful;bTb ≥ 0 always so it follows thatΓ21Γ1,Γ1{\displaystyle \Gamma ^{2}\geq 1\quad \Rightarrow \quad \Gamma \leq -1\,,\quad \Gamma \geq 1}

The negative inequality may be unexpected, becauseΓ multiplies the time coordinate and this has an effect ontime symmetry. If the positive equality holds, thenΓ is the Lorentz factor.

The determinant and inequality provide four ways to classifyLorentzTransformations (hereinLTs for brevity). Any particular LT has only one determinant signand only one inequality. There are four sets which include every possible pair given by theintersections ("n"-shaped symbol meaning "and") of these classifying sets.

Intersection, ∩Antichronous (or non-orthochronous) LTs
L={Λ:Γ1}{\displaystyle {\mathcal {L}}^{\downarrow }=\{\Lambda :\Gamma \leq -1\}}
Orthochronous LTs
L={Λ:Γ1}{\displaystyle {\mathcal {L}}^{\uparrow }=\{\Lambda :\Gamma \geq 1\}}
Proper LTs
L+={Λ:det(Λ)=+1}{\displaystyle {\mathcal {L}}_{+}=\{\Lambda :\det(\Lambda )=+1\}}
Proper antichronous LTs
L+=L+L{\displaystyle {\mathcal {L}}_{+}^{\downarrow }={\mathcal {L}}_{+}\cap {\mathcal {L}}^{\downarrow }}
Proper orthochronous LTs
L+=L+L{\displaystyle {\mathcal {L}}_{+}^{\uparrow }={\mathcal {L}}_{+}\cap {\mathcal {L}}^{\uparrow }}
Improper LTs
L={Λ:det(Λ)=1}{\displaystyle {\mathcal {L}}_{-}=\{\Lambda :\det(\Lambda )=-1\}}
Improper antichronous LTs
L=LL{\displaystyle {\mathcal {L}}_{-}^{\downarrow }={\mathcal {L}}_{-}\cap {\mathcal {L}}^{\downarrow }}
Improper orthochronous LTs
L=LL{\displaystyle {\mathcal {L}}_{-}^{\uparrow }={\mathcal {L}}_{-}\cap {\mathcal {L}}^{\uparrow }}

where "+" and "−" indicate the determinant sign, while "↑" for ≥ and "↓" for ≤ denote the inequalities.

The full Lorentz group splits into theunion ("u"-shaped symbol meaning "or") of fourdisjoint setsL=L+LL+L{\displaystyle {\mathcal {L}}={\mathcal {L}}_{+}^{\uparrow }\cup {\mathcal {L}}_{-}^{\uparrow }\cup {\mathcal {L}}_{+}^{\downarrow }\cup {\mathcal {L}}_{-}^{\downarrow }}

Asubgroup of a group must beclosed under the same operation of the group (here matrix multiplication). In other words, for two Lorentz transformationsΛ andL from a particular subgroup, the composite Lorentz transformationsΛL andLΛ must be in the same subgroup asΛ andL. This is not always the case: the composition of two antichronous Lorentz transformations is orthochronous, and the composition of two improper Lorentz transformations is proper. In other words, while the setsL+{\displaystyle {\mathcal {L}}_{+}^{\uparrow }},L+{\displaystyle {\mathcal {L}}_{+}},L{\displaystyle {\mathcal {L}}^{\uparrow }}, andL0=L+L{\displaystyle {\mathcal {L}}_{0}={\mathcal {L}}_{+}^{\uparrow }\cup {\mathcal {L}}_{-}^{\downarrow }} all form subgroups, the sets containing improper and/or antichronous transformations without enough proper orthochronous transformations (e.g.L+{\displaystyle {\mathcal {L}}_{+}^{\downarrow }},L{\displaystyle {\mathcal {L}}_{-}^{\downarrow }},L{\displaystyle {\mathcal {L}}_{-}^{\uparrow }}) do not form subgroups.

Proper transformations

[edit]

If a Lorentz covariant 4-vector is measured in one inertial frame with resultX{\displaystyle X}, and the same measurement made in another inertial frame (with the same orientation and origin) gives resultX{\displaystyle X'}, the two results will be related byX=B(v)X{\displaystyle X'=B(\mathbf {v} )X}where the boost matrixB(v){\displaystyle B(\mathbf {v} )} represents the rotation-free Lorentz transformation between the unprimed and primed frames andv{\displaystyle \mathbf {v} } is the velocity of the primed frame as seen from the unprimed frame. The matrix is given by[23]B(v)=[γγvx/cγvy/cγvz/cγvx/c1+(γ1)vx2v2(γ1)vxvyv2(γ1)vxvzv2γvy/c(γ1)vyvxv21+(γ1)vy2v2(γ1)vyvzv2γvz/c(γ1)vzvxv2(γ1)vzvyv21+(γ1)vz2v2]=[γγβTγβI+(γ1)ββTβ2],{\displaystyle B(\mathbf {v} )={\begin{bmatrix}\gamma &-\gamma v_{\text{x}}/c&-\gamma v_{\text{y}}/c&-\gamma v_{\text{z}}/c\\-\gamma v_{\text{x}}/c&1+(\gamma -1){\dfrac {v_{\text{x}}^{2}}{v^{2}}}&(\gamma -1){\dfrac {v_{\text{x}}v_{\text{y}}}{v^{2}}}&(\gamma -1){\dfrac {v_{\text{x}}v_{\text{z}}}{v^{2}}}\\-\gamma v_{\text{y}}/c&(\gamma -1){\dfrac {v_{\text{y}}v_{\text{x}}}{v^{2}}}&1+(\gamma -1){\dfrac {v_{\text{y}}^{2}}{v^{2}}}&(\gamma -1){\dfrac {v_{\text{y}}v_{\text{z}}}{v^{2}}}\\-\gamma v_{\text{z}}/c&(\gamma -1){\dfrac {v_{\text{z}}v_{\text{x}}}{v^{2}}}&(\gamma -1){\dfrac {v_{\text{z}}v_{\text{y}}}{v^{2}}}&1+(\gamma -1){\dfrac {v_{\text{z}}^{2}}{v^{2}}}\end{bmatrix}}={\begin{bmatrix}\gamma &-\gamma {\vec {\beta }}^{\text{T}}\\-\gamma {\vec {\beta }}&I+(\gamma -1){\dfrac {{\vec {\beta }}{\vec {\beta }}^{\text{T}}}{\beta ^{2}}}\end{bmatrix}},}

wherev=vx2+vy2+vz2{\textstyle v={\sqrt {v_{\text{x}}^{2}+v_{\text{y}}^{2}+v_{\text{z}}^{2}}}} is the magnitude of the velocity andγ=11v2/c2{\textstyle \gamma ={\frac {1}{\sqrt {1-{v^{2}}/{c^{2}}}}}} is the Lorentz factor. This formula represents a passive transformation, as it describes how the coordinates of the measured quantity changes from the unprimed frame to the primed frame. The active transformation is given byB(v){\displaystyle B(-\mathbf {v} )}.

If a frameF is boosted with velocityu relative to frameF, and another frameF′′ is boosted with velocityv relative toF, the separate boosts areX=B(v)X,X=B(u)X{\displaystyle X''=B(\mathbf {v} )X'\,,\quad X'=B(\mathbf {u} )X}and the composition of the two boosts connects the coordinates inF′′ andF,X=B(v)B(u)X.{\displaystyle X''=B(\mathbf {v} )B(\mathbf {u} )X\,.}Successive transformations act on the left. Ifu andv arecollinear (parallel or antiparallel along the same line of relative motion), the boost matricescommute:B(v)B(u) =B(u)B(v). This composite transformation happens to be another boost,B(w), wherew is collinear withu andv.

Ifu andv are not collinear but in different directions, the situation is considerably more complicated. Lorentz boosts along different directions do not commute:B(v)B(u) andB(u)B(v) are not equal. Although each of these compositions isnot a single boost, each composition is still a Lorentz transformation as it preserves the spacetime interval. It turns out the composition of any two Lorentz boosts is equivalent to a boost followed or preceded by a rotation on the spatial coordinates, in the form ofR(ρ)B(w) orB(w)R(ρ). Thew andw arecomposite velocities, whileρ andρ are rotation parameters (e.g.axis-angle variables,Euler angles, etc.). The rotation inblock matrix form is simplyR(ρ)=[100R(ρ)],{\displaystyle \quad R({\boldsymbol {\rho }})={\begin{bmatrix}1&0\\0&\mathbf {R} ({\boldsymbol {\rho }})\end{bmatrix}}\,,}whereR(ρ) is a3 × 3rotation matrix, which rotates any 3-dimensional vector in one sense (active transformation), or equivalently the coordinate frame in the opposite sense (passive transformation). It isnot simple to connectw andρ (orw andρ) to the original boost parametersu andv. In a composition of boosts, theR matrix is named theWigner rotation, and gives rise to theThomas precession. These articles give the explicit formulae for the composite transformation matrices, including expressions forw,ρ,w,ρ.

In this article theaxis-angle representation is used forρ. The rotation is about an axis in the direction of aunit vectore, through angleθ (positive anticlockwise, negative clockwise, according to theright-hand rule). The "axis-angle vector"θ=θe{\displaystyle {\boldsymbol {\theta }}=\theta \mathbf {e} }will serve as a useful abbreviation.

Spatial rotations alone are also Lorentz transformations since they leave the spacetime interval invariant. Like boosts, successive rotations about different axes do not commute. Unlike boosts, the composition of any two rotations is equivalent to a single rotation. Some other similarities and differences between the boost and rotation matrices include:

  • inverses:B(v)−1 =B(−v) (relative motion in the opposite direction), andR(θ)−1 =R(−θ) (rotation in the opposite sense about the same axis)
  • identity transformation for no relative motion/rotation:B(0) =R(0) =I
  • unitdeterminant:det(B) = det(R) = +1. This property makes them proper transformations.
  • matrix symmetry:B is symmetric (equalstranspose), whileR is nonsymmetric butorthogonal (transpose equalsinverse,RT =R−1).

The most general proper Lorentz transformationΛ(v,θ) includes a boost and rotation together, and is a nonsymmetric matrix. As special cases,Λ(0,θ) =R(θ) andΛ(v,0) =B(v). An explicit form of the general Lorentz transformation is cumbersome to write down and will not be given here. Nevertheless, closed form expressions for the transformation matrices will be given below using group theoretical arguments. It will be easier to use the rapidity parametrization for boosts, in which case one writesΛ(ζ,θ) andB(ζ).

Lie group SO+(3,1)

[edit]

The set of transformations{B(ζ),R(θ),Λ(ζ,θ)}{\displaystyle \{B({\boldsymbol {\zeta }}),R({\boldsymbol {\theta }}),\Lambda ({\boldsymbol {\zeta }},{\boldsymbol {\theta }})\}}with matrix multiplication as the operation of composition forms a group, called the "restricted Lorentz group", and is thespecial indefinite orthogonal group SO+(3,1). (The plus sign indicates that it preserves the orientation of the temporal dimension).

For simplicity, look at the infinitesimal Lorentz boost in thex direction (examining a boost in any other direction, or rotation about any axis, follows an identical procedure). The infinitesimal boost is a small boost away from the identity, obtained by theTaylor expansion of the boost matrix to first order aboutζ = 0,Bx=I+ζBxζ|ζ=0+{\displaystyle B_{\text{x}}=I+\zeta \left.{\frac {\partial B_{\text{x}}}{\partial \zeta }}\right|_{\zeta =0}+\cdots }where the higher order terms not shown are negligible becauseζ is small, andBx is simply the boost matrix in thex direction. Thederivative of the matrix is the matrix of derivatives (of the entries, with respect to the same variable), and it is understood the derivatives are found first then evaluated atζ = 0,Bxζ|ζ=0=Kx.{\displaystyle \left.{\frac {\partial B_{\text{x}}}{\partial \zeta }}\right|_{\zeta =0}=-K_{\text{x}}\,.}

For now,Kx is defined by this result (its significance will be explained shortly). In the limit of an infinite number of infinitely small steps, the finite boost transformation in the form of amatrix exponential is obtainedBx=limN(IζNKx)N=eζKx{\displaystyle B_{\text{x}}=\lim _{N\to \infty }\left(I-{\frac {\zeta }{N}}K_{\text{x}}\right)^{\!N}=e^{-\zeta K_{\text{x}}}}where thelimit definition of the exponential has been used (see alsoCharacterizations of the exponential function). More generally[nb 5]B(ζ)=eζK,R(θ)=eθJ.{\displaystyle B({\boldsymbol {\zeta }})=e^{-{\boldsymbol {\zeta }}\cdot \mathbf {K} }\,,\quad R({\boldsymbol {\theta }})=e^{{\boldsymbol {\theta }}\cdot \mathbf {J} }\,.}

The axis-angle vectorθ and rapidity vectorζ are altogether six continuous variables which make up the group parameters (in this particular representation), and the generators of the group areK = (Kx,Ky,Kz) andJ = (Jx,Jy,Jz), each vectors of matrices with the explicit forms[nb 6]Kx=[0100100000000000],Ky=[0010000010000000],Kz=[0001000000001000]Jx=[0000000000010010],Jy=[0000000100000100],Jz=[0000001001000000]{\displaystyle {\begin{alignedat}{3}K_{\text{x}}&={\begin{bmatrix}0&1&0&0\\1&0&0&0\\0&0&0&0\\0&0&0&0\\\end{bmatrix}}\,,\quad &K_{\text{y}}&={\begin{bmatrix}0&0&1&0\\0&0&0&0\\1&0&0&0\\0&0&0&0\end{bmatrix}}\,,\quad &K_{\text{z}}&={\begin{bmatrix}0&0&0&1\\0&0&0&0\\0&0&0&0\\1&0&0&0\end{bmatrix}}\\[10mu]J_{\text{x}}&={\begin{bmatrix}0&0&0&0\\0&0&0&0\\0&0&0&-1\\0&0&1&0\\\end{bmatrix}}\,,\quad &J_{\text{y}}&={\begin{bmatrix}0&0&0&0\\0&0&0&1\\0&0&0&0\\0&-1&0&0\end{bmatrix}}\,,\quad &J_{\text{z}}&={\begin{bmatrix}0&0&0&0\\0&0&-1&0\\0&1&0&0\\0&0&0&0\end{bmatrix}}\end{alignedat}}}

These are all defined in an analogous way toKx above, although the minus signs in the boost generators are conventional. Physically, the generators of the Lorentz group correspond to important symmetries in spacetime:J are therotation generators which correspond toangular momentum, andK are theboost generators which correspond to the motion of the system in spacetime. The derivative of any smooth curveC(t) withC(0) =I in the group depending on some group parametert with respect to that group parameter, evaluated att = 0, serves as a definition of a corresponding group generatorG, and this reflects an infinitesimal transformation away from the identity. The smooth curve can always be taken as an exponential as the exponential will always mapG smoothly back into the group viat → exp(tG) for allt; this curve will yieldG again when differentiated att = 0.

Expanding the exponentials in their Taylor series obtainsB(ζ)=Isinhζ(nK)+(coshζ1)(nK)2{\displaystyle B({\boldsymbol {\zeta }})=I-\sinh \zeta (\mathbf {n} \cdot \mathbf {K} )+(\cosh \zeta -1)(\mathbf {n} \cdot \mathbf {K} )^{2}}R(θ)=I+sinθ(eJ)+(1cosθ)(eJ)2.{\displaystyle R({\boldsymbol {\theta }})=I+\sin \theta (\mathbf {e} \cdot \mathbf {J} )+(1-\cos \theta )(\mathbf {e} \cdot \mathbf {J} )^{2}\,.}which compactly reproduce the boost and rotation matrices as given in the previous section.

It has been stated that the general proper Lorentz transformation is a product of a boost and rotation. At theinfinitesimal level the productΛ=(IζK+)(I+θJ+)=(I+θJ+)(IζK+)=IζK+θJ+{\displaystyle {\begin{aligned}\Lambda &=(I-{\boldsymbol {\zeta }}\cdot \mathbf {K} +\cdots )(I+{\boldsymbol {\theta }}\cdot \mathbf {J} +\cdots )\\&=(I+{\boldsymbol {\theta }}\cdot \mathbf {J} +\cdots )(I-{\boldsymbol {\zeta }}\cdot \mathbf {K} +\cdots )\\&=I-{\boldsymbol {\zeta }}\cdot \mathbf {K} +{\boldsymbol {\theta }}\cdot \mathbf {J} +\cdots \end{aligned}}}is commutative because only linear terms are required (products like(θ·J)(ζ·K) and(ζ·K)(θ·J) count as higher order terms and are negligible). Taking the limit as before leads to the finite transformation in the form of an exponentialΛ(ζ,θ)=eζK+θJ.{\displaystyle \Lambda ({\boldsymbol {\zeta }},{\boldsymbol {\theta }})=e^{-{\boldsymbol {\zeta }}\cdot \mathbf {K} +{\boldsymbol {\theta }}\cdot \mathbf {J} }.}

The converse is also true, but the decomposition of a finite general Lorentz transformation into such factors is nontrivial. In particular,eζK+θJeζKeθJ,{\displaystyle e^{-{\boldsymbol {\zeta }}\cdot \mathbf {K} +{\boldsymbol {\theta }}\cdot \mathbf {J} }\neq e^{-{\boldsymbol {\zeta }}\cdot \mathbf {K} }e^{{\boldsymbol {\theta }}\cdot \mathbf {J} },}because the generators do not commute. For a description of how to find the factors of a general Lorentz transformation in terms of a boost and a rotationin principle (this usually does not yield an intelligible expression in terms of generatorsJ andK), seeWigner rotation. If, on the other hand,the decomposition is given in terms of the generators, and one wants to find the product in terms of the generators, then theBaker–Campbell–Hausdorff formula applies.

Lie algebra so(3,1)

[edit]

Lorentz generators can be added together, or multiplied by real numbers, to obtain more Lorentz generators. In other words, theset of all Lorentz generatorsV={ζK+θJ}{\displaystyle V=\{{\boldsymbol {\zeta }}\cdot \mathbf {K} +{\boldsymbol {\theta }}\cdot \mathbf {J} \}}together with the operations of ordinarymatrix addition andmultiplication of a matrix by a number, forms avector space over the real numbers.[nb 7] The generatorsJx,Jy,Jz,Kx,Ky,Kz form abasis set ofV, and the components of the axis-angle and rapidity vectors,θx,θy,θz,ζx,ζy,ζz, are thecoordinates of a Lorentz generator with respect to this basis.[nb 8]

Three of thecommutation relations of the Lorentz generators are[Jx,Jy]=Jz,[Kx,Ky]=Jz,[Jx,Ky]=Kz,{\displaystyle [J_{\text{x}},J_{\text{y}}]=J_{\text{z}}\,,\quad [K_{\text{x}},K_{\text{y}}]=-J_{\text{z}}\,,\quad [J_{\text{x}},K_{\text{y}}]=K_{\text{z}}\,,}where the bracket[A,B] =ABBA is known as thecommutator, and the other relations can be found by takingcyclic permutations ofx,y,z components (i.e. changex toy,y toz, andz tox, repeat).

These commutation relations, and the vector space of generators, fulfill the definition of theLie algebraso(3,1){\displaystyle {\mathfrak {so}}(3,1)}. In summary, a Lie algebra is defined as avector spaceV over afield of numbers, and with abinary operation[ , ] (called aLie bracket in this context) on the elements of the vector space, satisfying the axioms ofbilinearity,alternatization, and theJacobi identity. Here the operation[ , ] is the commutator which satisfies all of these axioms, the vector space is the set of Lorentz generatorsV as given previously, and the field is the set of real numbers.

Linking terminology used in mathematics and physics: A group generator is any element of the Lie algebra. A group parameter is a component of a coordinate vector representing an arbitrary element of the Lie algebra with respect to some basis. A basis, then, is a set of generators being a basis of the Lie algebra in the usual vector space sense.

Theexponential map from the Lie algebra to the Lie group,exp:so(3,1)SO(3,1),{\displaystyle \exp :{\mathfrak {so}}(3,1)\to \mathrm {SO} (3,1),}provides a one-to-one correspondence between small enough neighborhoods of the origin of the Lie algebra and neighborhoods of the identity element of the Lie group. In the case of the Lorentz group, the exponential map is just thematrix exponential. Globally, the exponential map is not one-to-one, but in the case of the Lorentz group, it issurjective (onto). Hence any group element in the connected component of the identity can be expressed as an exponential of an element of the Lie algebra.

Improper transformations

[edit]

Lorentz transformations also includeparity inversionP=[100I]{\displaystyle P={\begin{bmatrix}1&0\\0&-\mathbf {I} \end{bmatrix}}}which negates all the spatial coordinates only, andtime reversalT=[100I]{\displaystyle T={\begin{bmatrix}-1&0\\0&\mathbf {I} \end{bmatrix}}}which negates the time coordinate only, because these transformations leave the spacetime interval invariant. HereI is the3 × 3identity matrix. These are both symmetric, they are their own inverses (seeInvolution (mathematics)), and each have determinant −1. This latter property makes them improper transformations.

IfΛ is a proper orthochronous Lorentz transformation, thenTΛ is improper antichronous,PΛ is improper orthochronous, andTPΛ =PTΛ is proper antichronous.

Inhomogeneous Lorentz group

[edit]

Two other spacetime symmetries have not been accounted for. In order for the spacetime interval to be invariant, it can be shown[24] that it is necessary and sufficient for the coordinate transformation to be of the formX=ΛX+C{\displaystyle X'=\Lambda X+C}whereC is a constant column containing translations in time and space. IfC ≠ 0, this is aninhomogeneous Lorentz transformation orPoincaré transformation.[25][26] IfC = 0, this is ahomogeneous Lorentz transformation. Poincaré transformations are not dealt further in this article.

Tensor formulation

[edit]
Main article:Representation theory of the Lorentz group
For the notation used, seeRicci calculus.

Contravariant vectors

[edit]

Writing the general matrix transformation of coordinates as the matrix equation[x0x1x2x3]=[Λ00Λ01Λ02Λ03x0Λ10Λ11Λ12Λ13x0Λ20Λ21Λ22Λ23x0Λ30Λ31Λ32Λ33x0][x0x0x1x0x2x0x3x0]{\displaystyle {\begin{bmatrix}{x'}^{0}\\{x'}^{1}\\{x'}^{2}\\{x'}^{3}\end{bmatrix}}={\begin{bmatrix}{\Lambda ^{0}}_{0}&{\Lambda ^{0}}_{1}&{\Lambda ^{0}}_{2}&{\Lambda ^{0}}_{3}{\vphantom {{x'}^{0}}}\\{\Lambda ^{1}}_{0}&{\Lambda ^{1}}_{1}&{\Lambda ^{1}}_{2}&{\Lambda ^{1}}_{3}{\vphantom {{x'}^{0}}}\\{\Lambda ^{2}}_{0}&{\Lambda ^{2}}_{1}&{\Lambda ^{2}}_{2}&{\Lambda ^{2}}_{3}{\vphantom {{x'}^{0}}}\\{\Lambda ^{3}}_{0}&{\Lambda ^{3}}_{1}&{\Lambda ^{3}}_{2}&{\Lambda ^{3}}_{3}{\vphantom {{x'}^{0}}}\\\end{bmatrix}}{\begin{bmatrix}x^{0}{\vphantom {{x'}^{0}}}\\x^{1}{\vphantom {{x'}^{0}}}\\x^{2}{\vphantom {{x'}^{0}}}\\x^{3}{\vphantom {{x'}^{0}}}\end{bmatrix}}}allows the transformation of other physical quantities that cannot be expressed as four-vectors; e.g.,tensors orspinors of any order in 4-dimensional spacetime, to be defined. In the correspondingtensor index notation, the above matrix expression isxν=Λνμxμ,{\displaystyle {x'}^{\nu }={\Lambda ^{\nu }}_{\mu }x^{\mu },}

where lower and upper indices labelcovariant and contravariant components respectively,[27] and thesummation convention is applied. It is a standard convention to useGreek indices that take the value 0 for time components, and 1, 2, 3 for space components, whileLatin indices simply take the values 1, 2, 3, for spatial components (the opposite for Landau and Lifshitz). Note that the first index (reading left to right) corresponds in the matrix notation to arow index. The second index corresponds to the column index.

The transformation matrix is universal for allfour-vectors, not just 4-dimensional spacetime coordinates. IfA is any four-vector, then intensor index notationAν=ΛνμAμ.{\displaystyle {A'}^{\nu }={\Lambda ^{\nu }}_{\mu }A^{\mu }\,.}

Alternatively, one writesAν=ΛνμAμ.{\displaystyle A^{\nu '}={\Lambda ^{\nu '}}_{\mu }A^{\mu }\,.} in which the primed indices denote the indices ofA in the primed frame. For a generaln-component object one may writeXα=Π(Λ)αβXβ,{\displaystyle {X'}^{\alpha }={\Pi (\Lambda )^{\alpha }}_{\beta }X^{\beta }\,,} whereΠ is the appropriaterepresentation of the Lorentz group, ann ×n matrix for everyΛ. In this case, the indices shouldnot be thought of as spacetime indices (sometimes called Lorentz indices), and they run from1 ton. E.g., ifX is abispinor, then the indices are calledDirac indices.

Covariant vectors

[edit]

There are also vector quantities with covariant indices. They are generally obtained from their corresponding objects with contravariant indices by the operation oflowering an index; e.g.,xν=ημνxμ,{\displaystyle x_{\nu }=\eta _{\mu \nu }x^{\mu },}whereη is themetric tensor. (The linked article also provides more information about what the operation of raising and lowering indices really is mathematically.) The inverse of this transformation is given byxμ=ημνxν,{\displaystyle x^{\mu }=\eta ^{\mu \nu }x_{\nu },}where, when viewed as matrices,ημν is the inverse ofημν. As it happens,ημν =ημν. This is referred to asraising an index. To transform a covariant vectorAμ, first raise its index, then transform it according to the same rule as for contravariant4-vectors, then finally lower the index;Aν=ηρνΛρσημσAμ.{\displaystyle {A'}_{\nu }=\eta _{\rho \nu }{\Lambda ^{\rho }}_{\sigma }\eta ^{\mu \sigma }A_{\mu }.}

ButηρνΛρσημσ=(Λ1)μν,{\displaystyle \eta _{\rho \nu }{\Lambda ^{\rho }}_{\sigma }\eta ^{\mu \sigma }={\left(\Lambda ^{-1}\right)^{\mu }}_{\nu },}

That is, it is the(μ,ν)-component of theinverse Lorentz transformation. One defines (as a matter of notation),Λνμ(Λ1)μν,{\displaystyle {\Lambda _{\nu }}^{\mu }\equiv {\left(\Lambda ^{-1}\right)^{\mu }}_{\nu },}and may in this notation writeAν=ΛνμAμ.{\displaystyle {A'}_{\nu }={\Lambda _{\nu }}^{\mu }A_{\mu }.}

Now for a subtlety. The implied summation on the right hand side ofAν=ΛνμAμ=(Λ1)μνAμ{\displaystyle {A'}_{\nu }={\Lambda _{\nu }}^{\mu }A_{\mu }={\left(\Lambda ^{-1}\right)^{\mu }}_{\nu }A_{\mu }}is running overa row index of the matrix representingΛ−1. Thus, in terms of matrices, this transformation should be thought of as theinverse transpose ofΛ acting on the column vectorAμ. That is, in pure matrix notation,A=(Λ1)TA.{\displaystyle A'=\left(\Lambda ^{-1}\right)^{\mathrm {T} }A.}

This means exactly that covariant vectors (thought of as column matrices) transform according to thedual representation of the standard representation of the Lorentz group. This notion generalizes to general representations, simply replaceΛ withΠ(Λ).

Tensors

[edit]

IfA andB are linear operators on vector spacesU andV, then a linear operatorAB may be defined on thetensor product ofU andV, denotedUV according to[28]

(AB)(uv)=AuBv,uU,vV,uvUV.{\displaystyle (A\otimes B)(u\otimes v)=Au\otimes Bv,\qquad u\in U,v\in V,u\otimes v\in U\otimes V.}             (T1)

From this it is immediately clear that ifu andv are a four-vectors inV, thenuvT2VVV transforms as

uvΛuΛv=ΛμνuνΛρσvσ=ΛμνΛρσuνvσΛμνΛρσwνσ.{\displaystyle u\otimes v\rightarrow \Lambda u\otimes \Lambda v={\Lambda ^{\mu }}_{\nu }u^{\nu }\otimes {\Lambda ^{\rho }}_{\sigma }v^{\sigma }={\Lambda ^{\mu }}_{\nu }{\Lambda ^{\rho }}_{\sigma }u^{\nu }\otimes v^{\sigma }\equiv {\Lambda ^{\mu }}_{\nu }{\Lambda ^{\rho }}_{\sigma }w^{\nu \sigma }.}             (T2)

The second step uses the bilinearity of the tensor product and the last step defines a 2-tensor on component form, or rather, it just renames the tensoruv.

These observations generalize in an obvious way to more factors, and using the fact that a general tensor on a vector spaceV can be written as a sum of a coefficient (component!) times tensor products of basis vectors and basis covectors, one arrives at the transformation law for anytensor quantityT. It is given by[29]

Tθικαβζ=ΛαμΛβνΛζρΛθσΛιυΛκζTσυζμνρ,{\displaystyle T_{\theta '\iota '\cdots \kappa '}^{\alpha '\beta '\cdots \zeta '}={\Lambda ^{\alpha '}}_{\mu }{\Lambda ^{\beta '}}_{\nu }\cdots {\Lambda ^{\zeta '}}_{\rho }{\Lambda _{\theta '}}^{\sigma }{\Lambda _{\iota '}}^{\upsilon }\cdots {\Lambda _{\kappa '}}^{\zeta }T_{\sigma \upsilon \cdots \zeta }^{\mu \nu \cdots \rho },}             (T3)

whereΛχψ is defined above. This form can generally be reduced to the form for generaln-component objects given above with a single matrix (Π(Λ)) operating on column vectors. This latter form is sometimes preferred; e.g., for the electromagnetic field tensor.

Transformation of the electromagnetic field

[edit]
Lorentz boost of an electric charge; the charge is at rest in one frame or the other.
Main article:Electromagnetic tensor
Further information:classical electromagnetism and special relativity

Lorentz transformations can also be used to illustrate that themagnetic fieldB andelectric fieldE are simply different aspects of the same force — theelectromagnetic force, as a consequence of relative motion betweenelectric charges and observers.[30] The fact that the electromagnetic field shows relativistic effects becomes clear by carrying out a simple thought experiment.[31]

  • An observer measures a charge at rest in frameF. The observer will detect a static electric field. As the charge is stationary in this frame, there is no electric current, so the observer does not observe any magnetic field.
  • The other observer in frameF′ moves at velocityv relative toF and the charge.This observer sees a different electric field because the charge moves at velocityv in their rest frame. The motion of the charge corresponds to anelectric current, and thus the observer in frameF′ also sees a magnetic field.

The electric and magnetic fields transform differently from space and time, but exactly the same way as relativistic angular momentum and the boost vector.

The electromagnetic field strength tensor is given byFμν=[01cEx1cEy1cEz1cEx0BzBy1cEyBz0Bx1cEzByBx0]{\displaystyle F^{\mu \nu }={\begin{bmatrix}0&-{\frac {1}{c}}E_{\text{x}}&-{\frac {1}{c}}E_{\text{y}}&-{\frac {1}{c}}E_{\text{z}}\\{\frac {1}{c}}E_{\text{x}}&0&-B_{\text{z}}&B_{\text{y}}\\{\frac {1}{c}}E_{\text{y}}&B_{\text{z}}&0&-B_{\text{x}}\\{\frac {1}{c}}E_{\text{z}}&-B_{\text{y}}&B_{\text{x}}&0\end{bmatrix}}}in with signature(+, −, −, −). In relativity, the factorc may be absorbed into the tensor components to eliminate its explicit appearance in expressions.[32] Consider a Lorentz boost in thex-direction. It is given by[33]Λμν=[γγβ00γβγ0000100001],Fμν=[0ExEyEzEx0BzByEyBz0BxEzByBx0],{\displaystyle {\Lambda ^{\mu }}_{\nu }={\begin{bmatrix}\gamma &-\gamma \beta &0&0\\-\gamma \beta &\gamma &0&0\\0&0&1&0\\0&0&0&1\\\end{bmatrix}},\qquad F^{\mu \nu }={\begin{bmatrix}0&E_{\text{x}}&E_{\text{y}}&E_{\text{z}}\\-E_{\text{x}}&0&B_{\text{z}}&-B_{\text{y}}\\-E_{\text{y}}&-B_{\text{z}}&0&B_{\text{x}}\\-E_{\text{z}}&B_{\text{y}}&-B_{\text{x}}&0\end{bmatrix}},}where the signature is(−, +, +, +) and the field tensor is displayed side by side for easiest possible reference in the manipulations below.

The general transformation law(T3) becomesFμν=ΛμμΛννFμν.{\displaystyle F^{\mu '\nu '}={\Lambda ^{\mu '}}_{\mu }{\Lambda ^{\nu '}}_{\nu }F^{\mu \nu }.}

For the magnetic field one obtainsBx=F23=Λ2μΛ3νFμν=Λ22Λ33F23=1×1×Bx=Bx,By=F31=Λ3μΛ1νFμν=Λ33Λ1νF3ν=Λ33Λ10F30+Λ33Λ11F31=1×(βγ)(Ez)+1×γBy=γBy+βγEz=γ(Bβ×E)yBz=F12=Λ1μΛ2νFμν=Λ1μΛ22Fμ2=Λ10Λ22F02+Λ11Λ22F12=(γβ)×1×Ey+γ×1×Bz=γBzβγEy=γ(Bβ×E)z{\displaystyle {\begin{aligned}B_{x'}&=F^{2'3'}={\Lambda ^{2}}_{\mu }{\Lambda ^{3}}_{\nu }F^{\mu \nu }={\Lambda ^{2}}_{2}{\Lambda ^{3}}_{3}F^{23}=1\times 1\times B_{\text{x}}\\&=B_{\text{x}},\\B_{y'}&=F^{3'1'}={\Lambda ^{3}}_{\mu }{\Lambda ^{1}}_{\nu }F^{\mu \nu }={\Lambda ^{3}}_{3}{\Lambda ^{1}}_{\nu }F^{3\nu }={\Lambda ^{3}}_{3}{\Lambda ^{1}}_{0}F^{30}+{\Lambda ^{3}}_{3}{\Lambda ^{1}}_{1}F^{31}\\&=1\times (-\beta \gamma )(-E_{\text{z}})+1\times \gamma B_{\text{y}}=\gamma B_{\text{y}}+\beta \gamma E_{\text{z}}\\&=\gamma \left(\mathbf {B} -{\boldsymbol {\beta }}\times \mathbf {E} \right)_{\text{y}}\\B_{z'}&=F^{1'2'}={\Lambda ^{1}}_{\mu }{\Lambda ^{2}}_{\nu }F^{\mu \nu }={\Lambda ^{1}}_{\mu }{\Lambda ^{2}}_{2}F^{\mu 2}={\Lambda ^{1}}_{0}{\Lambda ^{2}}_{2}F^{02}+{\Lambda ^{1}}_{1}{\Lambda ^{2}}_{2}F^{12}\\&=(-\gamma \beta )\times 1\times E_{\text{y}}+\gamma \times 1\times B_{\text{z}}=\gamma B_{\text{z}}-\beta \gamma E_{\text{y}}\\&=\gamma \left(\mathbf {B} -{\boldsymbol {\beta }}\times \mathbf {E} \right)_{\text{z}}\end{aligned}}}

For the electric field resultsEx=F01=Λ0μΛ1νFμν=Λ01Λ10F10+Λ00Λ11F01=(γβ)(γβ)(Ex)+γγEx=γ2β2(Ex)+γ2Ex=Ex(1β2)γ2=Ex,Ey=F02=Λ0μΛ2νFμν=Λ0μΛ22Fμ2=Λ00Λ22F02+Λ01Λ22F12=γ×1×Ey+(βγ)×1×Bz=γEyβγBz=γ(E+β×B)yEz=F03=Λ0μΛ3νFμν=Λ0μΛ33Fμ3=Λ00Λ33F03+Λ01Λ33F13=γ×1×Ezβγ×1×(By)=γEz+βγBy=γ(E+β×B)z.{\displaystyle {\begin{aligned}E_{x'}&=F^{0'1'}={\Lambda ^{0}}_{\mu }{\Lambda ^{1}}_{\nu }F^{\mu \nu }={\Lambda ^{0}}_{1}{\Lambda ^{1}}_{0}F^{10}+{\Lambda ^{0}}_{0}{\Lambda ^{1}}_{1}F^{01}\\&=(-\gamma \beta )(-\gamma \beta )(-E_{\text{x}})+\gamma \gamma E_{\text{x}}=-\gamma ^{2}\beta ^{2}(E_{\text{x}})+\gamma ^{2}E_{\text{x}}=E_{\text{x}}(1-\beta ^{2})\gamma ^{2}\\&=E_{\text{x}},\\E_{y'}&=F^{0'2'}={\Lambda ^{0}}_{\mu }{\Lambda ^{2}}_{\nu }F^{\mu \nu }={\Lambda ^{0}}_{\mu }{\Lambda ^{2}}_{2}F^{\mu 2}={\Lambda ^{0}}_{0}{\Lambda ^{2}}_{2}F^{02}+{\Lambda ^{0}}_{1}{\Lambda ^{2}}_{2}F^{12}\\&=\gamma \times 1\times E_{\text{y}}+(-\beta \gamma )\times 1\times B_{\text{z}}=\gamma E_{\text{y}}-\beta \gamma B_{\text{z}}\\&=\gamma \left(\mathbf {E} +{\boldsymbol {\beta }}\times \mathbf {B} \right)_{\text{y}}\\E_{z'}&=F^{0'3'}={\Lambda ^{0}}_{\mu }{\Lambda ^{3}}_{\nu }F^{\mu \nu }={\Lambda ^{0}}_{\mu }{\Lambda ^{3}}_{3}F^{\mu 3}={\Lambda ^{0}}_{0}{\Lambda ^{3}}_{3}F^{03}+{\Lambda ^{0}}_{1}{\Lambda ^{3}}_{3}F^{13}\\&=\gamma \times 1\times E_{\text{z}}-\beta \gamma \times 1\times (-B_{\text{y}})=\gamma E_{\text{z}}+\beta \gamma B_{\text{y}}\\&=\gamma \left(\mathbf {E} +{\boldsymbol {\beta }}\times \mathbf {B} \right)_{\text{z}}.\end{aligned}}}

Here,β = (β, 0, 0) is used. These results can be summarized byE=EB=BE=γ(E+β×B)=γ(E+β×B),B=γ(Bβ×E)=γ(Bβ×E),{\displaystyle {\begin{aligned}\mathbf {E} _{\parallel '}&=\mathbf {E} _{\parallel }\\\mathbf {B} _{\parallel '}&=\mathbf {B} _{\parallel }\\\mathbf {E} _{\bot '}&=\gamma \left(\mathbf {E} _{\bot }+{\boldsymbol {\beta }}\times \mathbf {B} _{\bot }\right)=\gamma \left(\mathbf {E} +{\boldsymbol {\beta }}\times \mathbf {B} \right)_{\bot },\\\mathbf {B} _{\bot '}&=\gamma \left(\mathbf {B} _{\bot }-{\boldsymbol {\beta }}\times \mathbf {E} _{\bot }\right)=\gamma \left(\mathbf {B} -{\boldsymbol {\beta }}\times \mathbf {E} \right)_{\bot },\end{aligned}}}and are independent of the metric signature. For SI units, substituteEE/c.Misner, Thorne & Wheeler (1973) refer to this last form as the3 + 1 view as opposed to thegeometric view represented by the tensor expressionFμν=ΛμμΛννFμν,{\displaystyle F^{\mu '\nu '}={\Lambda ^{\mu '}}_{\mu }{\Lambda ^{\nu '}}_{\nu }F^{\mu \nu },}and make a strong point of the ease with which results that are difficult to achieve using the3 + 1 view can be obtained and understood. Only objects that have well defined Lorentz transformation properties (in fact underany smooth coordinate transformation) are geometric objects. In the geometric view, the electromagnetic field is a six-dimensional geometric object inspacetime as opposed to two interdependent, but separate, 3-vector fields inspace andtime. The fieldsE (alone) andB (alone) do not have well defined Lorentz transformation properties. The mathematical underpinnings are equations(T1) and(T2) that immediately yield(T3). One should note that the primed and unprimed tensors refer to thesame event in spacetime. Thus the complete equation with spacetime dependence isFμν(x)=ΛμμΛννFμν(Λ1x)=ΛμμΛννFμν(x).{\displaystyle F^{\mu '\nu '}\left(x'\right)={\Lambda ^{\mu '}}_{\mu }{\Lambda ^{\nu '}}_{\nu }F^{\mu \nu }\left(\Lambda ^{-1}x'\right)={\Lambda ^{\mu '}}_{\mu }{\Lambda ^{\nu '}}_{\nu }F^{\mu \nu }(x).}

Length contraction has an effect oncharge densityρ andcurrent densityJ, and time dilation has an effect on the rate of flow of charge (current), so charge and current distributions must transform in a related way under a boost. It turns out they transform exactly like the space-time and energy-momentum four-vectors,j=jγρvn+(γ1)(jn)nρ=γ(ρjvnc2),{\displaystyle {\begin{aligned}\mathbf {j} '&=\mathbf {j} -\gamma \rho v\mathbf {n} +\left(\gamma -1\right)(\mathbf {j} \cdot \mathbf {n} )\mathbf {n} \\\rho '&=\gamma \left(\rho -\mathbf {j} \cdot {\frac {v\mathbf {n} }{c^{2}}}\right),\end{aligned}}}or, in the simpler geometric view,jμ=Λμμjμ.{\displaystyle j^{\mu '}={\Lambda ^{\mu '}}_{\mu }j^{\mu }.}

Charge density transforms as the time component of a four-vector. It is a rotational scalar. The current density is a 3-vector.

TheMaxwell equations are invariant under Lorentz transformations.

Spinors

[edit]

Equation(T1) hold unmodified for any representation of the Lorentz group, including thebispinor representation. In(T2) one simply replaces all occurrences ofΛ by the bispinor representationΠ(Λ),

uvΠ(Λ)uΠ(Λ)v=Π(Λ)αβuβΠ(Λ)ρσvσ=Π(Λ)αβΠ(Λ)ρσuβvσΠ(Λ)αβΠ(Λ)ρσwβσ{\displaystyle {\begin{aligned}u\otimes v\rightarrow \Pi (\Lambda )u\otimes \Pi (\Lambda )v&={\Pi (\Lambda )^{\alpha }}_{\beta }u^{\beta }\otimes {\Pi (\Lambda )^{\rho }}_{\sigma }v^{\sigma }\\&={\Pi (\Lambda )^{\alpha }}_{\beta }{\Pi (\Lambda )^{\rho }}_{\sigma }u^{\beta }\otimes v^{\sigma }\\&\equiv {\Pi (\Lambda )^{\alpha }}_{\beta }{\Pi (\Lambda )^{\rho }}_{\sigma }w^{\beta \sigma }\end{aligned}}}             (T4)

The above equation could, for instance, be the transformation of a state inFock space describing two free electrons.

Transformation of general fields

[edit]

A generalnoninteracting multi-particle state (Fock space state) inquantum field theory transforms according to the rule[34]

U(Λ,a)Ψp1σ1n1;p2σ2n2;=eiaμ[(Λp1)μ+(Λp2)μ+](Λp1)0(Λp2)0p10p20(σ1σ2Dσ1σ1(j1)[W(Λ,p1)]Dσ2σ2(j2)[W(Λ,p2)])ΨΛp1σ1n1;Λp2σ2n2;,{\displaystyle {\begin{aligned}&U(\Lambda ,a)\Psi _{p_{1}\sigma _{1}n_{1};p_{2}\sigma _{2}n_{2};\cdots }\\={}&e^{-ia_{\mu }\left[(\Lambda p_{1})^{\mu }+(\Lambda p_{2})^{\mu }+\cdots \right]}{\sqrt {\frac {(\Lambda p_{1})^{0}(\Lambda p_{2})^{0}\cdots }{p_{1}^{0}p_{2}^{0}\cdots }}}\left(\sum _{\sigma _{1}'\sigma _{2}'\cdots }D_{\sigma _{1}'\sigma _{1}}^{(j_{1})}\left[W(\Lambda ,p_{1})\right]D_{\sigma _{2}'\sigma _{2}}^{(j_{2})}\left[W(\Lambda ,p_{2})\right]\cdots \right)\Psi _{\Lambda p_{1}\sigma _{1}'n_{1};\Lambda p_{2}\sigma _{2}'n_{2};\cdots },\end{aligned}}}1

whereW(Λ,p) is theWigner's little group[35] andD(j) is the(2j + 1)-dimensional representation ofSO(3).

See also

[edit]

Footnotes

[edit]
  1. ^One can imagine that in each inertial frame there are observers positioned throughout space, each with a synchronized clock and at rest in the particular inertial frame. These observers then report to a central office, where all reports are collected. When one speaks of aparticular observer, one refers to someone having, at least in principle, a copy of this report. See, e.g.,Sard (1970).
  2. ^The separate requirements of the three equations lead to three different groups. The second equation is satisfied for spacetime translations in addition to Lorentz transformations leading to thePoincaré group or theinhomogeneous Lorentz group. The first equation (or the second restricted to lightlike separation) leads to a yet larger group, theconformal group of spacetime.
  3. ^The groupsO(3, 1) andO(1, 3) are isomorphic. It is widely believed that the choice between the two metric signatures has no physical relevance, even though some objects related toO(3, 1) andO(1, 3) respectively, e.g., theClifford algebras corresponding to the different signatures of the bilinear form associated to the two groups, are non-isomorphic.
  4. ^For two square matricesA andB,det(AB) = det(A)det(B)
  5. ^Explicitly,ζK=ζxKx+ζyKy+ζzKz{\displaystyle {\boldsymbol {\zeta }}\cdot \mathbf {K} =\zeta _{\text{x}}K_{\text{x}}+\zeta _{\text{y}}K_{\text{y}}+\zeta _{\text{z}}K_{\text{z}}}θJ=θxJx+θyJy+θzJz{\displaystyle {\boldsymbol {\theta }}\cdot \mathbf {J} =\theta _{\text{x}}J_{\text{x}}+\theta _{\text{y}}J_{\text{y}}+\theta _{\text{z}}J_{\text{z}}}
  6. ^Inquantum mechanics,relativistic quantum mechanics, andquantum field theory, a different convention is used for these matrices; the right hand sides are all multiplied by a factor of the imaginary uniti =−1.
  7. ^Until now the term "vector" has exclusively referred to "Euclidean vector", examples are positionr, velocityv, etc. The term "vector" applies much more broadly than Euclidean vectors, row or column vectors, etc., seeLinear algebra andVector space for details. The generators of a Lie group also form a vector space over afield of numbers (e.g.real numbers,complex numbers), since alinear combination of the generators is also a generator. They just live in a different space to the position vectors in ordinary 3-dimensional space.
  8. ^In ordinary 3-dimensionalposition space, the position vectorr =xex +yey +zez is expressed as a linear combination of the Cartesian unit vectorsex,ey,ez which form a basis, and the Cartesian coordinatesx, y, z are coordinates with respect to this basis.

Notes

[edit]
  1. ^Rao, K. N. Srinivasa (1988).The Rotation and Lorentz Groups and Their Representations for Physicists (illustrated ed.). John Wiley & Sons. p. 213.ISBN 978-0-470-21044-4.Equation 6-3.24, page 210
  2. ^Forshaw & Smith 2009
  3. ^Cottingham & Greenwood 2007, p. 21
  4. ^Lorentz 1904
  5. ^O'Connor & Robertson 1996
  6. ^Brown 2003
  7. ^Rothman 2006, pp. 112f
  8. ^Darrigol 2005, pp. 1–22
  9. ^Macrossan 1986, pp. 232–34
  10. ^The reference is within the following paper:Poincaré 1905, pp. 1504–1508
  11. ^Einstein 1905, pp. 891–921
  12. ^Young & Freedman 2008
  13. ^Forshaw & Smith 2009
  14. ^Moses Fayngold (2008).Special Relativity and How it Works (illustrated ed.). John Wiley & Sons. p. 102.ISBN 978-3-527-40607-4.Extract of page 102
  15. ^Mircea S. Rogalski; Stuart B. Palmer (2018).Advanced University Physics (2nd, revised ed.). CRC Press. p. 70.ISBN 978-1-4200-5712-6.Extract of page 70
  16. ^Steane, Andrew M. (2012).Relativity Made Relatively Easy (illustrated ed.). OUP Oxford. p. 124.ISBN 978-0-19-966286-9.Extract of page 124
  17. ^Steane, Andrew."The Lorentz transformation"(PDF).
  18. ^George Arfken (2012).International Edition University Physics. Elsevier. p. 367.ISBN 978-0-323-14203-8.Extract of page 367
  19. ^E.R. Dobbs (2013).Basic Electromagnetism (illustrated ed.). Springer Science & Business Media. p. 113.ISBN 978-94-011-2112-5.Extract of page 113
  20. ^Einstein 1916
  21. ^Barut 1964, p. 18–19
  22. ^Chaichian & Hagedorn 1997, p. 239
  23. ^Furry, W. H. (1955-11-01)."Lorentz Transformation and the Thomas Precession".American Journal of Physics.23 (8):517–525.Bibcode:1955AmJPh..23..517F.doi:10.1119/1.1934085.ISSN 0002-9505.
  24. ^Weinberg 1972
  25. ^Weinberg 2005, pp. 55–58
  26. ^Ohlsson 2011, p. 3–9
  27. ^Dennery & Krzywicki 2012, p. 138
  28. ^Hall 2003, Chapter 4
  29. ^Carroll 2004, p. 22
  30. ^Grant & Phillips 2008
  31. ^Griffiths 2007
  32. ^Jackson 1975, p. [page needed]
  33. ^Misner, Thorne & Wheeler 1973
  34. ^Weinberg 2002, Chapter 3
  35. ^"INSPIRE".inspirehep.net. Retrieved2024-09-04.

References

[edit]

Websites

[edit]

Papers

[edit]

Books

[edit]

Further reading

[edit]

External links

[edit]
Wikisource has original works on the topic:Relativity
Wikibooks has a book on the topic of:special relativity
Wikiversity has learning resources aboutLorentz transformations
Special
relativity
Background
Fundamental
concepts
Formulation
Phenomena
Spacetime
General
relativity
Background
Fundamental
concepts
Formulation
Phenomena
Advanced
theories
Solutions
Scientists
International
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Lorentz_transformation&oldid=1319731784"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp