These matrices are named after the physicistWolfgang Pauli. Inquantum mechanics, they occur in thePauli equation, which takes into account the interaction of thespin of a particle with an externalelectromagnetic field. They also represent the interaction states of two polarization filters for horizontal/vertical polarization, 45 degree polarization (right/left), and circular polarization (right/left).
Each Pauli matrix isHermitian, and together with the identity matrix (sometimes considered as the zeroth Pauli matrix), the Pauli matrices form abasis of thevector space of Hermitian matrices over thereal numbers, under addition. This means that anyHermitian matrix can be written in a unique way as alinear combination of Pauli matrices, with all coefficients being real numbers.
The Pauli matrices satisfy the useful product relation:
Hermitian operators representobservables in quantum mechanics, so the Pauli matrices span the space of observables of thecomplex two-dimensionalHilbert space. In the context of Pauli's work, represents the observable corresponding to spin along theth coordinate axis in three-dimensionalEuclidean space.
Cayley table; the entry shows the value of the row times the column.
×
All three of the Pauli matrices can be compacted into a single expression:
This expression is useful for "selecting" any one of the matrices numerically by substituting values of in turn useful when any of the matrices (but no particular one) is to be used in algebraic manipulations.
from which we can deduce that each matrix haseigenvalues.
With the inclusion of the identity matrix (sometimes denoted), the Pauli matrices form an orthogonal basis (in the sense ofHilbert–Schmidt) of theHilbert space of Hermitian matrices over and the Hilbert space of allcomplex matrices over.
The Pauli vector is defined by[b]where,, and are an equivalent notation for the more familiar,, and.
The Pauli vector provides a mapping mechanism from a vector basis to a Pauli matrix basis[2] as follows:
More formally, this defines a map from to the vector space of traceless Hermitian matrices. This map encodes structures of as a normed vector space and as a Lie algebra (with thecross-product as its Lie bracket) via functions of matrices, making the map an isomorphism of Lie algebras. This makes the Pauli matrices intertwiners from the point of view of representation theory.
Another way to view the Pauli vector is as a Hermitian traceless matrix-valued dual vector, that is, an element of that maps
Each component of can be recovered from the matrix (seecompleteness relation below)This constitutes an inverse to the map, making it manifest that the map is a bijection.
The norm is given by the determinant (up to a minus sign)Then, considering the conjugation action of an matrix on this space of matrices,
we find and that is Hermitian and traceless. It then makes sense to define where has the same norm as and therefore interpret as a rotation of three-dimensional space. In fact, it turns out that thespecial restriction on implies that the rotation is orientation preserving. This allows the definition of a map given by
where This map is the concrete realization of the double cover of by and therefore shows that The components of can be recovered using the tracing process above:
The cross-product is given by the matrix commutator (up to a factor of)In fact, the existence of a norm follows from the fact that is a Lie algebra (seeKilling form).
This cross-product can be used to prove the orientation-preserving property of the map above.
The eigenvalues of are This follows immediately from tracelessness and explicitly computing the determinant.
More abstractly, without computing the determinant, which requires explicit properties of the Pauli matrices, this follows from since this can be factorised into A standard result in linear algebra (a linear map that satisfies a polynomial equation written in distinct linear factors isdiagonalizable) means this implies is diagonalizable with possible eigenvalues The tracelessness of means it has exactly one of each eigenvalue.
Its normalized eigenvectors areThese expressions become singular for They can be rescued by letting and taking the limit which yields the correct eigenvectors(0,1) and (1,0) of
Alternatively, one may use spherical coordinates to obtain the eigenvectors and
The Pauli 4-vector, used in spinor theory, is written with components
This defines a map from to the vector space of Hermitian matrices,
which also encodes theMinkowski metric (withmostly minus convention) in its determinant:
This 4-vector also has a completeness relation. It is convenient to define a second Pauli 4-vector
and allow raising and lowering using the Minkowski metric tensor. The relation can then be written
Similarly to the Pauli 3-vector case, we can find a matrix group that acts as isometries on in this case the matrix group is and this shows Similarly to above, this can be explicitly realized for with components
In fact, the determinant property follows abstractly from trace properties of the For matrices, the following identity holds:
That is, the 'cross-terms' can be written as traces. When are chosen to be different the cross-terms vanish. It then follows, now showing summation explicitly, Since the matrices are this is equal to
Pauli vectors elegantly map these commutation and anticommutation relations to corresponding vector products. Adding the commutator to the anticommutator gives
so that,
Contracting each side of the equation with components of two3-vectorsap andbq (which commute with the Pauli matrices, i.e.,apσq =σqap) for each matrixσq and vector componentap (and likewise withbq) yields
Ifi is identified with the pseudoscalarσx σy σz then the right hand side becomes which is also the definition for the product of two vectors in geometric algebra.
If we define the spin operator asJ =ħ/2σ , thenJ satisfies the commutation relation: Or equivalently, the Pauli vector satisfies:
while the determinant of the exponential itself is just1, which makes it thegeneric group element ofSU(2).
A more abstract version of formula(2) for a general2 × 2 matrix can be found in the article onmatrix exponentials. A general version of(2) for an analytic (ata and−a) function is provided by application ofSylvester's formula,[3]
It is also straightforward to likewise work out the adjoint action on the Pauli vector, namely rotation of any angle along any axis:
Taking the dot product of any unit vector with the above formula generates the expression of any single qubit operator under any rotation. For example, it can be shown that
An alternative notation that is commonly used for the Pauli matrices is to write the vector indexk in the superscript, and the matrix indices as subscripts, so that the element in rowα and columnβ of thek-th Pauli matrix isσkαβ .
In this notation, thecompleteness relation for the Pauli matrices can be written
Proof
The fact that the Pauli matrices, along with the identity matrixI, form an orthogonal basis for the Hilbert space of all 2 × 2complex matrices over means that we can express any 2 × 2 complex matrixM aswherec is a complex number, anda is a 3-component, complex vector. It is straightforward to show, using the properties listed above, thatwhere "tr" denotes thetrace, and hence thatwhich can be rewritten in terms of matrix indices aswheresummation over the repeated indices is impliedγ andδ. Since this is true for any choice of the matrixM, the completeness relation follows as stated above.Q.E.D.
As noted above, it is common to denote the 2 × 2 unit matrix byσ0 , soσ0αβ =δαβ . The completeness relation can alternatively be expressed as
The fact that any Hermitiancomplex 2 × 2 matrices can be expressed in terms of the identity matrix and the Pauli matrices also leads to theBloch sphere representation of 2 × 2mixed states’ density matrix, (positive semidefinite 2 × 2 matrices with unit trace. This can be seen by first expressing an arbitrary Hermitian matrix as a real linear combination of{σ0,σ1,σ2,σ3} as above, and then imposing the positive-semidefinite andtrace1 conditions.
For a pure state, in polar coordinates, theidempotent density matrix
acts on the state eigenvector with eigenvalue +1, hence it acts like aprojection operator.
Its eigenvalues are therefore[d] 1 or −1. It may thus be utilized as an interaction term in a Hamiltonian, splitting the energy eigenvalues of its symmetric versus antisymmetric eigenstates.
The groupSU(2) is theLie group ofunitary2 × 2 matrices with unit determinant; itsLie algebra is the set of all2 × 2 anti-Hermitian matrices with trace 0. Direct calculation, as above, shows that theLie algebra is the three-dimensional real algebraspanned by the set{iσk}. In compact notation,
As a result, eachiσj can be seen as aninfinitesimal generator of SU(2). The elements of SU(2) are exponentials of linear combinations of these three generators, and multiply as indicated above in discussing the Pauli vector. Although this suffices to generate SU(2), it is not a properrepresentation ofsu(2), as the Pauli eigenvalues are scaled unconventionally. The conventional normalization isλ =1/2 , so that
The Lie algebra isisomorphic to the Lie algebra, which corresponds to the Lie groupSO(3), thegroup ofrotations in three-dimensional space. In other words, one can say that the i σj are a realization (and, in fact, the lowest-dimensional realization) ofinfinitesimal rotations in three-dimensional space. However, even though and are isomorphic as Lie algebras,SU(2) andSO(3) are not isomorphic as Lie groups.SU(2) is actually adouble cover ofSO(3), meaning that there is a two-to-one group homomorphism fromSU(2) ↦SO(3) , seerelationship between SO(3) and SU(2).
The real linear span of{I, iσ1,i σ2,i σ3} is isomorphic to the real algebra ofquaternions,, represented by the span of the basis vectors The isomorphism from to this set is given by the following map (notice the reversed signs for the Pauli matrices):
Alternatively, the isomorphism can be achieved by a map using the Pauli matrices in reversed order,[5]
As the set ofversors forms a group isomorphic toSU(2),U gives yet another way of describingSU(2). The two-to-one homomorphism fromSU(2) toSO(3) may be given in terms of the Pauli matrices in this formulation.
Inclassical mechanics, Pauli matrices are useful in the context of the Cayley–Klein parameters.[6] The matrix corresponding to the position of a point in space is defined in terms of the above Pauli vector matrix,
Consequently, the transformation matrix for rotations about the-axis through an angle may be written in terms of Pauli matrices and the unit matrix as[6]
Similar expressions follow for general Pauli vector rotations as detailed above.
An interesting property of spin1⁄2 particles is that they must be rotated by an angle of in order to return to their original configuration. This is due to the two-to-one correspondence between SU(2) and SO(3) mentioned above, and the fact that, although one visualizes spin up/down as the north–south pole on the2-sphere they are actually represented byorthogonal vectors in the two-dimensional complexHilbert space.
For a spin1⁄2 particle, the spin operator is given by, thefundamental representation ofSU(2). By takingKronecker products of this representation with itself repeatedly, one may construct all higher irreducible representations. That is, the resultingspin operators for higher spin systems in three spatial dimensions, for arbitrarily largej, can be calculated using thisspin operator andladder operators. They can be found inRotation group SO(3) § A note on Lie algebras. The analog formula to the above generalization of Euler's formula for Pauli matrices, the group element in terms of spin matrices, is tractable, but less simple.[7]
Also useful in thequantum mechanics of multiparticle systems, the generalPauli group is defined to consist of all-foldtensor products of Pauli matrices.
Inrelativistic quantum mechanics, the spinors in four dimensions are4 × 1 (or1 × 4) matrices. Hence the Pauli matrices or the Sigma matrices operating on these spinors have to be4 × 4 matrices. They are defined in terms of 2 × 2 Pauli matrices as
It follows from this definition that the matrices have the same algebraic properties as the σk matrices.
However,relativistic angular momentum is not a three-vector, but a second orderfour-tensor. Hence needs to be replaced byΣμν, the generator ofLorentz transformations on spinors. By the antisymmetry of angular momentum, theΣμν are also antisymmetric. Hence there are only six independent matrices.
The first three are the The remaining three, where theDiracαk matrices are defined as
The relativistic spin matricesΣμν are written in compact form in terms of commutator ofgamma matrices as
Inquantum information, single-qubitquantum gates are 2 × 2unitary matrices. The Pauli matrices are some of the most important single-qubit operations. In that context, the Cartan decomposition given above is called the "Z–Ydecomposition of a single-qubit gate". Choosing a different Cartan pair gives a similar "X–Ydecomposition of a single-qubit gate ".
^This conforms to the conventionin mathematics for thematrix exponential,i σ ⟼ exp(i σ) . In the conventionin physics,σ ⟼ exp(−i σ) , hence in it no pre-multiplication byi is necessary to land inSU(2) .
^The Pauli vector is a formal device. It may be thought of as an element of where thetensor product space is endowed with a mapping induced by thedot product on
^The relation amonga, b, c, n, m, k derived here in the2 × 2 representation holds forall representations ofSU(2), being agroup identity. Note that, by virtue of the standard normalization of that group's generators ashalf the Pauli matrices, the parametersa,b,c correspond tohalf the rotation angles of the rotation group. That is, the Gibbs formula linked amounts to
^Explicitly, in the convention of "right-space matrices into elements of left-space matrices", it is