Inlinear algebra andfunctional analysis, aspectral theorem is a result about when alinear operator ormatrix can bediagonalized (that is, represented as adiagonal matrix in some basis). This is extremely useful because computations involving a diagonalizable matrix can often be reduced to much simpler computations involving the corresponding diagonal matrix. The concept of diagonalization is relatively straightforward for operators onfinite-dimensional vector spaces but requires some modification for operators on infinite-dimensional spaces. In general, the spectral theorem identifies a class oflinear operators that can be modeled bymultiplication operators, which are as simple as one can hope to find. In more abstract language, the spectral theorem is a statement about commutativeC*-algebras. See alsospectral theory for a historical perspective.
Examples of operators to which the spectral theorem applies areself-adjoint operators or more generallynormal operators onHilbert spaces.
The spectral theorem also provides acanonical decomposition, called thespectral decomposition, of the underlying vector space on which the operator acts.
Augustin-Louis Cauchy proved the spectral theorem forsymmetric matrices, i.e., that every real, symmetric matrix is diagonalizable. In addition, Cauchy was the first to be systematic aboutdeterminants.[1][2] The spectral theorem as generalized byJohn von Neumann is today perhaps the most important result ofoperator theory.
This article mainly focuses on the simplest kind of spectral theorem, that for aself-adjoint operator on a Hilbert space. However, as noted above, the spectral theorem also holds for normal operators on a Hilbert space.
We begin by considering aHermitian matrix on (but the following discussion will be adaptable to the more restrictive case ofsymmetric matrices on). We consider aHermitian mapA on a finite-dimensionalcomplexinner product spaceV endowed with apositive definitesesquilinearinner product. The Hermitian condition on means that for allx,y ∈V,
An equivalent condition is thatA* =A, whereA* is theHermitian conjugate ofA. In the case thatA is identified with a Hermitian matrix, the matrix ofA* is equal to itsconjugate transpose. (IfA is areal matrix, then this is equivalent toAT =A, that is,A is asymmetric matrix.)
This condition implies that all eigenvalues of a Hermitian map are real: To see this, it is enough to apply it to the case whenx =y is an eigenvector. (Recall that aneigenvector of a linear mapA is a non-zero vectorv such thatAv =λv for some scalarλ. The valueλ is the correspondingeigenvalue. Moreover, theeigenvalues are roots of thecharacteristic polynomial.)
Theorem—IfA is Hermitian onV, then there exists anorthonormal basis ofV consisting of eigenvectors ofA. Each eigenvalue ofA is real.
We provide a sketch of a proof for the case where the underlying field of scalars is thecomplex numbers.
By thefundamental theorem of algebra, applied to thecharacteristic polynomial ofA, there is at least one complex eigenvalueλ1 and corresponding eigenvectorv1, which must by definition be non-zero. Then since we find thatλ1 is real. Now consider the space, theorthogonal complement ofv1. By Hermiticity, is aninvariant subspace ofA. To see that, consider any so that by definition of. To satisfy invariance, we need to check if. This is true because,. Applying the same argument to shows thatA has at least one real eigenvalue and corresponding eigenvector. This can be used to build another invariant subspace. Finite induction then finishes the proof.
The matrix representation ofA in a basis of eigenvectors is diagonal, and by the construction the proof gives a basis of mutually orthogonal eigenvectors; by choosing them to be unit vectors one obtains an orthonormal basis of eigenvectors.A can be written as a linear combination of pairwise orthogonal projections, called itsspectral decomposition. Letbe the eigenspace corresponding to an eigenvalue. Note that the definition does not depend on any choice of specific eigenvectors. In general,V is the orthogonal direct sum of the spaces where the ranges over thespectrum of.
When the matrix being decomposed is Hermitian, the spectral decomposition is a special case of theSchur decomposition (see the proof in case ofnormal matrices below).
The spectral decomposition is a special case of thesingular value decomposition, which states that any matrix can be expressed as, where and areunitary matrices and is a diagonal matrix. The diagonal entries of are uniquely determined by and are known as thesingular values of. If is Hermitian, then and which implies.
The spectral theorem extends to a more general class of matrices. LetA be an operator on a finite-dimensional inner product space.A is said to benormal ifA*A =AA*.
One can show thatA is normal if and only if it is unitarily diagonalizable using theSchur decomposition. That is, any matrix can be written asA =UTU*, whereU is unitary andT isupper triangular.IfA is normal, then one sees thatTT* =T*T. Therefore,T must be diagonal since a normal upper triangular matrix is diagonal (seenormal matrix). The converse is obvious.
In other words,A is normal if and only if there exists aunitary matrixU such thatwhereD is adiagonal matrix. Then, the entries of the diagonal ofD are theeigenvalues ofA. The column vectors ofU are the eigenvectors ofA and they are orthonormal. Unlike the Hermitian case, the entries ofD need not be real.
In the more general setting of Hilbert spaces, which may have an infinite dimension, the statement of the spectral theorem forcompactself-adjoint operators is virtually the same as in the finite-dimensional case.
Theorem—SupposeA is a compact self-adjoint operator on a (real or complex) Hilbert spaceV. Then there is anorthonormal basis ofV consisting of eigenvectors ofA. Each eigenvalue is real.
As for Hermitian matrices, the key point is to prove the existence of at least one nonzero eigenvector. One cannot rely on determinants to show existence of eigenvalues, but one can use a maximization argument analogous to the variational characterization of eigenvalues.
If the compactness assumption is removed, then it isnot true that every self-adjoint operator has eigenvectors. For example, the multiplication operator on which takes each to is bounded and self-adjoint, but has no eigenvectors. However, its spectrum, suitably defined, is still equal to, see spectrum of bounded operator.
The next generalization we consider is that ofbounded self-adjoint operators on a Hilbert space. Such operators may have no eigenvectors: for instance letA be the operator of multiplication byt on, that is,[3]
This operator does not have any eigenvectorsin, though it does have eigenvectors in a larger space. Namely thedistribution, where is theDirac delta function, is an eigenvector when construed in an appropriate sense. The Dirac delta function is however not a function in the classical sense and does not lie in the Hilbert spaceL2[0, 1]. Thus, the delta-functions are "generalized eigenvectors" of but not eigenvectors in the usual sense.
In the absence of (true) eigenvectors, one can look for a "spectral subspace" consisting of analmost eigenvector, i.e, a closed subspace of associated with aBorel set in thespectrum of. This subspace can be thought of as the closed span of generalized eigenvectors for with eigenvalues in.[4] In the above example, where we might consider the subspace of functions supported on a small interval inside. This space is invariant under and for any in this subspace, is very close to. Each subspace, in turn, is encoded by the associated projection operator, and the collection of all the subspaces is then represented by aprojection-valued measure.
One formulation of the spectral theorem expresses the operatorA as an integral of the coordinate function over the operator's spectrum with respect to a projection-valued measure.[5]When the self-adjoint operator in question iscompact, this version of the spectral theorem reduces to something similar to the finite-dimensional spectral theorem above, except that the operator is expressed as a finite or countably infinite linear combination of projections, that is, the measure consists only of atoms.
An alternative formulation of the spectral theorem says that every bounded self-adjoint operator is unitarily equivalent to a multiplication operator, a relatively simple type of operator.
Theorem[6]—Let be a bounded self-adjoint operator on a Hilbert space. Then there is ameasure space and a real-valuedessentially bounded measurable function on and aunitary operator such thatwhere is themultiplication operator:and.
Multiplication operators are a direct generalization of diagonal matrices. A finite-dimensional Hermitian vector space may be coordinatized as the space of functions from a basis to the complex numbers, so that the-coordinates of a vector are the values of the corresponding function. The finite-dimensional spectral theorem for a self-adjoint operator states that there exists an orthonormal basis of eigenvectors, so that the inner product becomes thedot product with respect to the-coordinates: thus is isomorphic to for the discrete unit measure on. Also is unitarily equivalent to the multiplication operator, where is the eigenvalue of: that is, multiplies each-coordinate by the corresponding eigenvalue, the action of a diagonal matrix. Finally, theoperator norm is equal to the magnitude of the largest eigenvector.
The spectral theorem is the beginning of the vast research area of functional analysis calledoperator theory; see alsospectral measure.
There is also an analogous spectral theorem for boundednormal operators on Hilbert spaces. The only difference in the conclusion is that now may be complex-valued.
There is also a formulation of the spectral theorem in terms ofdirect integrals. It is similar to the multiplication-operator formulation, but more canonical.
Let be a bounded self-adjoint operator and let be the spectrum of. The direct-integral formulation of the spectral theorem associates two quantities to. First, a measure on, and second, a family of Hilbert spaces We then form the direct integral Hilbert spaceThe elements of this space are functions (or "sections") such that for all. The direct-integral version of the spectral theorem may be expressed as follows:[7]
Theorem—If is a bounded self-adjoint operator, then is unitarily equivalent to the "multiplication by" operator onfor some measure and some family of Hilbert spaces. The measure is uniquely determined by up to measure-theoretic equivalence; that is, any two measure associated to the same have the same sets of measure zero. The dimensions of the Hilbert spaces are uniquely determined by up to a set of-measure zero.
The spaces can be thought of as something like "eigenspaces" for. Note, however, that unless the one-element set has positive measure, the space is not actually a subspace of the direct integral. Thus, the's should be thought of as "generalized eigenspace"—that is, the elements of are "eigenvectors" that do not actually belong to the Hilbert space.
Although both the multiplication-operator and direct integral formulations of the spectral theorem express a self-adjoint operator as unitarily equivalent to a multiplication operator, the direct integral approach is more canonical. First, the set over which the direct integral takes place (the spectrum of the operator) is canonical. Second, the function we are multiplying by is canonical in the direct-integral approach: Simply the function.
A vector is called acyclic vector for if the vectors span a dense subspace of the Hilbert space. Suppose is a bounded self-adjoint operator for which a cyclic vector exists. In that case, there is no distinction between the direct-integral and multiplication-operator formulations of the spectral theorem. Indeed, in that case, there is a measure on the spectrum of such that is unitarily equivalent to the "multiplication by" operator on.[8] This result represents simultaneously as a multiplication operatorand as a direct integral, since is just a direct integral in which each Hilbert space is just.
Not every bounded self-adjoint operator admits a cyclic vector; indeed, by the uniqueness in the direct integral decomposition, this can occur only when all the's have dimension one. When this happens, we say that has "simple spectrum" in the sense ofspectral multiplicity theory. That is, a bounded self-adjoint operator that admits a cyclic vector should be thought of as the infinite-dimensional generalization of a self-adjoint matrix with distinct eigenvalues (i.e., each eigenvalue has multiplicity one).
Although not every admits a cyclic vector, it is easy to see that we can decompose the Hilbert space as a direct sum of invariant subspaces on which has a cyclic vector. This observation is the key to the proofs of the multiplication-operator and direct-integral forms of the spectral theorem.
One important application of the spectral theorem (in whatever form) is the idea of defining afunctional calculus. That is, given a function defined on the spectrum of, we wish to define an operator. If is simply a positive power,, then is just the-th power of,. The interesting cases are where is a nonpolynomial function such as a square root or an exponential. Either of the versions of the spectral theorem provides such a functional calculus.[9] In the direct-integral version, for example, acts as the "multiplication by" operator in the direct integral:That is to say, each space in the direct integral is a (generalized) eigenspace for with eigenvalue.
Many important linear operators which occur inanalysis, such asdifferential operators, areunbounded. There is also a spectral theorem forself-adjoint operators that applies in these cases. To give an example, every constant-coefficient differential operator is unitarily equivalent to a multiplication operator. Indeed, the unitary operator that implements this equivalence is theFourier transform; the multiplication operator is a type ofFourier multiplier.
In general, spectral theorem for self-adjoint operators may take several equivalent forms.[10] Notably, all of the formulations given in the previous section for bounded self-adjoint operators—the projection-valued measure version, the multiplication-operator version, and the direct-integral version—continue to hold for unbounded self-adjoint operators, with small technical modifications to deal with domain issues. Specifically, the only reason the multiplication operator on is bounded, is due to the choice of domain. The same operator on, e.g., would be unbounded.
The notion of "generalized eigenvectors" naturally extends to unbounded self-adjoint operators, as they are characterized asnon-normalizable eigenvectors. Contrary to the case ofalmost eigenvectors, however, the eigenvalues can be real or complex and, even if they are real, do not necessarily belong to the spectrum. Though, for self-adjoint operators there always exist a real subset of "generalized eigenvalues" such that the corresponding set of eigenvectors iscomplete.[11]