This articlemay need to be rewritten to comply with Wikipedia'squality standards, as it is written like a maths textbook, not an encyclopedia article.You can help. Thetalk page may contain suggestions.(September 2017) |
In the mathematical discipline offunctional analysis, the concept of acompact operator on Hilbert space is an extension of the concept of a matrix acting on a finite-dimensional vector space; inHilbert space,compact operators are precisely the closure offinite-rank operators (representable by finite-dimensional matrices) in thetopology induced by theoperator norm. As such, results from matrix theory can sometimes be extended to compact operators using similar arguments. By contrast, the study of general operators on infinite-dimensional spaces often requires a genuinely different approach.
For example, thespectral theory of compact operators onBanach spaces takes a form that is very similar to theJordan canonical form of matrices. In the context of Hilbert spaces, a square matrix is unitarily diagonalizable if and only if it isnormal. A corresponding result holds for normal compact operators on Hilbert spaces. More generally, the compactness assumption can be dropped. As stated above, the techniques used to prove results, e.g., thespectral theorem, in the non-compact case are typically different, involving operator-valuedmeasures on thespectrum.
Some results for compact operators on Hilbert space will be discussed, starting with general properties before considering subclasses of compact operators.
Let be aHilbert space and be the set ofbounded operators on. Then, an operator is said to be acompact operator if the image of each bounded set under isrelatively compact.
We list in this section some general properties of compact operators.
IfX andY are separable Hilbert spaces (in fact,X Banach andY normed will suffice), thenT :X →Y is compact if and only if it issequentially continuous when viewed as a map fromX with theweak topology toY (with the norm topology). (See (Zhu 2007, Theorem 1.14, p.11), and note in this reference that the uniform boundedness will apply in the situation whereF ⊆X satisfies (∀φ ∈ Hom(X,K)) sup{x**(φ) = φ(x) :x} < ∞, whereK is the underlying field. The uniform boundedness principle applies since Hom(X,K) with the norm topology will be a Banach space, and the mapsx** : Hom(X,K) →K are continuous homomorphisms with respect to this topology.)
The family of compact operators is a norm-closed, two-sided, *-ideal inL(H). Consequently, a compact operatorT cannot have a bounded inverse ifH is infinite-dimensional. IfST =TS =I, then the identity operator would be compact, a contradiction.
If sequences of bounded operatorsBn →B,Cn →C in thestrong operator topology andT is compact, then converges to in norm.[1] For example, consider the Hilbert space with standard basis {en}. LetPm be the orthogonal projection on the linear span of {e1, ...,em}. The sequence {Pm} converges to the identity operatorI strongly but not uniformly. DefineT byT is compact, and, as claimed above,PmT →IT =T in the uniform operator topology: for allx,
Notice eachPm is a finite-rank operator. Similar reasoning shows that ifT is compact, thenT is the uniform limit of some sequence of finite-rank operators.
By the norm-closedness of the ideal of compact operators, the converse is also true.
The quotient C*-algebra ofL(H) modulo the compact operators is called theCalkin algebra, in which one can consider properties of an operator up to compact perturbation.
A bounded operatorT on a Hilbert spaceH is said to beself-adjoint ifT =T*, or equivalently,
It follows that ⟨Tx,x⟩ is real for everyx ∈H, thus eigenvalues ofT, when they exist, are real. When a closed linear subspaceL ofH is invariant underT, then the restriction ofT toL is a self-adjoint operator onL, and furthermore, theorthogonal complementL⊥ ofL is also invariant underT. For example, the spaceH can be decomposed as the orthogonaldirect sum of twoT–invariant closed linear subspaces: thekernel ofT, and the orthogonal complement(kerT)⊥ of the kernel (which is equal to the closure of the range ofT, for any bounded self-adjoint operator). These basic facts play an important role in the proof of the spectral theorem below.
The classification result for Hermitiann ×n matrices is thespectral theorem: IfM =M*, thenM is unitarily diagonalizable, and the diagonalization ofM has real entries. LetT be a compact self-adjoint operator on a Hilbert spaceH. We will prove the same statement forT: the operatorT can be diagonalized by an orthonormal set of eigenvectors, each of which corresponds to a real eigenvalue.
Theorem For every compact self-adjoint operatorT on a real or complex Hilbert spaceH, there exists anorthonormal basis ofH consisting of eigenvectors ofT. More specifically, the orthogonal complement of the kernel ofT admits either a finite orthonormal basis of eigenvectors ofT, or acountably infinite orthonormal basis {en} of eigenvectors ofT, with corresponding eigenvalues{λn} ⊂R, such thatλn → 0.
In other words, a compact self-adjoint operator can be unitarily diagonalized. This is the spectral theorem.
WhenH isseparable, one can mix the basis {en} with acountable orthonormal basis for the kernel ofT, and obtain an orthonormal basis {fn} forH, consisting of eigenvectors ofT with real eigenvalues {μn} such thatμn → 0.
Corollary For every compact self-adjoint operatorT on a real or complex separable infinite-dimensional Hilbert spaceH, there exists a countably infinite orthonormal basis {fn} ofH consisting of eigenvectors ofT, with corresponding eigenvalues{μn} ⊂R, such thatμn → 0.
Let us discuss first the finite-dimensional proof. Proving the spectral theorem for a Hermitiann ×n matrixT hinges on showing the existence of one eigenvectorx. Once this is done, Hermiticity implies that both the linear span and orthogonal complement ofx (of dimensionn − 1) are invariant subspaces ofT. The desired result is then obtained by induction for.
The existence of an eigenvector can be shown in (at least) two alternative ways:
Note. In the finite-dimensional case, part of the first approach works in much greater generality; any square matrix, not necessarily Hermitian, has an eigenvector. This is simply not true for general operators on Hilbert spaces. In infinite dimensions, it is also not immediate how to generalize the concept of the characteristic polynomial.
The spectral theorem for the compact self-adjoint case can be obtained analogously: one finds an eigenvector by extending the second finite-dimensional argument above, then apply induction. We first sketch the argument for matrices.
Since the closed unit sphereS inR2n is compact, andf is continuous,f(S) is compact on the real line, thereforef attains a maximum onS, at some unit vectory. ByLagrange's multiplier theorem,y satisfiesfor some λ. By Hermiticity,Ty = λy.
Alternatively, letz ∈Cn be any vector. Notice that if a unit vectory maximizes ⟨Tx,x⟩ on the unit sphere (or on the unit ball), it also maximizes theRayleigh quotient:
Consider the function:
By calculus,h′(0) = 0, i.e.,
Define:
After some algebra the above expression becomes (Re denotes the real part of a complex number)
Butz is arbitrary, thereforeTy −my = 0. This is the crux of proof for spectral theorem in the matricial case.
Note that while the Lagrange multipliers generalize to the infinite-dimensional case, the compactness of the unit sphere is lost. This is where the assumption that the operatorT be compact is useful.
Claim IfT is a compact self-adjoint operator on a non-zero Hilbert spaceH andthenm(T) or −m(T) is an eigenvalue ofT.
Ifm(T) = 0, thenT = 0 by thepolarization identity, and this case is clear. Consider the function
ReplacingT by −T if necessary, one may assume that the supremum off on the closed unit ballB ⊂H is equal tom(T) > 0. Iff attains its maximumm(T) onB at some unit vectory, then, by the same argument used for matrices,y is an eigenvector ofT, with corresponding eigenvalueλ = ⟨λy,y⟩ =⟨Ty,y⟩ =f(y) =m(T).
By theBanach–Alaoglu theorem and the reflexivity ofH, the closed unit ballB is weakly compact. Also, the compactness ofT means (see above) thatT :X with the weak topology →X with the norm topology is continuous[disputed –discuss]. These two facts imply thatf is continuous onB equipped with the weak topology, andf attains therefore its maximumm onB at somey ∈B. By maximality, which in turn implies thaty also maximizes the Rayleigh quotientg(x) (see above). This shows thaty is an eigenvector ofT, and ends the proof of the claim.
Note. The compactness ofT is crucial. In general,f need not be continuous for the weak topology on the unit ballB. For example, letT be the identity operator, which is not compact whenH is infinite-dimensional. Take any orthonormal sequence {yn}. Thenyn converges to 0 weakly, but limf(yn) = 1 ≠ 0 =f(0).
LetT be a compact operator on a Hilbert spaceH. A finite (possibly empty) or countably infinite orthonormal sequence {en} of eigenvectors ofT, with corresponding non-zero eigenvalues, is constructed by induction as follows. LetH0 =H andT0 =T. Ifm(T0) = 0, thenT = 0 and the construction stops without producing any eigenvectoren. Suppose that orthonormal eigenvectorse0, ...,en − 1 ofT have been found. ThenEn := span(e0, ...,en − 1) is invariant underT, and by self-adjointness, the orthogonal complementHn ofEn is an invariant subspace ofT. LetTn denote the restriction ofT toHn. Ifm(Tn) = 0, thenTn = 0, and the construction stops. Otherwise, by theclaim applied toTn, there is a norm one eigenvectoren ofT inHn, with corresponding non-zero eigenvalue λn =±m(Tn).
LetF = (span{en})⊥, where {en} is the finite or infinite sequence constructed by the inductive process; by self-adjointness,F is invariant underT. LetS denote the restriction ofT toF. If the process was stopped after finitely many steps, with the last vectorem−1, thenF=Hm andS =Tm = 0 by construction. In the infinite case, compactness ofT and the weak-convergence ofen to 0 imply thatTen =λnen → 0, thereforeλn → 0. SinceF is contained inHn for everyn, it follows thatm(S) ≤m({Tn}) = |λn| for everyn, hencem(S) = 0. This implies again thatS = 0.
The fact thatS = 0 means thatF is contained in the kernel ofT. Conversely, ifx ∈ ker(T) then by self-adjointness,x is orthogonal to every eigenvector {en} with non-zero eigenvalue. It follows thatF = ker(T), and that {en} is an orthonormal basis for the orthogonal complement of the kernel ofT. One can complete the diagonalization ofT by selecting an orthonormal basis of the kernel. This proves the spectral theorem.
A shorter but more abstract proof goes as follows: byZorn's lemma, selectU to be a maximal subset ofH with the following three properties: all elements ofU are eigenvectors ofT, they have norm one, and any two distinct elements ofU are orthogonal. LetF be the orthogonal complement of the linear span ofU. IfF ≠ {0}, it is a non-trivial invariant subspace ofT, and by the initial claim, there must exist a norm one eigenvectory ofT inF. But thenU ∪ {y} contradicts the maximality ofU. It follows thatF = {0}, hence span(U) is dense inH. This shows thatU is an orthonormal basis ofH consisting of eigenvectors ofT.
IfT is compact on an infinite-dimensional Hilbert spaceH, thenT is not invertible, hence σ(T), the spectrum ofT, always contains 0. The spectral theorem shows that σ(T) consists of the eigenvalues {λn} ofT and of 0 (if 0 is not already an eigenvalue). The set σ(T) is a compact subset of the complex numbers, and the eigenvalues are dense in σ(T).
Any spectral theorem can be reformulated in terms of afunctional calculus. In the present context, we have:
Theorem. LetC(σ(T)) denote theC*-algebra of continuous functions on σ(T). There exists a unique isometric homomorphismΦ :C(σ(T)) →L(H) such that Φ(1) =I and, iff is the identity functionf(λ) =λ, thenΦ(f) =T. Now we may define (clearly this would hold when is polynomial). Then it also holds, thatσ(g(T)) =g(σ(T)).
The functional calculus map Φ is defined in a natural way: let {en} be an orthonormal basis of eigenvectors forH, with corresponding eigenvalues {λn}; forf ∈C(σ(T)), the operator Φ(f), diagonal with respect to the orthonormal basis {en}, is defined by settingfor everyn. Since Φ(f) is diagonal with respect to an orthonormal basis, its norm is equal to the supremum of the modulus of diagonal coefficients,
The other properties of Φ can be readily verified. Conversely, any homomorphism Ψ satisfying the requirements of the theorem must coincide with Φ whenf is a polynomial. By theWeierstrass approximation theorem, polynomial functions are dense inC(σ(T)), and it follows thatΨ = Φ. This shows that Φ is unique.
The more generalcontinuous functional calculus can be defined for any self-adjoint (or even normal, in the complex case) bounded linear operator on a Hilbert space. The compact case described here is a particularly simple instance of this functional calculus.
Consider an Hilbert spaceH (e.g. the finite-dimensionalCn), and a commuting set of self-adjoint operators. Then under suitable conditions, it can be simultaneously (unitarily) diagonalized.Viz., there exists an orthonormal basisQ consisting of common eigenvectors for the operators — i.e.,
Lemma—Suppose all the operators in are compact. Then every closed non-zero-invariant sub-space has a common eigenvector for.
Case I: all the operators have each exactly one eigenvalue on. Take any of unit length. It is a common eigenvector.
Case II: there is some operator with at least 2 eigenvalues on and let. SinceT is compact and α is non-zero, we have is a finite-dimensional (and therefore closed) non-zero-invariant sub-space (because the operators all commute withT, we have for and, that). In particular, since α is just one of the eigenvalues of on, we definitely have. Thus we could in principle argue by induction over dimension, yielding that has a common eigenvector for.
Theorem 1—If all the operators in are compact then the operators can be simultaneously (unitarily) diagonalized.
The following setis partially ordered by inclusion. This clearly has the Zorn property. So takingQ a maximal member, ifQ is a basis for the whole Hilbert spaceH, we are done. If this were not the case, then letting, it is easy to see that this would be an-invariant non-trivial closed subspace; and thus by the lemma above, therein would lie a common eigenvector for the operators (necessarily orthogonal toQ). But then there would then be a proper extension ofQ withinP; a contradiction to its maximality.
Theorem 2—If there is an injective compact operator in; then the operators can be simultaneously (unitarily) diagonalized.
Fix compact injective. Then we have, by the spectral theory of compact symmetric operators on Hilbert spaces:where is a discrete, countable subset of positive real numbers, and all the eigenspaces are finite-dimensional. Since a commuting set, we have all the eigenspaces are invariant. Since the operators restricted to the eigenspaces (which are finite-dimensional) are automatically all compact, we can apply Theorem 1 to each of these, and find orthonormal basesQσ for the. SinceT0 is symmetric, we have thatis a (countable) orthonormal set. It is also, by the decomposition we first stated, a basis forH.
Theorem 3—IfH a finite-dimensional Hilbert space, and a commutative set of operators, each of which is diagonalisable; then the operators can be simultaneously diagonalized.
Case I: all operators have exactly one eigenvalue. Then any basis forH will do.
Case II: Fix an operator with at least two eigenvalues, and let so that is a symmetric operator. Now let α be an eigenvalue of. Then it is easy to see that both:are non-trivial-invariant subspaces. By induction over dimension we have that there are linearly independent basesQ1,Q2 for the subspaces, which demonstrate that the operators in can be simultaneously diagonalisable on the subspaces. Clearly then demonstrates that the operators in can be simultaneously diagonalised.
Notice we did not have to directly use the machinery of matrices at all in this proof. There are other versions which do.
We can strengthen the above to the case where all the operators merely commute with their adjoint; in this case we remove the term "orthogonal" from the diagonalisation. There are weaker results for operators arising from representations due to Weyl–Peter. LetG be a fixed locally compact hausdorff group, and (the space of square integrable measurable functions with respect to the unique-up-to-scale Haar measure onG). Consider the continuous shift action:
Then ifG were compact then there is a unique decomposition ofH into a countable direct sum of finite-dimensional, irreducible, invariant subspaces (this is essentially diagonalisation of the family of operators). IfG were not compact, but were abelian, then diagonalisation is not achieved, but we get a uniquecontinuous decomposition ofH into 1-dimensional invariant subspaces.
The family of Hermitian matrices is a proper subset of matrices that are unitarily diagonalizable. A matrixM is unitarily diagonalizable if and only if it is normal, i.e.,M*M =MM*. Similar statements hold for compact normal operators.
LetT be compact andT*T =TT*. Apply theCartesian decomposition toT: define
The self-adjoint compact operatorsR andJ are called the real and imaginary parts ofT, respectively. ThatT is compact implies thatT* and, consequently,R andJ are compact. Furthermore, the normality ofT implies thatR andJ commute. Therefore they can be simultaneously diagonalized, from which follows the claim.
Ahyponormal compact operator (in particular, asubnormal operator) is normal.
The spectrum of aunitary operatorU lies on the unit circle in the complex plane; it could be the entire unit circle. However, ifU is identity plus a compact perturbation,U has only a countable spectrum, containing 1 and possibly, a finite set or a sequence tending to 1 on the unit circle. More precisely, supposeU =I +C whereC is compact. The equationsUU* =U*U =I andC =U −I show thatC is normal. The spectrum ofC contains 0, and possibly, a finite set or a sequence tending to 0. SinceU =I +C, the spectrum ofU is obtained by shifting the spectrum ofC by 1.