Movatterモバイル変換

Riesz representation theorem

From Wikipedia, the free encyclopedia

Theorem about the dual of a Hilbert space

This article is about a theorem concerning the dual of a Hilbert space. For the theorems relating linear functionals to measures, seeRiesz–Markov–Kakutani representation theorem. For other theorems, seeRiesz theorem.

TheRiesz representation theorem, sometimes called theRiesz–Fréchet representation theorem afterFrigyes Riesz andMaurice René Fréchet, establishes an important connection between aHilbert space and itscontinuous dual space. If the underlyingfield is thereal numbers, the two areisometrically isomorphic; if the underlying field is thecomplex numbers, the two are isometricallyanti-isomorphic. The (anti-)isomorphism is a particularnatural isomorphism.

Preliminaries and notation

[edit]

Let $H {\displaystyle H}$ be aHilbert space over a field $\mathbb {F} ,$ where $\mathbb {F}$ is either the real numbers $\mathbb {R}$ or the complex numbers $\mathbb {C} .$ If $\mathbb {F} =\mathbb {C}$ (resp. if $\mathbb {F} =\mathbb {R}$ ) then $H {\displaystyle H}$ is called acomplex Hilbert space (resp. areal Hilbert space). Every real Hilbert space can be extended to be adense subset of a unique (up tobijective isometry) complex Hilbert space, called itscomplexification, which is why Hilbert spaces are often automatically assumed to be complex. Real and complex Hilbert spaces have in common many, but by no means all, properties and results/theorems.

This article is intended for bothmathematicians andphysicists and will describe the theorem for both. In both mathematics and physics, if a Hilbert space is assumed to be real (that is, if $\mathbb {F} =\mathbb {R}$ ) then this will usually be made clear. Often in mathematics, and especially in physics, unless indicated otherwise, "Hilbert space" is usually automatically assumed to mean "complex Hilbert space." Depending on the author, in mathematics, "Hilbert space" usually means either (1) a complex Hilbert space, or (2) a realor complex Hilbert space.

Linear and antilinear maps

[edit]

By definition, anantilinear map (also called aconjugate-linear map) $f:H\to Y$ is a map betweenvector spaces that isadditive: $f(x+y)=f(x)+f(y)\quad {\text{ for all }}x,y\in H,$ andantilinear (also calledconjugate-linear orconjugate-homogeneous): $f(cx)={\overline {c}}f(x)\quad {\text{ for all }}x\in H{\text{ and all scalar }}c\in \mathbb {F} ,$ where ${\overline {c}}$ is the conjugate of the complex number $c=a+bi$ , given by ${\overline {c}}=a-bi$ .

In contrast, a map $f:H\to Y$ islinear if it is additive andhomogeneous: $f(cx)=cf(x)\quad {\text{ for all }}x\in H\quad {\text{ and all scalars }}c\in \mathbb {F} .$

Every constant $0 {\displaystyle 0}$ map is always both linear and antilinear. If $\mathbb {F} =\mathbb {R}$ then the definitions of linear maps and antilinear maps are completely identical. A linear map from a Hilbert space into aBanach space (or more generally, from any Banach space into anytopological vector space) iscontinuous if and only if it isbounded; the same is true of antilinear maps. Theinverse of any antilinear (resp. linear) bijection is again an antilinear (resp. linear) bijection. The composition of twoantilinear maps is alinear map.

Continuous dual and anti-dual spaces

Afunctional on $H {\displaystyle H}$ is a function $H\to \mathbb {F}$ whosecodomain is the underlying scalar field $\mathbb {F} .$ Denote by $H^{*}$ (resp. by ${\overline {H}}^{*})$ the set of all continuous linear (resp. continuous antilinear) functionals on $H, {\displaystyle H,}$ which is called the(continuous) dual space (resp. the(continuous) anti-dual space) of $H . {\displaystyle H.}$ ^[1] If $\mathbb {F} =\mathbb {R}$ then linear functionals on $H {\displaystyle H}$ are the same as antilinear functionals and consequently, the same is true for such continuous maps: that is, $H^{*}={\overline {H}}^{*}.$

One-to-one correspondence between linear and antilinear functionals

Given any functional $f~:~H\to \mathbb {F} ,$ theconjugate of $f {\displaystyle f}$ is the functional ${\begin{alignedat}{4}{\overline {f}}:\,&H&&\to \,&&\mathbb {F} \\&h&&\mapsto \,&&{\overline {f(h)}}.\\\end{alignedat}}$

This assignment is most useful when $\mathbb {F} =\mathbb {C}$ because if $\mathbb {F} =\mathbb {R}$ then $f={\overline {f}}$ and the assignment $f\mapsto {\overline {f}}$ reduces down to theidentity map.

The assignment $f\mapsto {\overline {f}}$ defines an antilinearbijective correspondence from the set of

all functionals (resp. all linear functionals, all continuous linear functionals

H^{*}

) on

H, {\displaystyle H,}

onto the set of

all functionals (resp. allantilinear functionals, all continuousantilinear functionals

{\overline {H}}^{*}

) on

H . {\displaystyle H.}

Mathematics vs. physics notations and definitions of inner product

[edit]

TheHilbert space $H {\displaystyle H}$ has an associatedinner product $H\times H\to \mathbb {F}$ valued in $H {\displaystyle H}$ 's underlying scalar field $\mathbb {F}$ that is linear in one coordinate and antilinear in the other (as specified below).If $H {\displaystyle H}$ is a complex Hilbert space ( $\mathbb {F} =\mathbb {C}$ ), then there is a crucial difference between the notations prevailing in mathematics versus physics, regarding which of the two variables is linear.However, for real Hilbert spaces ( $\mathbb {F} =\mathbb {R}$ ), the inner product is asymmetric map that is linear in each coordinate (bilinear), so there can be no such confusion.

Inmathematics, the inner product on a Hilbert space $H {\displaystyle H}$ is often denoted by $\left\langle \cdot \,,\cdot \right\rangle$ or $\left\langle \cdot \,,\cdot \right\rangle _{H}$ while inphysics, thebra–ket notation $\left\langle \cdot \mid \cdot \right\rangle$ or $\left\langle \cdot \mid \cdot \right\rangle _{H}$ is typically used. In this article, these two notations will be related by the equality:

$\left\langle x,y\right\rangle :=\left\langle y\mid x\right\rangle \quad {\text{ for all }}x,y\in H.$ These have the following properties:

The map $\left\langle \cdot \,,\cdot \right\rangle$ islinear in its first coordinate; equivalently, the map $\left\langle \cdot \mid \cdot \right\rangle$ islinear in its second coordinate. That is, for fixed $y\in H,$ the map $\left\langle \,y\mid \cdot \,\right\rangle =\left\langle \,\cdot \,,y\,\right\rangle :H\to \mathbb {F}$ with ${\textstyle h\mapsto \left\langle \,y\mid h\,\right\rangle =\left\langle \,h,y\,\right\rangle }$ is a linear functional on $H . {\displaystyle H.}$ This linear functional is continuous, so $\left\langle \,y\mid \cdot \,\right\rangle =\left\langle \,\cdot ,y\,\right\rangle \in H^{*}.$
The map $\left\langle \cdot \,,\cdot \right\rangle$ isantilinear in itssecond coordinate; equivalently, the map $\left\langle \cdot \mid \cdot \right\rangle$ isantilinear in itsfirst coordinate. That is, for fixed $y\in H,$ the map $\left\langle \,\cdot \mid y\,\right\rangle =\left\langle \,y,\cdot \,\right\rangle :H\to \mathbb {F}$ with ${\textstyle h\mapsto \left\langle \,h\mid y\,\right\rangle =\left\langle \,y,h\,\right\rangle }$ is an antilinear functional on $H . {\displaystyle H.}$ This antilinear functional is continuous, so $\left\langle \,\cdot \mid y\,\right\rangle =\left\langle \,y,\cdot \,\right\rangle \in {\overline {H}}^{*}.$

In computations, one must consistently use either the mathematics notation $\left\langle \cdot \,,\cdot \right\rangle$ , which is (linear, antilinear); or the physics notation $\left\langle \cdot \mid \cdot \right\rangle$ , which is (antilinear | linear).

Canonical norm and inner product on the dual space and anti-dual space

[edit]

If $x=y$ then $\langle \,x\mid x\,\rangle =\langle \,x,x\,\rangle$ is a non-negative real number and the map $\|x\|:={\sqrt {\langle x,x\rangle }}={\sqrt {\langle x\mid x\rangle }}$

defines acanonical norm on $H {\displaystyle H}$ that makes $H {\displaystyle H}$ into anormed space.^[1] As with all normed spaces, the (continuous) dual space $H^{*}$ carries a canonical norm, called thedual norm, that is defined by^[1] $\|f\|_{H^{*}}~:=~\sup _{\|x\|\leq 1,x\in H}|f(x)|\quad {\text{ for every }}f\in H^{*}.$

The canonical norm on the (continuous)anti-dual space ${\overline {H}}^{*},$ denoted by $\|f\|_{{\overline {H}}^{*}},$ is defined by using this same equation:^[1] $\|f\|_{{\overline {H}}^{*}}~:=~\sup _{\|x\|\leq 1,x\in H}|f(x)|\quad {\text{ for every }}f\in {\overline {H}}^{*}.$

This canonical norm on $H^{*}$ satisfies theparallelogram law, which means that thepolarization identity can be used to define acanonical inner product on $H^{*},$ which this article will denote by the notations $\left\langle f,g\right\rangle _{H^{*}}:=\left\langle g\mid f\right\rangle _{H^{*}},$ where this inner product turns $H^{*}$ into a Hilbert space. There are now two ways of defining a norm on $H^{*}:$ the norm induced by this inner product (that is, the norm defined by $f\mapsto {\sqrt {\left\langle f,f\right\rangle _{H^{*}}}}$ ) and the usualdual norm (defined as the supremum over the closedunit ball). These norms are the same; explicitly, this means that the following holds for every $f\in H^{*}:$ $\sup _{\|x\|\leq 1,x\in H}|f(x)|=\|f\|_{H^{*}}~=~{\sqrt {\langle f,f\rangle _{H^{*}}}}~=~{\sqrt {\langle f\mid f\rangle _{H^{*}}}}.$

As will be described later, the Riesz representation theorem can be used to give an equivalent definition of the canonical norm and the canonical inner product on $H^{*}.$

The same equations that were used above can also be used to define a norm and inner product on $H {\displaystyle H}$ 'santi-dual space ${\overline {H}}^{*}.$ ^[1]

Canonical isometry between the dual and antidual

Thecomplex conjugate ${\overline {f}}$ of a functional $f, {\displaystyle f,}$ which was defined above, satisfies $\|f\|_{H^{*}}~=~\left\|{\overline {f}}\right\|_{{\overline {H}}^{*}}\quad {\text{ and }}\quad \left\|{\overline {g}}\right\|_{H^{*}}~=~\|g\|_{{\overline {H}}^{*}}$ for every $f\in H^{*}$ and every $g\in {\overline {H}}^{*}.$ This says exactly that the canonical antilinearbijection defined by ${\begin{alignedat}{4}\operatorname {Cong} :\;&&H^{*}&&\;\to \;&{\overline {H}}^{*}\\[0.3ex]&&f&&\;\mapsto \;&{\overline {f}}\\\end{alignedat}}$ as well as its inverse $\operatorname {Cong} ^{-1}~:~{\overline {H}}^{*}\to H^{*}$ are antilinearisometries and consequently alsohomeomorphisms. The inner products on the dual space $H^{*}$ and the anti-dual space ${\overline {H}}^{*},$ denoted respectively by $\langle \,\cdot \,,\,\cdot \,\rangle _{H^{*}}$ and $\langle \,\cdot \,,\,\cdot \,\rangle _{{\overline {H}}^{*}},$ are related by $\langle \,{\overline {f}}\,|\,{\overline {g}}\,\rangle _{{\overline {H}}^{*}}={\overline {\langle \,f\,|\,g\,\rangle _{H^{*}}}}=\langle \,g\,|\,f\,\rangle _{H^{*}}\qquad {\text{ for all }}f,g\in H^{*}$ and $\langle \,{\overline {f}}\,|\,{\overline {g}}\,\rangle _{H^{*}}={\overline {\langle \,f\,|\,g\,\rangle _{{\overline {H}}^{*}}}}=\langle \,g\,|\,f\,\rangle _{{\overline {H}}^{*}}\qquad {\text{ for all }}f,g\in {\overline {H}}^{*}.$

If $\mathbb {F} =\mathbb {R}$ then $H^{*}={\overline {H}}^{*}$ and this canonical map $\operatorname {Cong} :H^{*}\to {\overline {H}}^{*}$ reduces down to the identity map.

Riesz representation theorem

[edit]

Two vectors $x {\displaystyle x}$ and $y {\displaystyle y}$ areorthogonal if $\langle x,y\rangle =0,$ which happens if and only if $\|y\|\leq \|y+sx\|$ for all scalars $s . {\displaystyle s.}$ ^[2] Theorthogonal complement of a subset $X\subseteq H$ is $X^{\bot }:=\{\,y\in H:\langle y,x\rangle =0{\text{ for all }}x\in X\,\},$ which is always aclosed vector subspace of $H . {\displaystyle H.}$ TheHilbert projection theorem guarantees that for anynonempty closedconvex subset $C {\displaystyle C}$ of aHilbert space there exists a unique vector $m\in C$ such that $\|m\|=\inf _{c\in C}\|c\|;$ that is, $m\in C$ is the (unique)global minimum point of the function $C\to [0,\infty )$ defined by $c\mapsto \|c\|.$

Statement

[edit]

Riesz representation theorem—Let $H {\displaystyle H}$ be aHilbert space whoseinner product $\left\langle x,y\right\rangle$ is linear in itsfirst argument andantilinear in its second argument and let $\langle y\mid x\rangle :=\langle x,y\rangle$ be the corresponding physics notation. For every continuous linear functional $\varphi \in H^{*},$ there exists a unique vector $f_{\varphi }\in H,$ called theRiesz representation of $\varphi ,$ such that^[3] $\varphi (x)=\left\langle x,f_{\varphi }\right\rangle =\left\langle f_{\varphi }\mid x\right\rangle \quad {\text{ for all }}x\in H.$

Importantly forcomplex Hilbert spaces, $f_{\varphi }$ is always located in theantilinear coordinate of the inner product.^{[note 1]}

Furthermore, the length of the representation vector is equal to the norm of the functional: $\left\|f_{\varphi }\right\|_{H}=\|\varphi \|_{H^{*}},$ and $f_{\varphi }$ is the unique vector $f_{\varphi }\in \left(\ker \varphi \right)^{\bot }$ with $\varphi \left(f_{\varphi }\right)=\|\varphi \|^{2}.$ It is also the unique element of minimum norm in $C:=\varphi ^{-1}\left(\|\varphi \|^{2}\right)$ ; that is to say, $f_{\varphi }$ is the unique element of $C {\displaystyle C}$ satisfying $\left\|f_{\varphi }\right\|=\inf _{c\in C}\|c\|.$ Moreover, any non-zero $q\in (\ker \varphi )^{\bot }$ can be written as $q=\left(\|q\|^{2}/\,{\overline {\varphi (q)}}\right)\ f_{\varphi }.$

Corollary—Thecanonical map from $H {\displaystyle H}$ into its dual $H^{*}$ ^[1] is theinjective antilinear operator isometry^{[note 2]}^[1] ${\begin{alignedat}{4}\Phi :\;&&H&&\;\to \;&H^{*}\\[0.3ex]&&y&&\;\mapsto \;&\langle \,\cdot \,,y\rangle =\langle y|\,\cdot \,\rangle \\\end{alignedat}}$ The Riesz representation theorem states that this map issurjective (and thusbijective) when $H {\displaystyle H}$ is complete and that its inverse is thebijective isometric antilinear isomorphism ${\begin{alignedat}{4}\Phi ^{-1}:\;&&H^{*}&&\;\to \;&H\\[0.3ex]&&\varphi &&\;\mapsto \;&f_{\varphi }\\\end{alignedat}}.$ Consequently,every continuous linear functional on the Hilbert space $H {\displaystyle H}$ can be written uniquely in the form $\langle y\,|\,\cdot \,\rangle$ ^[1] where $\|\langle y\,|\cdot \rangle \|_{H^{*}}=\|y\|_{H}$ for every $y\in H.$ The assignment $y\mapsto \langle y,\cdot \rangle =\langle \cdot \,|\,y\rangle$ can also be viewed as a bijectivelinear isometry $H\to {\overline {H}}^{*}$ into theanti-dual space of $H, {\displaystyle H,}$ ^[1] which is thecomplex conjugate vector space of thecontinuous dual space $H^{*}.$

The inner products on $H {\displaystyle H}$ and $H^{*}$ are related by $\left\langle \Phi h,\Phi k\right\rangle _{H^{*}}={\overline {\langle h,k\rangle }}_{H}=\langle k,h\rangle _{H}\quad {\text{ for all }}h,k\in H$ and similarly, $\left\langle \Phi ^{-1}\varphi ,\Phi ^{-1}\psi \right\rangle _{H}={\overline {\langle \varphi ,\psi \rangle }}_{H^{*}}=\left\langle \psi ,\varphi \right\rangle _{H^{*}}\quad {\text{ for all }}\varphi ,\psi \in H^{*}.$

The set $C:=\varphi ^{-1}\left(\|\varphi \|^{2}\right)$ satisfies $C=f_{\varphi }+\ker \varphi$ and $C-f_{\varphi }=\ker \varphi$ so when $f_{\varphi }\neq 0$ then $C {\displaystyle C}$ can be interpreted as being theaffine hyperplane^{[note 3]} that is parallel to the vector subspace $\ker \varphi$ and contains $f_{\varphi }.$

For $y\in H,$ the physics notation for the functional $\Phi (y)\in H^{*}$ is the bra $\langle y|,$ where explicitly this means that $\langle y|:=\Phi (y),$ which complements the ket notation $|y\rangle$ defined by $|y\rangle :=y.$ In the mathematical treatment ofquantum mechanics, the theorem can be seen as a justification for the popularbra–ket notation. The theorem says that, every bra $\langle \psi \,|$ has a corresponding ket $|\,\psi \rangle ,$ and the latter is unique.

Historically, the theorem is often attributed simultaneously toRiesz andFréchet in 1907 (see references).

Proof^[4]

Let $\mathbb {F}$ denote the underlying scalar field of $H . {\displaystyle H.}$

Proof of norm formula:

Fix $y\in H.$ Define $\Lambda :H\to \mathbb {F}$ by $\Lambda (z):=\langle \,y\,|\,z\,\rangle ,$ which is a linear functional on $H {\displaystyle H}$ since $z {\displaystyle z}$ is in the linear argument. By theCauchy–Schwarz inequality, $|\Lambda (z)|=|\langle \,y\,|\,z\,\rangle |\leq \|y\|\|z\|$ which shows that $\Lambda$ is bounded (equivalently,continuous) and that $\|\Lambda \|\leq \|y\|.$ It remains to show that $\|y\|\leq \|\Lambda \|.$ By using $y {\displaystyle y}$ in place of $z, {\displaystyle z,}$ it follows that $\|y\|^{2}=\langle \,y\,|\,y\,\rangle =\Lambda (y)=|\Lambda (y)|\leq \|\Lambda \|\|y\|$ (the equality $\Lambda (y)=|\Lambda (y)|$ holds because $\Lambda (y)=\|y\|^{2}\geq 0$ is real and non-negative). Thus that $\|\Lambda \|=\|y\|.$ $\blacksquare$

The proof above did not use the fact that $H {\displaystyle H}$ iscomplete, which shows that the formula for the norm $\|\langle \,y\,|\,\cdot \,\rangle \|_{H^{*}}=\|y\|_{H}$ holds more generally for allinner product spaces.

Proof that a Riesz representation of $\varphi$ is unique:

Suppose $f,g\in H$ are such that $\varphi (z)=\langle \,f\,|\,z\,\rangle$ and $\varphi (z)=\langle \,g\,|\,z\,\rangle$ for all $z\in H.$ Then $\langle \,f-g\,|\,z\,\rangle =\langle \,f\,|\,z\,\rangle -\langle \,g\,|\,z\,\rangle =\varphi (z)-\varphi (z)=0\quad {\text{ for all }}z\in H$ which shows that $\Lambda :=\langle \,f-g\,|\,\cdot \,\rangle$ is the constant $0 {\displaystyle 0}$ linear functional. Consequently $0=\|\langle \,f-g\,|\,\cdot \,\rangle \|=\|f-g\|,$ which implies that $f-g=0.$ $\blacksquare$

Proof that a vector $f_{\varphi }$ representing $\varphi$ exists:

Let $K:=\ker \varphi :=\{m\in H:\varphi (m)=0\}.$ If $K=H$ (or equivalently, if $\varphi =0$ ) then taking $f_{\varphi }:=0$ completes the proof so assume that $K\neq H$ and $\varphi \neq 0.$ The continuity of $\varphi$ implies that $K {\displaystyle K}$ is a closed subspace of $H {\displaystyle H}$ (because $K=\varphi ^{-1}(\{0\})$ and $\{0\}$ is a closed subset of $\mathbb {F}$ ). Let $K^{\bot }:=\{v\in H~:~\langle \,v\,|\,k\,\rangle =0~{\text{ for all }}k\in K\}$ denote theorthogonal complement of $K {\displaystyle K}$ in $H . {\displaystyle H.}$ Because $K {\displaystyle K}$ is closed and $H {\displaystyle H}$ is a Hilbert space,^{[note 4]} $H {\displaystyle H}$ can be written as the direct sum $H=K\oplus K^{\bot }$ ^{[note 5]} (a proof of this is given in the article on theHilbert projection theorem). Because $K\neq H,$ there exists some non-zero $p\in K^{\bot }.$ For any $h\in H,$ $\varphi [(\varphi h)p-(\varphi p)h]~=~\varphi [(\varphi h)p]-\varphi [(\varphi p)h]~=~(\varphi h)\varphi p-(\varphi p)\varphi h=0,$ which shows that $(\varphi h)p-(\varphi p)h~\in ~\ker \varphi =K,$ where now $p\in K^{\bot }$ implies $0=\langle \,p\,|\,(\varphi h)p-(\varphi p)h\,\rangle ~=~\langle \,p\,|\,(\varphi h)p\,\rangle -\langle \,p\,|\,(\varphi p)h\,\rangle ~=~(\varphi h)\langle \,p\,|\,p\,\rangle -(\varphi p)\langle \,p\,|\,h\,\rangle .$ Solving for $\varphi h$ shows that $\varphi h={\frac {(\varphi p)\langle \,p\,|\,h\,\rangle }{\|p\|^{2}}}=\left\langle \,{\frac {\overline {\varphi p}}{\|p\|^{2}}}p\,{\Bigg |}\,h\,\right\rangle \quad {\text{ for every }}h\in H,$ which proves that the vector $f_{\varphi }:={\frac {\overline {\varphi p}}{\|p\|^{2}}}p$ satisfies $\varphi h=\langle \,f_{\varphi }\,|\,h\,\rangle {\text{ for every }}h\in H.$

Applying the norm formula that was proved above with $y:=f_{\varphi }$ shows that $\|\varphi \|_{H^{*}}=\left\|\left\langle \,f_{\varphi }\,|\,\cdot \,\right\rangle \right\|_{H^{*}}=\left\|f_{\varphi }\right\|_{H}.$ Also, the vector $u:={\frac {p}{\|p\|}}$ has norm $\|u\|=1$ and satisfies $f_{\varphi }:={\overline {\varphi (u)}}u.$ $\blacksquare$

It can now be deduced that $K^{\bot }$ is $1 {\displaystyle 1}$ -dimensional when $\varphi \neq 0.$ Let $q\in K^{\bot }$ be any non-zero vector. Replacing $p {\displaystyle p}$ with $q {\displaystyle q}$ in the proof above shows that the vector $g:={\frac {\overline {\varphi q}}{\|q\|^{2}}}q$ satisfies $\varphi (h)=\langle \,g\,|\,h\,\rangle$ for every $h\in H.$ The uniqueness of the (non-zero) vector $f_{\varphi }$ representing $\varphi$ implies that $f_{\varphi }=g,$ which in turn implies that ${\overline {\varphi q}}\neq 0$ and $q={\frac {\|q\|^{2}}{\overline {\varphi q}}}f_{\varphi }.$ Thus every vector in $K^{\bot }$ is a scalar multiple of $f_{\varphi }.$ $\blacksquare$

The formulas for the inner products follow from thepolarization identity.

Observations

[edit]

If $\varphi \in H^{*}$ then $\varphi \left(f_{\varphi }\right)=\left\langle f_{\varphi },f_{\varphi }\right\rangle =\left\|f_{\varphi }\right\|^{2}=\|\varphi \|^{2}.$ So in particular, $\varphi \left(f_{\varphi }\right)\geq 0$ is always real and furthermore, $\varphi \left(f_{\varphi }\right)=0$ if and only if $f_{\varphi }=0$ if and only if $\varphi =0.$

Linear functionals as affine hyperplanes

A non-trivial continuous linear functional $\varphi$ is often interpreted geometrically by identifying it with the affine hyperplane $A:=\varphi ^{-1}(1)$ (the kernel $\ker \varphi =\varphi ^{-1}(0)$ is also often visualized alongside $A:=\varphi ^{-1}(1)$ although knowing $A {\displaystyle A}$ is enough to reconstruct $\ker \varphi$ because if $A=\varnothing$ then $\ker \varphi =H$ and otherwise $\ker \varphi =A-A$ ). In particular, the norm of $\varphi$ should somehow be interpretable as the "norm of the hyperplane $A {\displaystyle A}$ ". When $\varphi \neq 0$ then the Riesz representation theorem provides such an interpretation of $\|\varphi \|$ in terms of the affine hyperplane^{[note 3]} $A:=\varphi ^{-1}(1)$ as follows: using the notation from the theorem's statement, from $\|\varphi \|^{2}\neq 0$ it follows that $C:=\varphi ^{-1}\left(\|\varphi \|^{2}\right)=\|\varphi \|^{2}\varphi ^{-1}(1)=\|\varphi \|^{2}A$ and so $\|\varphi \|=\left\|f_{\varphi }\right\|=\inf _{c\in C}\|c\|$ implies $\|\varphi \|=\inf _{a\in A}\|\varphi \|^{2}\|a\|$ and thus $\|\varphi \|={\frac {1}{\inf _{a\in A}\|a\|}}.$ This can also be seen by applying theHilbert projection theorem to $A {\displaystyle A}$ and concluding that the global minimum point of the map $A\to [0,\infty )$ defined by $a\mapsto \|a\|$ is ${\frac {f_{\varphi }}{\|\varphi \|^{2}}}\in A.$ The formulas ${\frac {1}{\inf _{a\in A}\|a\|}}=\sup _{a\in A}{\frac {1}{\|a\|}}$ provide the promised interpretation of the linear functional's norm $\|\varphi \|$ entirely in terms of its associated affine hyperplane $A=\varphi ^{-1}(1)$ (because with this formula, knowing only theset $A {\displaystyle A}$ is enough to describe the norm of its associated linearfunctional). Defining ${\frac {1}{\infty }}:=0,$ theinfimum formula $\|\varphi \|={\frac {1}{\inf _{a\in \varphi ^{-1}(1)}\|a\|}}$ will also hold when $\varphi =0.$ When the supremum is taken in $\mathbb {R}$ (as is typically assumed), then the supremum of the empty set is $\sup \varnothing =-\infty$ but if the supremum is taken in the non-negative reals $[0,\infty )$ (which is theimage/range of the norm $\|\,\cdot \,\|$ when $\dim H>0$ ) then this supremum is instead $\sup \varnothing =0,$ in which case the supremum formula $\|\varphi \|=\sup _{a\in \varphi ^{-1}(1)}{\frac {1}{\|a\|}}$ will also hold when $\varphi =0$ (although the atypical equality $\sup \varnothing =0$ is usually unexpected and so risks causing confusion).

Constructions of the representing vector

[edit]

Using the notation from the theorem above, several ways of constructing $f_{\varphi }$ from $\varphi \in H^{*}$ are now described. If $\varphi =0$ then $f_{\varphi }:=0$ ; in other words, $f_{0}=0.$

This special case of $\varphi =0$ is henceforth assumed to be known, which is why some of the constructions given below start by assuming $\varphi \neq 0.$

Orthogonal complement of kernel

If $\varphi \neq 0$ then for any $0\neq u\in (\ker \varphi )^{\bot },$ $f_{\varphi }:={\frac {{\overline {\varphi (u)}}u}{\|u\|^{2}}}.$

If $u\in (\ker \varphi )^{\bot }$ is aunit vector (meaning $\|u\|=1$ ) then $f_{\varphi }:={\overline {\varphi (u)}}u$ (this is true even if $\varphi =0$ because in this case $f_{\varphi }={\overline {\varphi (u)}}u={\overline {0}}u=0$ ). If $u {\displaystyle u}$ is a unit vector satisfying the above condition then the same is true of $-u,$ which is also a unit vector in $(\ker \varphi )^{\bot }.$ However, ${\overline {\varphi (-u)}}(-u)={\overline {\varphi (u)}}u=f_{\varphi }$ so both these vectors result in the same $f_{\varphi }.$

Orthogonal projection onto kernel

If $x\in H$ is such that $\varphi (x)\neq 0$ and if $x_{K}$ is theorthogonal projection of $x {\displaystyle x}$ onto $\ker \varphi$ then^{[proof 1]} $f_{\varphi }={\frac {\|\varphi \|^{2}}{\varphi (x)}}\left(x-x_{K}\right).$

Orthonormal basis

Given anorthonormal basis $\left\{e_{i}\right\}_{i\in I}$ of $H {\displaystyle H}$ and a continuous linear functional $\varphi \in H^{*},$ the vector $f_{\varphi }\in H$ can be constructed uniquely by $f_{\varphi }=\sum _{i\in I}{\overline {\varphi \left(e_{i}\right)}}e_{i}$ where all but at most countably many $\varphi \left(e_{i}\right)$ will be equal to $0 {\displaystyle 0}$ and where the value of $f_{\varphi }$ does not actually depend on choice of orthonormal basis (that is, using any other orthonormal basis for $H {\displaystyle H}$ will result in the same vector). If $y\in H$ is written as $y=\sum _{i\in I}a_{i}e_{i}$ then $\varphi (y)=\sum _{i\in I}\varphi \left(e_{i}\right)a_{i}=\langle f_{\varphi }|y\rangle$ and $\left\|f_{\varphi }\right\|^{2}=\varphi \left(f_{\varphi }\right)=\sum _{i\in I}\varphi \left(e_{i}\right){\overline {\varphi \left(e_{i}\right)}}=\sum _{i\in I}\left|\varphi \left(e_{i}\right)\right|^{2}=\|\varphi \|^{2}.$

If the orthonormal basis $\left\{e_{i}\right\}_{i\in I}=\left\{e_{i}\right\}_{i=1}^{\infty }$ is a sequence then this becomes $f_{\varphi }={\overline {\varphi \left(e_{1}\right)}}e_{1}+{\overline {\varphi \left(e_{2}\right)}}e_{2}+\cdots$ and if $y\in H$ is written as $y=\sum _{i\in I}a_{i}e_{i}=a_{1}e_{1}+a_{2}e_{2}+\cdots$ then $\varphi (y)=\varphi \left(e_{1}\right)a_{1}+\varphi \left(e_{2}\right)a_{2}+\cdots =\langle f_{\varphi }|y\rangle .$

Induced linear map into anti-dual

The map defined by placing $y {\displaystyle y}$ into thelinear coordinate of the inner product and letting the variable $h\in H$ vary over theantilinear coordinate results in anantilinear functional: $\langle \,\cdot \mid y\,\rangle =\langle \,y,\cdot \,\rangle :H\to \mathbb {F} \quad {\text{ defined by }}\quad h\mapsto \langle \,h\mid y\,\rangle =\langle \,y,h\,\rangle .$

This map is an element of ${\overline {H}}^{*},$ which is the continuousanti-dual space of $H . {\displaystyle H.}$ Thecanonical map from $H {\displaystyle H}$ into its anti-dual ${\overline {H}}^{*}$ ^[1] is thelinear operator ${\begin{alignedat}{4}\operatorname {In} _{H}^{{\overline {H}}^{*}}:\;&&H&&\;\to \;&{\overline {H}}^{*}\\[0.3ex]&&y&&\;\mapsto \;&\langle \,\cdot \mid y\,\rangle =\langle \,y,\cdot \,\rangle \\[0.3ex]\end{alignedat}}$ which is also aninjective isometry.^[1] TheFundamental theorem of Hilbert spaces, which is related to Riesz representation theorem, states that this map is surjective (and thusbijective). Consequently, every antilinear functional on $H {\displaystyle H}$ can be written (uniquely) in this form.^[1]

If $\operatorname {Cong} :H^{*}\to {\overline {H}}^{*}$ is the canonicalantilinear bijective isometry $f\mapsto {\overline {f}}$ that was defined above, then the following equality holds: $\operatorname {Cong} ~\circ ~\operatorname {In} _{H}^{H^{*}}~=~\operatorname {In} _{H}^{{\overline {H}}^{*}}.$

Extending the bra–ket notation to bras and kets

[edit]

Main article:Bra–ket notation

Let $\left(H,\langle \cdot ,\cdot \rangle _{H}\right)$ be a Hilbert space and as before, let $\langle y\,|\,x\rangle _{H}:=\langle x,y\rangle _{H}.$ Let ${\begin{alignedat}{4}\Phi :\;&&H&&\;\to \;&H^{*}\\[0.3ex]&&g&&\;\mapsto \;&\left\langle \,g\mid \cdot \,\right\rangle _{H}=\left\langle \,\cdot ,g\,\right\rangle _{H}\\\end{alignedat}}$ which is a bijective antilinear isometry that satisfies $(\Phi h)g=\langle h\mid g\rangle _{H}=\langle g,h\rangle _{H}\quad {\text{ for all }}g,h\in H.$

Bras

Given a vector $h\in H,$ let $\langle h\,|$ denote the continuous linear functional $\Phi h$ ; that is, $\langle h\,|~:=~\Phi h$ so that this functional $\langle h\,|$ is defined by $g\mapsto \left\langle \,h\mid g\,\right\rangle _{H}.$ This map was denoted by $\left\langle h\mid \cdot \,\right\rangle$ earlier in this article.

The assignment $h\mapsto \langle h|$ is just the isometric antilinear isomorphism $\Phi ~:~H\to H^{*},$ which is why $~\langle cg+h\,|~=~{\overline {c}}\langle g\mid ~+~\langle h\,|~$ holds for all $g,h\in H$ and all scalars $c . {\displaystyle c.}$ The result of plugging some given $g\in H$ into the functional $\langle h\,|$ is the scalar $\langle h\,|\,g\rangle _{H}=\langle g,h\rangle _{H},$ which may be denoted by $\langle h\mid g\rangle .$ ^{[note 6]}

Bra of a linear functional

Given a continuous linear functional $\psi \in H^{*},$ let $\langle \psi \mid$ denote the vector $\Phi ^{-1}\psi \in H$ ; that is, $\langle \psi \mid ~:=~\Phi ^{-1}\psi .$

The assignment $\psi \mapsto \langle \psi \mid$ is just the isometric antilinear isomorphism $\Phi ^{-1}~:~H^{*}\to H,$ which is why $~\langle c\psi +\phi \mid ~=~{\overline {c}}\langle \psi \mid ~+~\langle \phi \mid ~$ holds for all $\phi ,\psi \in H^{*}$ and all scalars $c . {\displaystyle c.}$

The defining condition of the vector $\langle \psi |\in H$ is the technically correct but unsightly equality $\left\langle \,\langle \psi \mid \,\mid g\right\rangle _{H}~=~\psi g\quad {\text{ for all }}g\in H,$ which is why the notation $\left\langle \psi \mid g\right\rangle$ is used in place of $\left\langle \,\langle \psi \mid \,\mid g\right\rangle _{H}=\left\langle g,\,\langle \psi \mid \right\rangle _{H}.$ With this notation, the defining condition becomes $\left\langle \psi \mid g\right\rangle ~=~\psi g\quad {\text{ for all }}g\in H.$

Kets

For any given vector $g\in H,$ the notation $|\,g\rangle$ is used to denote $g {\displaystyle g}$ ; that is, $\mid g\rangle :=g.$

The assignment $g\mapsto |\,g\rangle$ is just the identity map $\operatorname {Id} _{H}:H\to H,$ which is why $~\mid cg+h\rangle ~=~c\mid g\rangle ~+~\mid h\rangle ~$ holds for all $g,h\in H$ and all scalars $c . {\displaystyle c.}$

The notation $\langle h\mid g\rangle$ and $\langle \psi \mid g\rangle$ is used in place of $\left\langle h\mid \,\mid g\rangle \,\right\rangle _{H}~=~\left\langle \mid g\rangle ,h\right\rangle _{H}$ and $\left\langle \psi \mid \,\mid g\rangle \,\right\rangle _{H}~=~\left\langle g,\,\langle \psi \mid \right\rangle _{H},$ respectively. As expected, $~\langle \psi \mid g\rangle =\psi g~$ and $~\langle h\mid g\rangle ~$ really is just the scalar $~\langle h\mid g\rangle _{H}~=~\langle g,h\rangle _{H}.$

Adjoints and transposes

[edit]

Let $A:H\to Z$ be acontinuous linear operator betweenHilbert spaces $\left(H,\langle \cdot ,\cdot \rangle _{H}\right)$ and $\left(Z,\langle \cdot ,\cdot \rangle _{Z}\right).$ As before, let $\langle y\mid x\rangle _{H}:=\langle x,y\rangle _{H}$ and $\langle y\mid x\rangle _{Z}:=\langle x,y\rangle _{Z}.$

Denote by ${\begin{alignedat}{4}\Phi _{H}:\;&&H&&\;\to \;&H^{*}\\[0.3ex]&&g&&\;\mapsto \;&\langle \,g\mid \cdot \,\rangle _{H}\\\end{alignedat}}\quad {\text{ and }}\quad {\begin{alignedat}{4}\Phi _{Z}:\;&&Z&&\;\to \;&Z^{*}\\[0.3ex]&&y&&\;\mapsto \;&\langle \,y\mid \cdot \,\rangle _{Z}\\\end{alignedat}}$ the usual bijective antilinear isometries that satisfy: $\left(\Phi _{H}g\right)h=\langle g\mid h\rangle _{H}\quad {\text{ for all }}g,h\in H\qquad {\text{ and }}\qquad \left(\Phi _{Z}y\right)z=\langle y\mid z\rangle _{Z}\quad {\text{ for all }}y,z\in Z.$

Definition of the adjoint

[edit]

Main articles:Hermitian adjoint andConjugate transpose

For every $z\in Z,$ the scalar-valued map $\langle z\mid A(\cdot )\rangle _{Z}$ ^{[note 7]} on $H {\displaystyle H}$ defined by $h\mapsto \langle z\mid Ah\rangle _{Z}=\langle Ah,z\rangle _{Z}$

is a continuous linear functional on $H {\displaystyle H}$ and so by the Riesz representation theorem, there exists a unique vector in $H, {\displaystyle H,}$ denoted by $A^{*}z,$ such that $\langle z\mid A(\cdot )\rangle _{Z}=\left\langle A^{*}z\mid \cdot \,\right\rangle _{H},$ or equivalently, such that $\langle z\mid Ah\rangle _{Z}=\left\langle A^{*}z\mid h\right\rangle _{H}\quad {\text{ for all }}h\in H.$

The assignment $z\mapsto A^{*}z$ thus induces a function $A^{*}:Z\to H$ called theadjoint of $A:H\to Z$ whose defining condition is $\langle z\mid Ah\rangle _{Z}=\left\langle A^{*}z\mid h\right\rangle _{H}\quad {\text{ for all }}h\in H{\text{ and all }}z\in Z.$ The adjoint $A^{*}:Z\to H$ is necessarily acontinuous (equivalently, abounded)linear operator.

If $H {\displaystyle H}$ is finite dimensional with the standard inner product and if $M {\displaystyle M}$ is thetransformation matrix of $A {\displaystyle A}$ with respect to the standard orthonormal basis then $M {\displaystyle M}$ 'sconjugate transpose ${\overline {M^{\operatorname {T} }}}$ is the transformation matrix of the adjoint $A^{*}.$

Adjoints are transposes

[edit]

Main article:Transpose of a linear map

Descriptions of self-adjoint, normal, and unitary operators

[edit]

Assume $Z=H$ and let $\Phi :=\Phi _{H}=\Phi _{Z}.$ Let $A:H\to H$ be a continuous (that is, bounded) linear operator.

Whether or not $A:H\to H$ isself-adjoint,normal, orunitary depends entirely on whether or not $A {\displaystyle A}$ satisfies certain defining conditions related to its adjoint, which was shown by (Adjoint-transpose) to essentially be just the transpose ${}^{t}A:H^{*}\to H^{*}.$ Because the transpose of $A {\displaystyle A}$ is a map between continuous linear functionals, these defining conditions can consequently be re-expressed entirely in terms of linear functionals, as the remainder of subsection will now describe in detail. The linear functionals that are involved are the simplest possible continuous linear functionals on $H {\displaystyle H}$ that can be defined entirely in terms of $A, {\displaystyle A,}$ the inner product $\langle \,\cdot \mid \cdot \,\rangle$ on $H, {\displaystyle H,}$ and some given vector $h\in H.$ Specifically, these are $\left\langle Ah\mid \cdot \,\right\rangle$ and $\langle h\mid A(\cdot )\rangle$ ^{[note 7]} where $\left\langle Ah\mid \cdot \,\right\rangle =\Phi (Ah)=(\Phi \circ A)h\quad {\text{ and }}\quad \langle h\mid A(\cdot )\rangle =\left({}^{t}A\circ \Phi \right)h.$

Self-adjoint operators

A continuous linear operator $A:H\to H$ is calledself-adjoint if it is equal to its own adjoint; that is, if $A=A^{*}.$ Using (Adjoint-transpose), this happens if and only if: $\Phi \circ A={}^{t}A\circ \Phi$ where this equality can be rewritten in the following two equivalent forms: $A=\Phi ^{-1}\circ {}^{t}A\circ \Phi \quad {\text{ or }}\quad {}^{t}A=\Phi \circ A\circ \Phi ^{-1}.$

Unraveling notation and definitions produces the following characterization of self-adjoint operators in terms of the aforementioned continuous linear functionals: $A {\displaystyle A}$ is self-adjoint if and only if for all $z\in H,$ the linear functional $\langle z\mid A(\cdot )\rangle$ ^{[note 7]} is equal to the linear functional $\langle Az\mid \cdot \,\rangle$ ; that is, if and only if

\langle z\mid A(\cdot )\rangle =\langle Az\mid \cdot \,\rangle \quad {\text{ for all }}z\in H

Self-adjointness functionals

where if bra-ket notation is used, this is $\langle z\mid A~=~\langle Az\mid \quad {\text{ for all }}z\in H.$

Normal operators

See also:Normal operator andNormal matrix

A continuous linear operator $A:H\to H$ is callednormal if $AA^{*}=A^{*}A,$ which happens if and only if for all $z,h\in H,$ $\left\langle AA^{*}z\mid h\right\rangle =\left\langle A^{*}Az\mid h\right\rangle .$

Using (Adjoint-transpose) and unraveling notation and definitions produces^{[proof 2]} the following characterization of normal operators in terms of inner products of continuous linear functionals: $A {\displaystyle A}$ is a normal operator if and only if

\left\langle \,\langle Ah\mid \cdot \,\rangle \mid \langle Az\mid \cdot \,\rangle \,\right\rangle _{H^{*}}~=~\left\langle \,\langle h|A(\cdot )\rangle \mid \langle z\mid A(\cdot )\rangle \,\right\rangle _{H^{*}}\quad {\text{ for all }}z,h\in H

Normality functionals

where the left hand side is also equal to ${\overline {\langle Ah\mid Az\rangle }}_{H}=\langle Az\mid Ah\rangle _{H}.$ The left hand side of this characterization involvesonly linear functionals of the form $\langle Ah\mid \cdot \,\rangle$ while the right hand side involvesonly linear functions of the form $\langle h\mid A(\cdot )\rangle$ (defined as above^{[note 7]}). So in plain English, characterization (Normality functionals) says that an operator isnormal when the inner product of any two linear functions of the first form is equal to the inner product of their second form (using the same vectors $z,h\in H$ for both forms).In other words, if it happens to be the case (and when $A {\displaystyle A}$ is injective or self-adjoint, it is) that the assignment of linear functionals $\langle Ah\mid \cdot \,\rangle ~\mapsto ~\langle h|A(\cdot )\rangle$ is well-defined (or alternatively, if $\langle h|A(\cdot )\rangle ~\mapsto ~\langle Ah\mid \cdot \,\rangle$ is well-defined) where $h {\displaystyle h}$ ranges over $H, {\displaystyle H,}$ then $A {\displaystyle A}$ is a normal operator if and only if this assignment preserves the inner product on $H^{*}.$

The fact that every self-adjoint bounded linear operator is normal follows readily by direct substitution of $A^{*}=A$ into either side of $A^{*}A=AA^{*}.$ This same fact also follows immediately from the direct substitution of the equalities (Self-adjointness functionals) into either side of (Normality functionals).

Alternatively, for a complex Hilbert space, the continuous linear operator $A {\displaystyle A}$ is a normal operator if and only if $\|Az\|=\left\|A^{*}z\right\|$ for every $z\in H,$ ^[2] which happens if and only if $\|Az\|_{H}=\|\langle z\,|\,A(\cdot )\rangle \|_{H^{*}}\quad {\text{ for every }}z\in H.$

Unitary operators

An invertible bounded linear operator $A:H\to H$ is said to beunitary if its inverse is its adjoint: $A^{-1}=A^{*}.$ By using (Adjoint-transpose), this is seen to be equivalent to $\Phi \circ A^{-1}={}^{t}A\circ \Phi .$ Unraveling notation and definitions, it follows that $A {\displaystyle A}$ is unitary if and only if $\langle A^{-1}z\mid \cdot \,\rangle =\langle z\mid A(\cdot )\rangle \quad {\text{ for all }}z\in H.$

The fact that a bounded invertible linear operator $A:H\to H$ is unitary if and only if $A^{*}A=\operatorname {Id} _{H}$ (or equivalently, ${}^{t}A\circ \Phi \circ A=\Phi$ ) produces another (well-known) characterization: an invertible bounded linear map $A {\displaystyle A}$ is unitary if and only if $\langle Az\mid A(\cdot )\,\rangle =\langle z\mid \cdot \,\rangle \quad {\text{ for all }}z\in H.$

Because $A:H\to H$ is invertible (and so in particular a bijection), this is also true of the transpose ${}^{t}A:H^{*}\to H^{*}.$ This fact also allows the vector $z\in H$ in the above characterizations to be replaced with $A z {\displaystyle Az}$ or $A^{-1}z,$ thereby producing many more equalities. Similarly, $\,\cdot \,$ can be replaced with $A(\cdot )$ or $A^{-1}(\cdot ).$

Citations

[edit]

^^a ^b ^c ^d ^e ^f ^g ^h ⁱ ^j ^k ^lTrèves 2006, pp. 112–123.
^^a ^b ^cRudin 1991, pp. 306–312.
^Roman 2008, p. 351 Theorem 13.32
^Rudin 1991, pp. 307−309.
^Rudin 1991, pp. 92–115.

Notes

[edit]

^If $\mathbb {F} =\mathbb {R}$ then the inner product will be symmetric so it does not matter which coordinate of the inner product the element $y {\displaystyle y}$ is placed into because the same map will result. But if $\mathbb {F} =\mathbb {C}$ then except for the constant $0 {\displaystyle 0}$ map,antilinear functionals on $H {\displaystyle H}$ are completely distinct fromlinear functionals on $H, {\displaystyle H,}$ which makes the coordinate that $y {\displaystyle y}$ is placed into isvery important. For a non-zero $y\in H$ to induce alinear functional (rather than anantilinear functional), $y {\displaystyle y}$ must be placed into theantilinear coordinate of the inner product. If it is incorrectly placed into the linear coordinate instead of the antilinear coordinate then the resulting map will be the antilinear map $h\mapsto \langle y,h\rangle =\langle h\mid y\rangle ,$ which isnot a linear functional on $H {\displaystyle H}$ and so it willnot be an element of the continuous dual space $H^{*}.$
^This means that for all vectors $y\in H:$ (1) $\Phi :H\to H^{*}$ isinjective. (2) Thenorms of $y {\displaystyle y}$ and $\Phi (y)$ are the same: $\|\Phi (y)\|=\|y\|.$ (3) $\Phi$ is anadditive map, meaning that $\Phi (x+y)=\Phi (x)+\Phi (y)$ for all $x,y\in H.$ (4) $\Phi$ isconjugate homogeneous: $\Phi (sy)={\overline {s}}\Phi (y)$ for all scalars $s . {\displaystyle s.}$ (5) $\Phi$ isreal homogeneous: $\Phi (ry)=r\Phi (y)$ for all real numbers $r\in \mathbb {R} .$
^^a ^bThis footnote explains how to define - using only $H {\displaystyle H}$ 's operations - addition and scalar multiplication of affine hyperplanes so that these operations correspond to addition and scalar multiplication of linear functionals. Let $H {\displaystyle H}$ be any vector space and let $H^{\#}$ denote itsalgebraic dual space. Let ${\mathcal {A}}:=\left\{\varphi ^{-1}(1):\varphi \in H^{\#}\right\}$ and let $\,{\hat {\cdot }}\,$ and $\,{\hat {+}}\,$ denote the (unique) vector space operations on ${\mathcal {A}}$ that make the bijection $I:H^{\#}\to {\mathcal {A}}$ defined by $\varphi \mapsto \varphi ^{-1}(1)$ into avector space isomorphism. Note that $\varphi ^{-1}(1)=\varnothing$ if and only if $\varphi =0,$ so $\varnothing$ is the additive identity of $\left({\mathcal {A}},{\hat {+}},{\hat {\cdot }}\right)$ (because this is true of $I^{-1}(\varnothing )=0$ in $H^{\#}$ and $I {\displaystyle I}$ is a vector space isomorphism). For every $A\in {\mathcal {A}},$ let $\ker A=H$ if $A=\varnothing$ and let $\ker A=A-A$ otherwise; if $A=I(\varphi )=\varphi ^{-1}(1)$ then $\ker A=\ker \varphi$ so this definition is consistent with the usual definition of the kernel of a linear functional. Say that $A,B\in {\mathcal {A}}$ areparallel if $\ker A=\ker B,$ where if $A {\displaystyle A}$ and $B {\displaystyle B}$ are not empty then this happens if and only if the linear functionals $I^{-1}(A)$ and $I^{-1}(B)$ are non-zero scalar multiples of each other. The vector space operations on the vector space of affine hyperplanes ${\mathcal {A}}$ are now described in a way that involvesonly the vector space operations on $H {\displaystyle H}$ ; this results in an interpretation of the vector space operations on the algebraic dual space $H^{\#}$ that is entirely in terms of affine hyperplanes. Fix hyperplanes $A,B\in {\mathcal {A}}.$ If $s {\displaystyle s}$ is a scalar then $s{\hat {\cdot }}A=\left\{h\in H:sh\in A\right\}.$ Describing the operation $A{\hat {+}}B$ in terms of only the sets $A=\varphi ^{-1}(1)$ and $B=\psi ^{-1}(1)$ is more complicated because by definition, $A{\hat {+}}B=I(\varphi ){\hat {+}}I(\psi ):=I(\varphi +\psi )=(\varphi +\psi )^{-1}(1).$ If $A=\varnothing$ (respectively, if $B=\varnothing$ ) then $A{\hat {+}}B$ is equal to $B {\displaystyle B}$ (resp. is equal to $A {\displaystyle A}$ ) so assume $A\neq \varnothing$ and $B\neq \varnothing .$ The hyperplanes $A {\displaystyle A}$ and $B {\displaystyle B}$ are parallel if and only if there exists some scalar $r {\displaystyle r}$ (necessarily non-0) such that $A=rB,$ in which case $A{\hat {+}}B=\left\{h\in H:(1+r)h\in B\right\};$ this can optionally be subdivided into two cases: if $r=-1$ (which happens if and only if the linear functionals $I^{-1}(A)$ and $I^{-1}(B)$ are negatives of each) then $A{\hat {+}}B=\varnothing$ while if $r\neq -1$ then $A{\hat {+}}B={\frac {1}{1+r}}B={\frac {r}{1+r}}A.$ Finally, assume now that $\ker A\neq \ker B.$ Then $A{\hat {+}}B$ is the unique affine hyperplane containing both $A\cap \ker B$ and $B\cap \ker A$ as subsets; explicitly, $\ker \left(A{\hat {+}}B\right)=\operatorname {span} \left(A\cap \ker B-B\cap \ker A\right)$ and $A{\hat {+}}B=A\cap \ker B+\ker \left(A{\hat {+}}B\right)=B\cap \ker A+\ker \left(A{\hat {+}}B\right).$ To see why this formula for $A{\hat {+}}B$ should hold, consider $H:=\mathbb {R} ^{3},$ $A:=\varphi ^{-1}(1),$ and $B:=\psi ^{-1}(1),$ where $\varphi (x,y,z):=x$ and $\psi (x,y,z):=x+y$ (or alternatively, $\psi (x,y,z):=y$ ). Then by definition, $A{\hat {+}}B:=(\varphi +\psi )^{-1}(1)$ and $\ker \left(A{\hat {+}}B\right):=(\varphi +\psi )^{-1}(0).$ Now $A\cap \ker B~=~\varphi ^{-1}(1)\cap \psi ^{-1}(0)~\subseteq ~(\varphi +\psi )^{-1}(1)$ is an affine subspace ofcodimension $2 {\displaystyle 2}$ in $H {\displaystyle H}$ (it is equal to a translation of the $z {\displaystyle z}$ -axis $\{(0,0)\}\times \mathbb {R}$ ). The same is true of $B\cap \ker A.$ Plotting an $x {\displaystyle x}$ - $y {\displaystyle y}$ -plane cross section (that is, setting $z=$ constant) of the sets $\ker A,\ker B,A$ and $B {\displaystyle B}$ (each of which will be plotted as a line), the set $(\varphi +\psi )^{-1}(1)$ will then be plotted as the (unique) line passing through the $A\cap \ker B$ and $B\cap \ker A$ (which will be plotted as two distinct points) while $(\varphi +\psi )^{-1}(0)=\ker \left(A{\hat {+}}B\right)$ will be plotted the line through the origin that is parallel to $A{\hat {+}}B=(\varphi +\psi )^{-1}(1).$ The above formulas for $\ker \left(A{\hat {+}}B\right):=(\varphi +\psi )^{-1}(0)$ and $A{\hat {+}}B:=(\varphi +\psi )^{-1}(1)$ follow naturally from the plot and they also hold in general.
^Showing that there is a non-zero vector $v {\displaystyle v}$ in $K^{\bot }$ relies on the continuity of $\phi$ and theCauchy completeness of $H . {\displaystyle H.}$ This is the only place in the proof in which these properties are used.
^Technically, $H=K\oplus K^{\bot }$ means that the addition map $K\times K^{\bot }\to H$ defined by $(k,p)\mapsto k+p$ is a surjectivelinear isomorphism andhomeomorphism. See the article oncomplemented subspaces for more details.
^The usual notation for plugging an element $g {\displaystyle g}$ into a linear map $F {\displaystyle F}$ is $F(g)$ and sometimes $F g . {\displaystyle Fg.}$ Replacing $F {\displaystyle F}$ with $\langle h\mid :=~\Phi h$ produces $\langle h\mid (g)$ or $\langle h\mid g,$ which is unsightly (despite being consistent with the usual notation used with functions). Consequently, the symbol $\,\rangle \,$ is appended to the end, so that the notation $\langle h\mid g\rangle$ is used instead to denote this value $(\Phi h)g.$
^^a ^b ^c ^d ^eThe notation $\left\langle z\mid A(\cdot )\right\rangle _{Z}$ denotes the continuous linear functional defined by $g\mapsto \left\langle z\mid Ag\right\rangle _{Z}.$

Proofs

^This is because $x_{K}=x-{\frac {\left\langle x,f_{\varphi }\right\rangle }{\left\|f_{\varphi }\right\|^{2}}}f_{\varphi }.$ Now use $\left\|f_{\varphi }\right\|^{2}=\|\varphi \|^{2}$ and $\left\langle x,f_{\varphi }\right\rangle =\varphi (x)$ and solve for $f_{\varphi }.$ $\blacksquare$
^ $\left\langle A^{*}Az\mid h\right\rangle =\left\langle \,Az\mid Ah\,\right\rangle _{H}=\left\langle \,\Phi Ah\mid \Phi Az\,\right\rangle _{H^{*}}$ where $\Phi Ah:=\left\langle Ah\mid \cdot \,\right\rangle$ and $\Phi Az:=\left\langle Az\mid \cdot \,\right\rangle .$ By definition of the adjoint, $\left\langle A^{*}h\mid A^{*}z\,\right\rangle =\left\langle h\mid AA^{*}z\,\right\rangle$ so taking the complex conjugate of both sides proves that $\left\langle AA^{*}z\mid h\right\rangle =\left\langle A^{*}z\mid A^{*}h\right\rangle .$ From $A^{*}=\Phi ^{-1}\circ {}^{t}A\circ \Phi ,$ it follows that $\left\langle AA^{*}z\,|\,h\right\rangle _{H}=\left\langle A^{*}z\mid A^{*}h\right\rangle _{H}=\left\langle \Phi ^{-1}\circ {}^{t}A\circ \Phi z\mid \Phi ^{-1}\circ {}^{t}A\circ \Phi h\right\rangle _{H}=\left\langle \,{}^{t}A\circ \Phi h\mid {}^{t}A\circ \Phi z\right\rangle _{H^{*}}$ where $\left({}^{t}A\circ \Phi \right)h=\langle h\,|\,A(\cdot )\rangle$ and $\left({}^{t}A\circ \Phi \right)z=\langle z\,|\,A(\cdot )\rangle .$ $\blacksquare$

Bibliography

[edit]

Bachman, George; Narici, Lawrence (2000).Functional Analysis (Second ed.). Mineola, New York: Dover Publications.ISBN 978-0486402512.OCLC 829157984.
Fréchet, M. (1907)."Sur les ensembles de fonctions et les opérations linéaires".Les Comptes rendus de l'Académie des sciences (in French).144: 1414–1416.
P. HalmosMeasure Theory, D. van Nostrand and Co., 1950.
P. Halmos,A Hilbert Space Problem Book, Springer, New York 1982(problem 3 contains version for vector spaces with coordinate systems).
Riesz, F. (1907)."Sur une espèce de géométrie analytique des systèmes de fonctions sommables".Comptes rendus de l'Académie des Sciences (in French).144: 1409–1411.
Riesz, F. (1909)."Sur les opérations fonctionnelles linéaires".Comptes rendus de l'Académie des Sciences (in French).149: 974–977.

Roman, Stephen (2008),Advanced Linear Algebra,Graduate Texts in Mathematics (Third ed.), Springer,ISBN 978-0-387-72828-5
Rudin, Walter (1991).Functional Analysis. International Series in Pure and Applied Mathematics. Vol. 8 (Second ed.). New York, NY:McGraw-Hill Science/Engineering/Math.ISBN 978-0-07-054236-5.OCLC 21163277.
Walter Rudin,Real and Complex Analysis, McGraw-Hill, 1966,ISBN 0-07-100276-6.
Trèves, François (2006) [1967].Topological Vector Spaces, Distributions and Kernels. Mineola, N.Y.: Dover Publications.ISBN 978-0-486-45352-1.OCLC 853623322.

Functional analysis (topics –glossary)

Spaces

Banach Besov Fréchet Hilbert Hölder Nuclear Orlicz Schwartz Sobolev Topological vector
Properties	Barrelled Complete Dual (Algebraic /Topological) Locally convex Reflexive Separable

Theorems

Operators

Algebras

Open problems

Applications

Advanced topics

Category

v t e Hilbert spaces
Basic concepts	Adjoint Inner product andL-semi-inner product Hilbert space andPrehilbert space Orthogonal complement Orthonormal basis
Main results	Bessel's inequality Cauchy–Schwarz inequality Riesz representation
Other results	Hilbert projection theorem Parseval's identity Polarization identity (Parallelogram law)
Maps	Compact operator on Hilbert space Densely defined Hermitian form Hilbert–Schmidt Normal Self-adjoint Sesquilinear form Trace class Unitary
Examples	Cⁿ(K) withK compact &n<∞ Segal–BargmannF