Inmathematics,Hurwitz's theorem is atheorem ofAdolf Hurwitz, published posthumously in 1923, solving theHurwitz problem forfinite-dimensionalunitalrealnon-associative algebras endowed with a nondegeneratepositive-definitequadratic form. The theorem states that if the quadratic form defines ahomomorphism into the positive real numbers on the non-zero part of the algebra, then the algebra must beisomorphic to thereal numbers, thecomplex numbers, thequaternions, or theoctonions, and that there are no other possibilities. Such algebras, sometimes calledHurwitz algebras, are examples ofcomposition algebras.
The theory of composition algebras has subsequently been generalized to arbitrary quadratic forms and arbitraryfields.[1] Hurwitz's theorem implies that multiplicative formulas for sums of squares can only occur in 1, 2, 4 and 8 dimensions, a result originallyproved by Hurwitz in 1898. It is a special case of theHurwitz problem, solved also inRadon (1922). Subsequent proofs of the restrictions on the dimension have been given byEckmann (1943) using therepresentation theory of finite groups and byLee (1948) andChevalley (1954) usingClifford algebras. Hurwitz's theorem has been applied inalgebraic topology to problems onvector fields on spheres and thehomotopy groups of theclassical groups[2] and inquantum mechanics to theclassification of simple Jordan algebras.[3]
AHurwitz algebra orcomposition algebra is a finite-dimensional not necessarily associative algebraA with identity endowed with a nondegenerate quadratic formq such thatq(a b) =q(a) q(b). If the underlying coefficient field is the reals andq is positive-definite, so that(a, b) =1/2[q(a +b) −q(a) −q(b)] is aninner product, thenA is called aEuclidean Hurwitz algebra or (finite-dimensional)normed division algebra.[4]
IfA is a Euclidean Hurwitz algebra anda is inA, define theinvolution and right and left multiplication operators by
Evidently the involution has period two and preserves the inner product and norm. These operators have the following properties:
These properties are proved starting from the polarized version of the identity(ab, ab) = (a, a)(b, b):
Settingb = 1 ord = 1 yieldsL(a*) =L(a)* andR(c*) =R(c)*.
HenceRe(ab) = (ab, 1)1 = (a, b*)1 = (ba, 1)1 = Re(ba).
SimilarlyRe (ab)c = ((ab)c,1)1 = (ab, c*)1 = (b, a* c*)1 = (bc,a*)1 = (a(bc),1)1 = Rea(bc).
Hence((ab)*, c) = (ab,c*) = (b,a*c*) = (1,b*(a*c*)) = (1, (b*a*)c*) = (b*a*,c), so that(ab)* =b*a*.
By the polarized identity‖a‖2 (c, d) = (ac, ad) = (a* (ac), d) soL(a*) L(a) =L(‖a‖2). Applied to 1 this givesa*a = ‖a‖2 1. Replacinga bya* gives the other identity.
Substituting the formula fora* inL(a*) L(a) =L(a*a) givesL(a)2 =L(a2). The formulaR(a2) =R(a)2 is proved analogously.
It is routine to check that the real numbersR, the complex numbersC and the quaternionsH are examples of associative Euclidean Hurwitz algebras with their standard norms and involutions. There are moreover natural inclusionsR ⊂C ⊂H.
Analysing such an inclusion leads to theCayley–Dickson construction, formalized byA.A. Albert. LetA be a Euclidean Hurwitz algebra andB a proper unital subalgebra, so a Euclidean Hurwitz algebra in its own right. Pick aunit vectorj inA orthogonal toB. Since(j, 1) = 0, it follows thatj* = −j and hencej2 = −1. LetC be subalgebra generated byB andj. It is unital and is again a Euclidean Hurwitz algebra. It satisfies the followingCayley–Dickson multiplication laws:
B andBj are orthogonal, sincej is orthogonal toB. Ifa is inB, thenj a =a* j, since by orthogonal0 = 2(j, a*) =ja −a*j. The formula for the involution follows. To show thatB ⊕B j is closed under multiplicationBj =jB. SinceBj is orthogonal to 1,(bj)* = −bj.
Imposing the multiplicativity of the norm onC fora +bj andc +dj gives:
which leads to
Henced(ac) = (da)c, so thatBmust be associative.
This analysis applies to the inclusion ofR inC andC inH. TakingO =H ⊕H with the product and inner product above gives a noncommutative nonassociative algebra generated byJ = (0, 1). This recovers the usual definition of theoctonions orCayley numbers. IfA is a Euclidean algebra, it must containR. If it is strictly larger thanR, the argument above shows that it containsC. If it is larger thanC, it containsH. If it is larger still, it must containO. But there the process must stop, becauseO is not associative. In factH is not commutative anda(bj) = (ba)j ≠ (ab)j inO.[5]
Theorem. The only Euclidean Hurwitz algebras are the real numbers, the complex numbers, the quaternions and the octonions.
The proofs ofLee (1948) andChevalley (1954) useClifford algebras to show that the dimensionN ofA must be 1, 2, 4 or 8. In fact the operatorsL(a) with(a, 1) = 0 satisfyL(a)2 = −‖a‖2 and so form a real Clifford algebra. Ifa is a unit vector, thenL(a) is skew-adjoint with square−I. SoN must be eithereven or 1 (in which caseA contains no unit vectors orthogonal to 1). The real Clifford algebra and itscomplexification act on the complexification ofA, anN-dimensional complex space. IfN is even,N − 1 isodd, so the Clifford algebra has exactly two complexirreducible representations of dimension2N/2 − 1. So thispower of 2 must divideN. It is easy to see that this impliesN can only be 1, 2, 4 or 8.
The proof ofEckmann (1943) uses therepresentation theory offinite groups, or the projective representation theory ofelementary abelian 2-groups, known to be equivalent to the representation theory of real Clifford algebras. Indeed, taking anorthonormal basisei of theorthogonal complement of 1 gives rise to operatorsUi =L(ei)satisfying
This is aprojective representation of a direct product ofN − 1groups oforder 2. (N is assumed to be greater than 1.) The operatorsUi by construction are skew-symmetric and orthogonal. In fact Eckmann constructed operators of this type in a slightly different but equivalent way. It is in fact the method originally followed inHurwitz (1923).[6] Assume that there is a composition law for two forms
wherezi is bilinear inx andy. Thus
where thematrixT(x) = (aij) is linear inx. The relations above are equivalent to
Writing
the relations become
Now setVi = (TN)tTi. ThusVN =I and theV1, ... , VN − 1 are skew-adjoint, orthogonal satisfying exactly the same relations as theUi's:
SinceVi is anorthogonal matrix with square−I on a realvector space,N is even.
LetG be the finite group generated by elementsvi such that
whereε iscentral of order 2. Thecommutator subgroup[G, G] is just formed of 1 andε. IfN is odd this coincides with thecenter while ifN is even the center has order 4 with extra elementsγ =v1...vN − 1 andεγ. Ifg inG is not in the center itsconjugacy class is exactlyg andεg. Thus there are2N − 1 + 1 conjugacy classes forN odd and2N − 1 + 2 forN even.G has|G / [G, G] | = 2N − 1 1-dimensional complex representations. The total number of irreducible complex representations is the number of conjugacy classes. So sinceN is even, there are two further irreducible complex representations. Since the sum of the squares of the dimensions equals|G| and the dimensions divide|G|, the two irreducibles must have dimension2(N − 2)/2. WhenN is even, there are two and their dimension must divide the order of the group, so is a power of two, so they must both have dimension2(N − 2)/2. The space on which theVi's act can be complexified. It will have complex dimensionN. It breaks up into some of complex irreducible representations ofG, all having dimension2(N − 2)/2. In particular this dimension is≤N, soN is less than or equal to 8. IfN = 6, the dimension is 4, which does not divide 6. SoN can only be 1, 2, 4 or 8.
LetA be a Euclidean Hurwitz algebra and letMn(A) be the algebra ofn-by-n matrices overA. It is a unital nonassociative algebra with an involution given by
The traceTr(X) is defined as the sum of the diagonal elements ofX and the real-valued trace byTrR(X ) = Re Tr(X ). The real-valued trace satisfies:
These are immediate consequences of the known identities forn = 1.
InA define theassociator by
It is trilinear and vanishes identically ifA is associative. SinceA is analternative algebra[a, a, b] = 0 and[b, a, a] = 0. Polarizing it follows that the associator is antisymmetric in its three entries. Furthermore, ifa,b orc lie inR then[a, b, c] = 0. These facts imply thatM3(A) has certain commutation properties. In fact ifX is a matrix inM3(A) with real entries on the diagonal then
witha inA. In fact ifY = [X, X 2], then
Since the diagonal entries ofX are real, the off-diagonal entries ofY vanish. Each diagonalentry ofY is a sum of two associators involving only off diagonal terms ofX. Since the associators are invariant undercyclic permutations, the diagonal entries ofY are all equal.
LetHn(A) be the space of self-adjoint elements inMn(A) with productX ∘Y =1/2(X Y +Y X) and inner product(X, Y ) = TrR(X Y ).
Theorem.Hn(A) is aEuclidean Jordan algebra ifA is associative (the real numbers, complex numbers or quaternions) andn ≥ 3 or ifA is nonassociative (the octonions) andn = 3.
Theexceptional Jordan algebraH3(O) is called theAlbert algebra afterA.A. Albert.
To check thatHn(A) satisfies the axioms for a Euclidean Jordan algebra, the real trace defines asymmetric bilinear form with(X, X) = Σ ‖xij‖2. So it is an inner product. It satisfies the associativity property(Z∘X, Y ) = (X, Z∘Y ) because of the properties of the real trace. The main axiom to check is the Jordan condition for the operatorsL(X) defined byL(X)Y =X ∘Y:
This is easy to check whenA is associative, sinceMn(A) is an associative algebra so a Jordan algebra withX ∘Y =1/2(X Y +Y X). WhenA =O andn = 3 a special argument is required, one of the shortest being due toFreudenthal (1951).[7]
In fact ifT is inH3(O) withTr T = 0, then
defines a skew-adjoint derivation ofH3(O). Indeed,
so that
Polarizing yields:
SettingZ = 1 shows thatD is skew-adjoint. The derivation propertyD(X ∘Y) =D(X)∘Y +X∘D(Y) follows by this and the associativity property of the inner product in the identity above.
WithA andn as in the statement of the theorem, letK be the group ofautomorphisms ofE =Hn(A) leaving invariant the inner product. It is a closed subgroup ofO(E) so acompactLie group. ItsLie algebra consists of skew-adjoint derivations.Freudenthal (1951) showed that givenX inE there is an automorphismk inK such thatk(X) is adiagonal matrix. (By self-adjointness the diagonal entries will be real.) Freudenthal's diagonalization theorem immediately implies the Jordan condition, since Jordan products by real diagonal matrices commute onMn(A) for any non-associative algebraA.
To prove the diagonalization theorem, takeX inE. By compactnessk can be chosen inK minimizing the sums of the squares of the norms of the off-diagonal terms ofk(X ). SinceK preserves the sums of all the squares, this is equivalent to maximizing the sums of the squares of the norms of the diagonal terms ofk(X ). ReplacingX byk X, it can be assumed that the maximum is attained atX. Since thesymmetric groupSn, acting by permuting the coordinates, lies inK, ifX is not diagonal, it can be supposed thatx12 and its adjointx21 are non-zero. LetT be the skew-adjoint matrix with(2, 1) entrya,(1, 2) entry−a* and 0 elsewhere and letD be the derivation adT ofE. Letkt = exp tD inK. Then only the first two diagonal entries inX(t) =ktX differ from those ofX. The diagonal entries are real. The derivative ofx11(t) att = 0 is the(1, 1) coordinate of[T, X], i.e.a* x21 +x12 a = 2(x21, a). This derivative is non-zero ifa =x21. On the other hand, the groupkt preserves the real-valued trace. Since it can only changex11 andx22, it preserves their sum. However, on the linex +y = constant,x2 +y2 has no local maximum (only a global minimum), a contradiction. HenceX must be diagonal.