Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Determinant

From Wikipedia, the free encyclopedia
In mathematics, invariant of square matrices
This article is about mathematics. For determinants in epidemiology, seeRisk factor. For determinants in immunology, seeEpitope.

Inmathematics, thedeterminant is ascalar-valuedfunction of the entries of asquare matrix. The determinant of a matrixA is commonly denoteddet(A),detA, or|A|. Its value characterizes some properties of the matrix and thelinear map represented, on a givenbasis, by the matrix. In particular, the determinant is nonzeroif and only if the matrix isinvertible and the corresponding linear map is anisomorphism. However, if the determinant is zero, the matrix is referred to as singular, meaning it does not have an inverse.

The determinant is completely determined by the two following properties: the determinant of a product of matrices is the product of their determinants, and the determinant of atriangular matrix is the product of its diagonal entries.

The determinant of a2 × 2 matrix is

|abcd|=adbc,{\displaystyle {\begin{vmatrix}a&b\\c&d\end{vmatrix}}=ad-bc,}

and the determinant of a3 × 3 matrix is

|abcdefghi|=aei+bfg+cdhcegbdiafh.{\displaystyle {\begin{vmatrix}a&b&c\\d&e&f\\g&h&i\end{vmatrix}}=aei+bfg+cdh-ceg-bdi-afh.}

The determinant of ann ×n matrix can be defined in several equivalent ways, the most common beingLeibniz formula, which expresses the determinant as a sum ofn!{\displaystyle n!} (thefactorial ofn) signed products of matrix entries. It can be computed by theLaplace expansion, which expresses the determinant as alinear combination of determinants of submatrices, or withGaussian elimination, which allows computing arow echelon form with the same determinant, equal to the product of the diagonal entries of the row echelon form.

Determinants can also be defined by some of their properties. Namely, the determinant is the unique function defined on then ×n matrices that has the four following properties:

  1. The determinant of theidentity matrix is1.
  2. The exchange of two rows multiplies the determinant by−1.
  3. Multiplying a row by a number multiplies the determinant by this number.
  4. Adding a multiple of one row to another row does not change the determinant.

The above properties relating to rows (properties 2–4) may be replaced by the corresponding statements with respect to columns.

The determinant is invariant undermatrix similarity. This implies that, given a linearendomorphism of afinite-dimensional vector space, the determinant of the matrix that represents it on abasis does not depend on the chosen basis. This allows defining thedeterminant of a linear endomorphism, which does not depend on the choice of acoordinate system.

Determinants occur throughout mathematics. For example, a matrix is often used to represent thecoefficients in asystem of linear equations, and determinants can be used to solve these equations (Cramer's rule), although other methods of solution are computationally much more efficient. Determinants are used for defining thecharacteristic polynomial of a square matrix, whose roots are theeigenvalues. Ingeometry, the signedn-dimensionalvolume of an-dimensionalparallelepiped is expressed by a determinant, and the determinant of a linearendomorphism determines how theorientation and then-dimensional volume are transformed under the endomorphism. This is used incalculus withexterior differential forms and theJacobian determinant, in particular forchanges of variables inmultiple integrals.

Two by two matrices

[edit]

The determinant of a2 × 2 matrix(abcd){\displaystyle {\begin{pmatrix}a&b\\c&d\end{pmatrix}}} is denoted either by "det" or by vertical bars around the matrix, and is defined as

det(abcd)=|abcd|=adbc.{\displaystyle \det {\begin{pmatrix}a&b\\c&d\end{pmatrix}}={\begin{vmatrix}a&b\\c&d\end{vmatrix}}=ad-bc.}

For example,

det(3714)=|3714|=(3(4))(71)=19.{\displaystyle \det {\begin{pmatrix}3&7\\1&-4\end{pmatrix}}={\begin{vmatrix}3&7\\1&{-4}\end{vmatrix}}=(3\cdot (-4))-(7\cdot 1)=-19.}

First properties

[edit]

The determinant has several key properties that can be proved by direct evaluation of the definition for2×2{\displaystyle 2\times 2}-matrices, and that continue to hold for determinants of larger matrices. They are as follows:[1] first, the determinant of theidentity matrix(1001){\displaystyle {\begin{pmatrix}1&0\\0&1\end{pmatrix}}} is 1.Second, the determinant is zero if two rows are the same:

|abab|=abba=0.{\displaystyle {\begin{vmatrix}a&b\\a&b\end{vmatrix}}=ab-ba=0.}

This holds similarly if the two columns are the same. Moreover,

|ab+bcd+d|=a(d+d)(b+b)c=|abcd|+|abcd|.{\displaystyle {\begin{vmatrix}a&b+b'\\c&d+d'\end{vmatrix}}=a(d+d')-(b+b')c={\begin{vmatrix}a&b\\c&d\end{vmatrix}}+{\begin{vmatrix}a&b'\\c&d'\end{vmatrix}}.}

Finally, if any column is multiplied by some numberr{\displaystyle r} (i.e., all entries in that column are multiplied by that number), the determinant is also multiplied by that number:

|rabrcd|=radbrc=r(adbc)=r|abcd|.{\displaystyle {\begin{vmatrix}r\cdot a&b\\r\cdot c&d\end{vmatrix}}=rad-brc=r(ad-bc)=r\cdot {\begin{vmatrix}a&b\\c&d\end{vmatrix}}.}

Geometric meaning

[edit]
The area of the parallelogram is the absolute value of the determinant of the matrix formed by the vectors representing the parallelogram's sides.

If the matrix entries are real numbers, the matrixA represents thelinear map that maps thebasis vectors to the columns ofA. The images of the basis vectors form aparallelogram that represents the image of theunit square under the mapping. The parallelogram defined by the columns of the above matrix is the one with vertices at(0, 0),(a,c),(a +b,c +d), and(b,d), as shown in the accompanying diagram.

The absolute value ofadbc is the area of the parallelogram, and thus represents the scale factor by which areas are transformed byA.

The absolute value of the determinant together with the sign becomes thesigned area of the parallelogram. The signed area is the same as the usualarea, except that it is negative when the angle from the first to the second vector defining the parallelogram turns in a clockwise direction (which is opposite to the direction one would get for theidentity matrix).

To show thatadbc is the signed area, one may consider a matrix containing two vectorsu ≡ (a,c) andv ≡ (b,d) representing the parallelogram's sides. The signed area can be expressed as|u| |v| sinθ for the angleθ between the vectors, which is simply base times height, the length of one vector times the perpendicular component of the other. Due to thesine this already is the signed area, yet it may be expressed more conveniently using thecosine of the complementary angle to a perpendicular vector, e.g.u = (−c,a), so that|u| |v| cosθ′ becomes the signed area in question, which can be determined by the pattern of thescalar product to be equal toadbc according to the following equations:

Signed area=|u||v|sinθ=|u||v|cosθ=(ca)(bd)=adbc.{\displaystyle {\text{Signed area}}=|{\boldsymbol {u}}|\,|{\boldsymbol {v}}|\,\sin \,\theta =\left|{\boldsymbol {u}}^{\perp }\right|\,\left|{\boldsymbol {v}}\right|\,\cos \,\theta '={\begin{pmatrix}-c\\a\end{pmatrix}}\cdot {\begin{pmatrix}b\\d\end{pmatrix}}=ad-bc.}

Thus the determinant gives the area scale factor and the orientation induced by the mapping represented byA. When the determinant is equal to one, the linear mapping defined by the matrix preserves area and orientation.

The volume of thisparallelepiped is the absolute value of the determinant of the matrix formed by the columns constructed from the vectors r1, r2, and r3.

If ann ×nreal matrixA is written in terms of its column vectorsA=[a1a2an]{\displaystyle A=\left[{\begin{array}{c|c|c|c}\mathbf {a} _{1}&\mathbf {a} _{2}&\cdots &\mathbf {a} _{n}\end{array}}\right]}, then

A(100)=a1,A(010)=a2,,A(001)=an.{\displaystyle A{\begin{pmatrix}1\\0\\\vdots \\0\end{pmatrix}}=\mathbf {a} _{1},\quad A{\begin{pmatrix}0\\1\\\vdots \\0\end{pmatrix}}=\mathbf {a} _{2},\quad \ldots ,\quad A{\begin{pmatrix}0\\0\\\vdots \\1\end{pmatrix}}=\mathbf {a} _{n}.}

This means thatA{\displaystyle A} maps the unitn-cube to then-dimensionalparallelotope defined by the vectorsa1,a2,,an,{\displaystyle \mathbf {a} _{1},\mathbf {a} _{2},\ldots ,\mathbf {a} _{n},} the regionP={c1a1++cnan0ci1 i}{\displaystyle P=\left\{c_{1}\mathbf {a} _{1}+\cdots +c_{n}\mathbf {a} _{n}\mid 0\leq c_{i}\leq 1\ \forall i\right\}} ({\textstyle \forall } stands for "for all" as alogical symbol.)

The determinant gives thesignedn-dimensional volume of this parallelotope,det(A)=±vol(P),{\displaystyle \det(A)=\pm {\text{vol}}(P),} and hence describes more generally then-dimensional volume scale factor of thelinear transformation produced byA.[2] (The sign shows whether the transformation preserves or reversesorientation.) In particular, if the determinant is zero, then this parallelotope has volume zero and is not fullyn-dimensional, which indicates that the dimension of the image ofA is less thann. Thismeans thatA produces a linear transformation which is neitheronto norone-to-one, and so is not invertible.

Definition

[edit]

LetA be asquare matrix withn rows andn columns, so that it can be written as

A=[a1,1a1,2a1,na2,1a2,2a2,nan,1an,2an,n].{\displaystyle A={\begin{bmatrix}a_{1,1}&a_{1,2}&\cdots &a_{1,n}\\a_{2,1}&a_{2,2}&\cdots &a_{2,n}\\\vdots &\vdots &\ddots &\vdots \\a_{n,1}&a_{n,2}&\cdots &a_{n,n}\end{bmatrix}}.}

The entriesa1,1{\displaystyle a_{1,1}} etc. are, for many purposes, real or complex numbers. As discussed below, the determinant is also defined for matrices whose entries are in acommutative ring.

The determinant ofA is denoted by det(A), or it can be denoted directly in terms of the matrix entries by writing enclosing bars instead of brackets:

|a1,1a1,2a1,na2,1a2,2a2,nan,1an,2an,n|.{\displaystyle {\begin{vmatrix}a_{1,1}&a_{1,2}&\cdots &a_{1,n}\\a_{2,1}&a_{2,2}&\cdots &a_{2,n}\\\vdots &\vdots &\ddots &\vdots \\a_{n,1}&a_{n,2}&\cdots &a_{n,n}\end{vmatrix}}.}

There are various equivalent ways to define the determinant of a square matrixA, i.e. one with the same number of rows and columns: the determinant can be defined via theLeibniz formula, an explicit formula involving sums of products of certain entries of the matrix. The determinant can also be characterized as the unique function depending on the entries of the matrix satisfying certain properties. This approach can also be used to compute determinants by simplifying the matrices in question.

Leibniz formula

[edit]
Main article:Leibniz formula for determinants

3 × 3 matrices

[edit]

TheLeibniz formula for the determinant of a3 × 3 matrix is the following:

|abcdefghi|=aei+bfg+cdhcegbdiafh. {\displaystyle {\begin{vmatrix}a&b&c\\d&e&f\\g&h&i\end{vmatrix}}=aei+bfg+cdh-ceg-bdi-afh.\ }

In this expression, each term has one factor from each row, all in different columns, arranged in increasing row order. For example,bdi hasb from the first row second column,d from the second row first column, andi from the third row third column. The signs are determined by how many transpositions of factors are necessary to arrange the factors in increasing order of their columns (given that the terms are arranged left-to-right in increasing row order): positive for an even number of transpositions and negative for an odd number. For the example ofbdi, the single transposition ofbd todb givesdbi, whose three factors are from the first, second and third columns respectively; this is an odd number of transpositions, so the term appears with negative sign.

Rule of Sarrus

Therule of Sarrus is a mnemonic for the expanded form of this determinant: the sum of the products of three diagonal north-west to south-east lines of matrix elements, minus the sum of the products of three diagonal south-west to north-east lines of elements, when the copies of the first two columns of the matrix are written beside it as in the illustration. This scheme for calculating the determinant of a3 × 3 matrix does not carry over into higher dimensions.

n ×n matrices

[edit]

Generalizing the above to higher dimensions, the determinant of ann×n{\displaystyle n\times n} matrix is an expression involvingpermutations and theirsignatures. A permutation of the set{1,2,,n}{\displaystyle \{1,2,\dots ,n\}} is abijective functionσ{\displaystyle \sigma } from this set to itself, with valuesσ(1),σ(2),,σ(n){\displaystyle \sigma (1),\sigma (2),\ldots ,\sigma (n)} exhausting the entire set. The set of all such permutations, called thesymmetric group, is commonly denotedSn{\displaystyle S_{n}}. The signaturesgn(σ){\displaystyle \operatorname {sgn}(\sigma )} of a permutationσ{\displaystyle \sigma } is+1,{\displaystyle +1,} if the permutation can be obtained with an even number of transpositions (exchanges of two entries); otherwise, it is1.{\displaystyle -1.}

Given a matrix

A=[a1,1a1,nan,1an,n],{\displaystyle A={\begin{bmatrix}a_{1,1}\ldots a_{1,n}\\\vdots \qquad \vdots \\a_{n,1}\ldots a_{n,n}\end{bmatrix}},}

the Leibniz formula for its determinant is, usingsigma notation for the sum,

det(A)=|a1,1a1,nan,1an,n|=σSnsgn(σ)a1,σ(1)an,σ(n).{\displaystyle \det(A)={\begin{vmatrix}a_{1,1}\ldots a_{1,n}\\\vdots \qquad \vdots \\a_{n,1}\ldots a_{n,n}\end{vmatrix}}=\sum _{\sigma \in S_{n}}\operatorname {sgn}(\sigma )a_{1,\sigma (1)}\cdots a_{n,\sigma (n)}.}

Usingpi notation for the product, this can be shortened into

det(A)=σSn(sgn(σ)i=1nai,σ(i)){\displaystyle \det(A)=\sum _{\sigma \in S_{n}}\left(\operatorname {sgn}(\sigma )\prod _{i=1}^{n}a_{i,\sigma (i)}\right)}.

TheLevi-Civita symbolεi1,,in{\displaystyle \varepsilon _{i_{1},\ldots ,i_{n}}} is defined on then-tuples of integers in{1,,n}{\displaystyle \{1,\ldots ,n\}} as0 if two of the integers are equal, and otherwise as the signature of the permutation defined by then-tuple of integers. With the Levi-Civita symbol, the Leibniz formula becomes

det(A)=i1,i2,,inεi1ina1,i1an,in,{\displaystyle \det(A)=\sum _{i_{1},i_{2},\ldots ,i_{n}}\varepsilon _{i_{1}\cdots i_{n}}a_{1,i_{1}}\!\cdots a_{n,i_{n}},}

where the sum is taken over alln-tuples of integers in{1,,n}.{\displaystyle \{1,\ldots ,n\}.}[3][4]

Properties

[edit]

Characterization of the determinant

[edit]

The determinant can be characterized by the following three key properties. To state these, it is convenient to regard ann×n{\displaystyle n\times n} matrixA as being composed of itsn{\displaystyle n} columns, so denoted as

A=(a1,,an),{\displaystyle A={\big (}a_{1},\dots ,a_{n}{\big )},}

where thecolumn vectorai{\displaystyle a_{i}} (for eachi) is composed of the entries of the matrix in thei-th column.

  1. det(I)=1{\displaystyle \det \left(I\right)=1}, whereI{\displaystyle I} is anidentity matrix.
  2. The determinant ismultilinear: if thejth column of a matrixA{\displaystyle A} is written as alinear combinationaj=rv+w{\displaystyle a_{j}=r\cdot v+w} of twocolumn vectorsv andw and a numberr, then the determinant ofA is expressible as a similar linear combination:
    |A|=|a1,,aj1,rv+w,aj+1,,an|=r|a1,,v,an|+|a1,,w,,an|{\displaystyle {\begin{aligned}|A|&={\big |}a_{1},\dots ,a_{j-1},r\cdot v+w,a_{j+1},\dots ,a_{n}|\\&=r\cdot |a_{1},\dots ,v,\dots a_{n}|+|a_{1},\dots ,w,\dots ,a_{n}|\end{aligned}}}
  3. The determinant isalternating: whenever two columns of a matrix are identical, its determinant is 0:
    |a1,,v,,v,,an|=0.{\displaystyle |a_{1},\dots ,v,\dots ,v,\dots ,a_{n}|=0.}

If the determinant is defined using the Leibniz formula as above, these three properties can be proved by direct inspection of that formula. Some authors also approach the determinant directly using these three properties: it can be shown that there is exactly one function that assigns to anyn×n{\displaystyle n\times n} matrixA a number that satisfies these three properties.[5] This also shows that this more abstract approach to the determinant yields the same definition as the one using the Leibniz formula.

To see this it suffices to expand the determinant by multi-linearity in the columns into a (huge) linear combination of determinants of matrices in which each column is astandard basis vector. These determinants are either 0 (by property 9) or else ±1 (by properties 1 and 12 below), so the linear combination gives the expression above in terms of the Levi-Civita symbol. While less technical in appearance, this characterization cannot entirely replace the Leibniz formula in defining the determinant, since without it the existence of an appropriate function is not clear.[citation needed]

Immediate consequences

[edit]

These rules have several further consequences:

Example

[edit]

These characterizing properties and their consequences listed above are both theoretically significant, but can also be used to compute determinants for concrete matrices. In fact,Gaussian elimination can be applied to bring any matrix into upper triangular form, and the steps in this algorithm affect the determinant in a controlled way. The following concrete example illustrates the computation of the determinant of the matrixA{\displaystyle A} using that method:

A=[212214331].{\displaystyle A={\begin{bmatrix}-2&-1&2\\2&1&4\\-3&3&-1\end{bmatrix}}.}
Computation of the determinant of matrixA{\displaystyle A}
MatrixB=[312314031]{\displaystyle B={\begin{bmatrix}-3&-1&2\\3&1&4\\0&3&-1\end{bmatrix}}}

C=[3523134001]{\displaystyle C={\begin{bmatrix}-3&5&2\\3&13&4\\0&0&-1\end{bmatrix}}}

D=[5321334001]{\displaystyle D={\begin{bmatrix}5&-3&2\\13&3&4\\0&0&-1\end{bmatrix}}}

E=[1832034001]{\displaystyle E={\begin{bmatrix}18&-3&2\\0&3&4\\0&0&-1\end{bmatrix}}}

Obtained by

add the second column to the first

add 3 times the third column to the second

swap the first two columns

add133{\displaystyle -{\frac {13}{3}}} times the second column to the first

Determinant|A|=|B|{\displaystyle |A|=|B|}

|B|=|C|{\displaystyle |B|=|C|}

|D|=|C|{\displaystyle |D|=-|C|}

|E|=|D|{\displaystyle |E|=|D|}

Combining these equalities gives|A|=|E|=(183(1))=54.{\displaystyle |A|=-|E|=-(18\cdot 3\cdot (-1))=54.}

Transpose

[edit]

The determinant of thetranspose ofA{\displaystyle A} equals the determinant ofA:

det(AT)=det(A){\displaystyle \det \left(A^{\textsf {T}}\right)=\det(A)}.

This can be proven by inspecting the Leibniz formula.[6] This implies that in all the properties mentioned above, the word "column" can be replaced by "row" throughout. For example, viewing ann ×n matrix as being composed ofn rows, the determinant is ann-linear function.

Multiplicativity and matrix groups

[edit]

The determinant is amultiplicative map, i.e., for square matricesA{\displaystyle A} andB{\displaystyle B} of equal size, the determinant of amatrix product equals the product of their determinants:

det(AB)=det(A)det(B){\displaystyle \det(AB)=\det(A)\det(B)}

This key fact can be proven by observing that, for a fixed matrixB{\displaystyle B}, both sides of the equation are alternating and multilinear as a function depending on the columns ofA{\displaystyle A}. Moreover, they both take the valuedetB{\displaystyle \det B} whenA{\displaystyle A} is the identity matrix. The above-mentioned unique characterization of alternating multilinear maps therefore shows this claim.[7]

A matrixA{\displaystyle A} with entries in afield isinvertible precisely if its determinant is nonzero. This follows from the multiplicativity of the determinant and the formula for the inverse involving the adjugate matrix mentioned below. In this event, the determinant of the inverse matrix is given by

det(A1)=1det(A)=[det(A)]1{\displaystyle \det \left(A^{-1}\right)={\frac {1}{\det(A)}}=[\det(A)]^{-1}}.

In particular, products and inverses of matrices with non-zero determinant (respectively, determinant one) still have this property. Thus, the set of such matrices (of fixed sizen{\displaystyle n} over a fieldK{\displaystyle K}) forms a group known as thegeneral linear groupGLn(K){\displaystyle \operatorname {GL} _{n}(K)} (respectively, asubgroup called thespecial linear groupSLn(K)GLn(K){\displaystyle \operatorname {SL} _{n}(K)\subset \operatorname {GL} _{n}(K)}. More generally, the word "special" indicates the subgroup of anothermatrix group of matrices of determinant one. Examples include thespecial orthogonal group (which ifn is 2 or 3 consists of allrotation matrices), and thespecial unitary group.

Because the determinant respects multiplication and inverses, it is in fact agroup homomorphism fromGLn(K){\displaystyle \operatorname {GL} _{n}(K)} into the multiplicative groupK×{\displaystyle K^{\times }} of nonzero elements ofK{\displaystyle K}. This homomorphism is surjective and its kernel isSLn(K){\displaystyle \operatorname {SL} _{n}(K)} (the matrices with determinant one). Hence, by thefirst isomorphism theorem, this shows thatSLn(K){\displaystyle \operatorname {SL} _{n}(K)} is anormal subgroup ofGLn(K){\displaystyle \operatorname {GL} _{n}(K)}, and that thequotient groupGLn(K)/SLn(K){\displaystyle \operatorname {GL} _{n}(K)/\operatorname {SL} _{n}(K)} is isomorphic toK×{\displaystyle K^{\times }}.

TheCauchy–Binet formula is a generalization of that product formula forrectangular matrices. This formula can also be recast as a multiplicative formula forcompound matrices whose entries are the determinants of all quadratic submatrices of a given matrix.[8][9]

Laplace expansion

[edit]

Laplace expansion expresses the determinant of a matrixA{\displaystyle A}recursively in terms of determinants of smaller matrices, known as itsminors. The minorMi,j{\displaystyle M_{i,j}} is defined to be the determinant of the(n1)×(n1){\displaystyle (n-1)\times (n-1)} matrix that results fromA{\displaystyle A} by removing thei{\displaystyle i}-th row and thej{\displaystyle j}-th column. The expression(1)i+jMi,j{\displaystyle (-1)^{i+j}M_{i,j}} is known as acofactor. For everyi{\displaystyle i}, one has the equality

det(A)=j=1n(1)i+jai,jMi,j,{\displaystyle \det(A)=\sum _{j=1}^{n}(-1)^{i+j}a_{i,j}M_{i,j},}

which is called theLaplace expansion along theith row. For example, the Laplace expansion along the first row (i=1{\displaystyle i=1}) gives the following formula:

|abcdefghi|=a|efhi|b|dfgi|+c|degh|{\displaystyle {\begin{vmatrix}a&b&c\\d&e&f\\g&h&i\end{vmatrix}}=a{\begin{vmatrix}e&f\\h&i\end{vmatrix}}-b{\begin{vmatrix}d&f\\g&i\end{vmatrix}}+c{\begin{vmatrix}d&e\\g&h\end{vmatrix}}}

Unwinding the determinants of these2×2{\displaystyle 2\times 2}-matrices gives back the Leibniz formula mentioned above. Similarly, theLaplace expansion along thej{\displaystyle j}-th column is the equality

det(A)=i=1n(1)i+jai,jMi,j.{\displaystyle \det(A)=\sum _{i=1}^{n}(-1)^{i+j}a_{i,j}M_{i,j}.}

Laplace expansion can be used iteratively for computing determinants, but this approach is inefficient for large matrices. However, it is useful for computing the determinants of highly symmetric matrix such as theVandermonde matrix|1111x1x2x3xnx12x22x32xn2x1n1x2n1x3n1xnn1|=1i<jn(xjxi).{\displaystyle {\begin{vmatrix}1&1&1&\cdots &1\\x_{1}&x_{2}&x_{3}&\cdots &x_{n}\\x_{1}^{2}&x_{2}^{2}&x_{3}^{2}&\cdots &x_{n}^{2}\\\vdots &\vdots &\vdots &\ddots &\vdots \\x_{1}^{n-1}&x_{2}^{n-1}&x_{3}^{n-1}&\cdots &x_{n}^{n-1}\end{vmatrix}}=\prod _{1\leq i<j\leq n}\left(x_{j}-x_{i}\right).}Then-term Laplace expansion along a row or column can begeneralized to write ann xn determinant as a sum of(nk){\displaystyle {\tbinom {n}{k}}}terms, each the product of the determinant of ak xksubmatrix and the determinant of the complementary (n−k) x (n−k) submatrix.

Adjugate matrix

[edit]

Theadjugate matrixadj(A){\displaystyle \operatorname {adj} (A)} is the transpose of the matrix of the cofactors, that is,

(adj(A))i,j=(1)i+jMji.{\displaystyle (\operatorname {adj} (A))_{i,j}=(-1)^{i+j}M_{ji}.}

For every matrix, one has[10]

(detA)I=AadjA=(adjA)A.{\displaystyle (\det A)I=A\operatorname {adj} A=(\operatorname {adj} A)\,A.}

Thus the adjugate matrix can be used for expressing the inverse of anonsingular matrix:

A1=1detAadjA.{\displaystyle A^{-1}={\frac {1}{\det A}}\operatorname {adj} A.}

Block matrices

[edit]

The formula for the determinant of a2×2{\displaystyle 2\times 2} matrix above continues to hold, under appropriate further assumptions, for ablock matrix, i.e., a matrix composed of four submatricesA,B,C,D{\displaystyle A,B,C,D} of dimensionm×m{\displaystyle m\times m},m×n{\displaystyle m\times n},n×m{\displaystyle n\times m} andn×n{\displaystyle n\times n}, respectively. The easiest such formula, which can be proven using either the Leibniz formula or a factorization involving theSchur complement, is

det(A0CD)=det(A)det(D)=det(AB0D).{\displaystyle \det {\begin{pmatrix}A&0\\C&D\end{pmatrix}}=\det(A)\det(D)=\det {\begin{pmatrix}A&B\\0&D\end{pmatrix}}.}

IfA{\displaystyle A} isinvertible, then it follows with results from the section on multiplicativity that

det(ABCD)=det(A)det(ABCD)det(A1A1B0In)=det(A1)=(detA)1=det(A)det(Im0CA1DCA1B)=det(A)det(DCA1B),{\displaystyle {\begin{aligned}\det {\begin{pmatrix}A&B\\C&D\end{pmatrix}}&=\det(A)\det {\begin{pmatrix}A&B\\C&D\end{pmatrix}}\underbrace {\det {\begin{pmatrix}A^{-1}&-A^{-1}B\\0&I_{n}\end{pmatrix}}} _{=\,\det(A^{-1})\,=\,(\det A)^{-1}}\\&=\det(A)\det {\begin{pmatrix}I_{m}&0\\CA^{-1}&D-CA^{-1}B\end{pmatrix}}\\&=\det(A)\det(D-CA^{-1}B),\end{aligned}}}

which simplifies todet(A)(DCA1B){\displaystyle \det(A)(D-CA^{-1}B)} whenD{\displaystyle D} is a1×1{\displaystyle 1\times 1} matrix.

A similar result holds whenD{\displaystyle D} is invertible, namely

det(ABCD)=det(D)det(ABCD)det(Im0D1CD1)=det(D1)=(detD)1=det(D)det(ABD1CBD10In)=det(D)det(ABD1C).{\displaystyle {\begin{aligned}\det {\begin{pmatrix}A&B\\C&D\end{pmatrix}}&=\det(D)\det {\begin{pmatrix}A&B\\C&D\end{pmatrix}}\underbrace {\det {\begin{pmatrix}I_{m}&0\\-D^{-1}C&D^{-1}\end{pmatrix}}} _{=\,\det(D^{-1})\,=\,(\det D)^{-1}}\\&=\det(D)\det {\begin{pmatrix}A-BD^{-1}C&BD^{-1}\\0&I_{n}\end{pmatrix}}\\&=\det(D)\det(A-BD^{-1}C).\end{aligned}}}

Both results can be combined to deriveSylvester's determinant theorem, which is also stated below.

If the blocks are square matrices of thesame size further formulas hold. For example, ifC{\displaystyle C} andD{\displaystyle D}commute (i.e.,CD=DC{\displaystyle CD=DC}), then[11]

det(ABCD)=det(ADBC).{\displaystyle \det {\begin{pmatrix}A&B\\C&D\end{pmatrix}}=\det(AD-BC).}

This formula has been generalized to matrices composed of more than2×2{\displaystyle 2\times 2} blocks, again under appropriate commutativity conditions among the individual blocks.[12]

ForA=D{\displaystyle A=D} andB=C{\displaystyle B=C}, the following formula holds (even ifA{\displaystyle A} andB{\displaystyle B} do not commute).

det(ABBA)=det(A+BBB+AA)=det(A+BB0AB)=det(A+B)det(AB).{\displaystyle \det {\begin{pmatrix}A&B\\B&A\end{pmatrix}}=\det {\begin{pmatrix}A+B&B\\B+A&A\end{pmatrix}}=\det {\begin{pmatrix}A+B&B\\0&A-B\end{pmatrix}}=\det(A+B)\det(A-B).}

Sylvester's determinant theorem

[edit]

Sylvester's determinant theorem states that forA, anm ×n matrix, andB, ann ×m matrix (so thatA andB have dimensions allowing them to be multiplied in either order forming a square matrix):

det(Im+AB)=det(In+BA),{\displaystyle \det \left(I_{\mathit {m}}+AB\right)=\det \left(I_{\mathit {n}}+BA\right),}

whereIm andIn are them ×m andn ×n identity matrices, respectively.

From this general result several consequences follow.

  1. For the case of column vectorc and row vectorr, each withm components, the formula allows quick calculation of the determinant of a matrix that differs from the identity matrix by a matrix of rank 1:
    det(Im+cr)=1+rc.{\displaystyle \det \left(I_{\mathit {m}}+cr\right)=1+rc.}
  2. More generally,[13] for any invertiblem ×m matrixX,
    det(X+AB)=det(X)det(In+BX1A),{\displaystyle \det(X+AB)=\det(X)\det \left(I_{\mathit {n}}+BX^{-1}A\right),}
  3. For a column and row vector as above:
    det(X+cr)=det(X)det(1+rX1c)=det(X)+radj(X)c.{\displaystyle \det(X+cr)=\det(X)\det \left(1+rX^{-1}c\right)=\det(X)+r\,\operatorname {adj} (X)\,c.}
  4. For square matricesA{\displaystyle A} andB{\displaystyle B} of the same size, the matricesAB{\displaystyle AB} andBA{\displaystyle BA} have the same characteristic polynomials (hence the same eigenvalues).

A generalization isdet(Z+AWB)=det(Z)det(W)det(W1+BZ1A){\displaystyle \det \left(Z+AWB\right)=\det \left(Z\right)\det \left(W\right)\det \left(W^{-1}+BZ^{-1}A\right)}(seeMatrix determinant lemma), whereZ is anm ×m invertible matrix andW is ann ×n invertible matrix.

Sum

[edit]

The determinant of the sumA+B{\displaystyle A+B} of two square matrices of the same size is not in general expressible in terms of the determinants ofA and ofB.

However, forpositive semidefinite matricesA{\displaystyle A},B{\displaystyle B} andC{\displaystyle C} of equal size,det(A+B+C)+det(C)det(A+C)+det(B+C),{\displaystyle \det(A+B+C)+\det(C)\geq \det(A+C)+\det(B+C){\text{,}}}with the corollary[14][15]det(A+B)det(A)+det(B).{\displaystyle \det(A+B)\geq \det(A)+\det(B){\text{.}}}

Brunn–Minkowski theorem implies that thenth root of determinant is aconcave function, when restricted toHermitian positive-definiten×n{\displaystyle n\times n} matrices.[16] Therefore, ifA andB are Hermitian positive-definiten×n{\displaystyle n\times n} matrices, one hasdet(A+B)ndet(A)n+det(B)n,{\displaystyle {\sqrt[{n}]{\det(A+B)}}\geq {\sqrt[{n}]{\det(A)}}+{\sqrt[{n}]{\det(B)}},} since thenth root of the determinant is ahomogeneous function.

Sum identity for 2×2 matrices

[edit]

For the special case of2×2{\displaystyle 2\times 2} matrices with complex entries, the determinant of the sum can be written in terms of determinants and traces in the following identity:

det(A+B)=det(A)+det(B)+tr(A)tr(B)tr(AB).{\displaystyle \det(A+B)=\det(A)+\det(B)+{\text{tr}}(A){\text{tr}}(B)-{\text{tr}}(AB).}

Properties of the determinant in relation to other notions

[edit]

Eigenvalues and characteristic polynomial

[edit]

The determinant is closely related to two other central concepts in linear algebra, theeigenvalues and thecharacteristic polynomial of a matrix. LetA{\displaystyle A} be ann×n{\displaystyle n\times n} matrix withcomplex entries. Then, by the Fundamental Theorem of Algebra,A{\displaystyle A} must have exactlyneigenvaluesλ1,λ2,,λn{\displaystyle \lambda _{1},\lambda _{2},\ldots ,\lambda _{n}}. (Here it is understood that an eigenvalue withalgebraic multiplicityμ occursμ times in this list.) Then, it turns out the determinant ofA is equal to theproduct of these eigenvalues,

det(A)=i=1nλi=λ1λ2λn.{\displaystyle \det(A)=\prod _{i=1}^{n}\lambda _{i}=\lambda _{1}\lambda _{2}\cdots \lambda _{n}.}

The product of all non-zero eigenvalues is referred to aspseudo-determinant.

From this, one immediately sees that the determinant of a matrixA{\displaystyle A} is zero if and only if0{\displaystyle 0} is an eigenvalue ofA{\displaystyle A}. In other words,A{\displaystyle A} is invertible if and only if0{\displaystyle 0} is not an eigenvalue ofA{\displaystyle A}.

The characteristic polynomial is defined as[17]

χA(t)=det(tIA).{\displaystyle \chi _{A}(t)=\det(t\cdot I-A).}

Here,t{\displaystyle t} is theindeterminate of the polynomial andI{\displaystyle I} is the identity matrix of the same size asA{\displaystyle A}. By means of this polynomial, determinants can be used to find theeigenvalues of the matrixA{\displaystyle A}: they are precisely theroots of this polynomial, i.e., those complex numbersλ{\displaystyle \lambda } such that

χA(λ)=0.{\displaystyle \chi _{A}(\lambda )=0.}

AHermitian matrix ispositive definite if all its eigenvalues are positive.Sylvester's criterion asserts that this is equivalent to the determinants of the submatrices

Ak:=[a1,1a1,2a1,ka2,1a2,2a2,kak,1ak,2ak,k]{\displaystyle A_{k}:={\begin{bmatrix}a_{1,1}&a_{1,2}&\cdots &a_{1,k}\\a_{2,1}&a_{2,2}&\cdots &a_{2,k}\\\vdots &\vdots &\ddots &\vdots \\a_{k,1}&a_{k,2}&\cdots &a_{k,k}\end{bmatrix}}}

being positive, for allk{\displaystyle k} between1{\displaystyle 1} andn{\displaystyle n}.[18]

Trace

[edit]

Thetrace tr(A) is by definition the sum of the diagonal entries ofA and also equals the sum of the eigenvalues. Thus, for complex matricesA,

det(exp(A))=exp(tr(A)){\displaystyle \det(\exp(A))=\exp(\operatorname {tr} (A))}

or, for real matricesA,

tr(A)=log(det(exp(A))).{\displaystyle \operatorname {tr} (A)=\log(\det(\exp(A))).}

Here exp(A) denotes thematrix exponential ofA, because every eigenvalueλ ofA corresponds to the eigenvalue exp(λ) of exp(A). In particular, given anylogarithm ofA, that is, any matrixL satisfying

exp(L)=A{\displaystyle \exp(L)=A}

the determinant ofA is given by

det(A)=exp(tr(L)).{\displaystyle \det(A)=\exp(\operatorname {tr} (L)).}

For example, forn = 2,n = 3, andn = 4, respectively,

det(A)=12((tr(A))2tr(A2)),det(A)=16((tr(A))33tr(A) tr(A2)+2tr(A3)),det(A)=124((tr(A))46tr(A2)(tr(A))2+3(tr(A2))2+8tr(A3) tr(A)6tr(A4)).{\displaystyle {\begin{aligned}\det(A)&={\frac {1}{2}}\left(\left(\operatorname {tr} (A)\right)^{2}-\operatorname {tr} \left(A^{2}\right)\right),\\\det(A)&={\frac {1}{6}}\left(\left(\operatorname {tr} (A)\right)^{3}-3\operatorname {tr} (A)~\operatorname {tr} \left(A^{2}\right)+2\operatorname {tr} \left(A^{3}\right)\right),\\\det(A)&={\frac {1}{24}}\left(\left(\operatorname {tr} (A)\right)^{4}-6\operatorname {tr} \left(A^{2}\right)\left(\operatorname {tr} (A)\right)^{2}+3\left(\operatorname {tr} \left(A^{2}\right)\right)^{2}+8\operatorname {tr} \left(A^{3}\right)~\operatorname {tr} (A)-6\operatorname {tr} \left(A^{4}\right)\right).\end{aligned}}}

cf.Cayley-Hamilton theorem. Such expressions are deducible from combinatorial arguments,Newton's identities, or theFaddeev–LeVerrier algorithm. That is, for genericn,detA = (−1)nc0 the signed constant term of thecharacteristic polynomial, determined recursively from

cn=1;   cnm=1mk=1mcnm+ktr(Ak)  (1mn) .{\displaystyle c_{n}=1;~~~c_{n-m}=-{\frac {1}{m}}\sum _{k=1}^{m}c_{n-m+k}\operatorname {tr} \left(A^{k}\right)~~(1\leq m\leq n)~.}

In the general case, this may also be obtained from[19]

det(A)=k1,k2,,kn0k1+2k2++nkn=nl=1n(1)kl+1lklkl!tr(Al)kl,{\displaystyle \det(A)=\sum _{\begin{array}{c}k_{1},k_{2},\ldots ,k_{n}\geq 0\\k_{1}+2k_{2}+\cdots +nk_{n}=n\end{array}}\prod _{l=1}^{n}{\frac {(-1)^{k_{l}+1}}{l^{k_{l}}k_{l}!}}\operatorname {tr} \left(A^{l}\right)^{k_{l}},}

where the sum is taken over the set of all integerskl ≥ 0 satisfying the equation

l=1nlkl=n.{\displaystyle \sum _{l=1}^{n}lk_{l}=n.}

The formula can be expressed in terms of the complete exponentialBell polynomial ofn argumentssl = −(l – 1)! tr(Al) as

det(A)=(1)nn!Bn(s1,s2,,sn).{\displaystyle \det(A)={\frac {(-1)^{n}}{n!}}B_{n}(s_{1},s_{2},\ldots ,s_{n}).}

This formula can also be used to find the determinant of a matrixAIJ with multidimensional indicesI = (i1,i2, ...,ir) andJ = (j1,j2, ...,jr). The product and trace of such matrices are defined in a natural way as

(AB)JI=KAKIBJK,tr(A)=IAII.{\displaystyle (AB)_{J}^{I}=\sum _{K}A_{K}^{I}B_{J}^{K},\operatorname {tr} (A)=\sum _{I}A_{I}^{I}.}

An important arbitrary dimensionn identity can be obtained from theMercator series expansion of the logarithm when the expansion converges. If every eigenvalue ofA is less than 1 in absolute value,

det(I+A)=k=01k!(j=1(1)jjtr(Aj))k,{\displaystyle \det(I+A)=\sum _{k=0}^{\infty }{\frac {1}{k!}}\left(-\sum _{j=1}^{\infty }{\frac {(-1)^{j}}{j}}\operatorname {tr} \left(A^{j}\right)\right)^{k}\,,}

whereI is the identity matrix. More generally, if

k=01k!(j=1(1)jsjjtr(Aj))k,{\displaystyle \sum _{k=0}^{\infty }{\frac {1}{k!}}\left(-\sum _{j=1}^{\infty }{\frac {(-1)^{j}s^{j}}{j}}\operatorname {tr} \left(A^{j}\right)\right)^{k}\,,}

is expanded as a formalpower series ins then all coefficients ofsm form >n are zero and the remaining polynomial isdet(I +sA).

Upper and lower bounds

[edit]

For a positive definite matrixA, the trace operator gives the following tight lower and upper bounds on the log determinant

tr(IA1)logdet(A)tr(AI){\displaystyle \operatorname {tr} \left(I-A^{-1}\right)\leq \log \det(A)\leq \operatorname {tr} (A-I)}

with equality if and only ifA =I. This relationship can be derived via the formula for theKullback-Leibler divergence between twomultivariate normal distributions.

Also,

ntr(A1)det(A)1n1ntr(A)1ntr(A2).{\displaystyle {\frac {n}{\operatorname {tr} \left(A^{-1}\right)}}\leq \det(A)^{\frac {1}{n}}\leq {\frac {1}{n}}\operatorname {tr} (A)\leq {\sqrt {{\frac {1}{n}}\operatorname {tr} \left(A^{2}\right)}}.}

These inequalities can be proved by expressing the traces and the determinant in terms of the eigenvalues. As such, they represent the well-known fact that theharmonic mean is less than thegeometric mean, which is less than thearithmetic mean, which is, in turn, less than theroot mean square.

Derivative

[edit]

The Leibniz formula shows that the determinant of real (or analogously for complex) square matrices is apolynomial function fromRn×n{\displaystyle \mathbf {R} ^{n\times n}} toR{\displaystyle \mathbf {R} }. In particular, it is everywheredifferentiable. Its derivative can be expressed usingJacobi's formula:[20]

ddet(A)dα=tr(adj(A)dAdα).{\displaystyle {\frac {d\det(A)}{d\alpha }}=\operatorname {tr} \left(\operatorname {adj} (A){\frac {dA}{d\alpha }}\right).}

whereadj(A){\displaystyle \operatorname {adj} (A)} denotes theadjugate ofA{\displaystyle A}. In particular, ifA{\displaystyle A} is invertible, we have

ddet(A)dα=det(A)tr(A1dAdα).{\displaystyle {\frac {d\det(A)}{d\alpha }}=\det(A)\operatorname {tr} \left(A^{-1}{\frac {dA}{d\alpha }}\right).}

Expressed in terms of the entries ofA{\displaystyle A}, these are

det(A)Aij=adj(A)ji=det(A)(A1)ji.{\displaystyle {\frac {\partial \det(A)}{\partial A_{ij}}}=\operatorname {adj} (A)_{ji}=\det(A)\left(A^{-1}\right)_{ji}.}

Yet another equivalent formulation is

det(A+ϵX)det(A)=tr(adj(A)X)ϵ+O(ϵ2)=det(A)tr(A1X)ϵ+O(ϵ2){\displaystyle \det(A+\epsilon X)-\det(A)=\operatorname {tr} (\operatorname {adj} (A)X)\epsilon +O\left(\epsilon ^{2}\right)=\det(A)\operatorname {tr} \left(A^{-1}X\right)\epsilon +O\left(\epsilon ^{2}\right)},

usingbig O notation. The special case whereA=I{\displaystyle A=I}, the identity matrix, yields

det(I+ϵX)=1+tr(X)ϵ+O(ϵ2).{\displaystyle \det(I+\epsilon X)=1+\operatorname {tr} (X)\epsilon +O\left(\epsilon ^{2}\right).}

This identity is used in describingLie algebras associated to certain matrixLie groups. For example, the special linear groupSLn{\displaystyle \operatorname {SL} _{n}} is defined by the equationdetA=1{\displaystyle \det A=1}. The above formula shows that its Lie algebra is thespecial linear Lie algebrasln{\displaystyle {\mathfrak {sl}}_{n}} consisting of those matrices having trace zero.

Writing a3×3{\displaystyle 3\times 3} matrix asA=[abc]{\displaystyle A={\begin{bmatrix}a&b&c\end{bmatrix}}} wherea,b,c{\displaystyle a,b,c} are column vectors of length 3, then the gradient over one of the three vectors may be written as thecross product of the other two:

adet(A)=b×cbdet(A)=c×acdet(A)=a×b.{\displaystyle {\begin{aligned}\nabla _{\mathbf {a} }\det(A)&=\mathbf {b} \times \mathbf {c} \\\nabla _{\mathbf {b} }\det(A)&=\mathbf {c} \times \mathbf {a} \\\nabla _{\mathbf {c} }\det(A)&=\mathbf {a} \times \mathbf {b} .\end{aligned}}}

History

[edit]

Historically, determinants were used long before matrices: A determinant was originally defined as a property of asystem of linear equations.The determinant "determines" whether the system has a unique solution (which occurs precisely if the determinant is non-zero).In this sense, determinants were first used in the Chinese mathematics textbookThe Nine Chapters on the Mathematical Art (九章算術, Chinese scholars, around the 3rd century BCE). In Europe, solutions of linear systems of two equations were expressed byCardano in 1545 by a determinant-like entity.[21]

Determinants proper originated separately from the work ofSeki Takakazu in 1683 in Japan and parallelly ofLeibniz in 1693.[22][23][24][25]Cramer (1750) stated, without proof, Cramer's rule.[26] Both Cramer and alsoBézout (1779) were led to determinants by the question ofplane curves passing through a given set of points.[27]

Vandermonde (1771) first recognized determinants as independent functions.[23]Laplace (1772) gave the general method of expanding a determinant in terms of its complementaryminors: Vandermonde had already given a special case.[28] Immediately following,Lagrange (1773) treated determinants of the second and third order and applied it to questions ofelimination theory; he proved many special cases of general identities.

Gauss (1801) made the next advance. Like Lagrange, he made much use of determinants in thetheory of numbers. He introduced the word "determinant" (Laplace had used "resultant"), though not in the present signification, but rather as applied to thediscriminant of aquadratic form.[29] Gauss also arrived at the notion of reciprocal (inverse) determinants, and came very near the multiplication theorem.[clarification needed]

The next contributor of importance isBinet (1811, 1812), who formally stated the theorem relating to the product of two matrices ofm columns andn rows, which for the special case ofm =n reduces to the multiplication theorem. On the same day (November 30, 1812) that Binet presented his paper to the Academy,Cauchy also presented one on the subject. (SeeCauchy–Binet formula.) In this he used the word "determinant" in its present sense,[30][31] summarized and simplified what was then known on the subject, improved the notation, and gave the multiplication theorem with a proof more satisfactory than Binet's.[23][32] With him begins the theory in its generality.

Jacobi (1841) used the functional determinant which Sylvester later called theJacobian.[33] In his memoirs inCrelle's Journal for 1841 he specially treats this subject, as well as the class of alternating functions which Sylvester has calledalternants. About the time of Jacobi's last memoirs,Sylvester (1839) andCayley began their work.Cayley 1841 introduced the modern notation for the determinant using vertical bars.[34][35]

The study of special forms of determinants has been the natural result of the completion of the general theory. Axisymmetric determinants have been studied byLebesgue,Hesse, and Sylvester;persymmetric determinants by Sylvester andHankel;circulants byCatalan,Spottiswoode,Glaisher, and Scott; skew determinants andPfaffians, in connection with the theory oforthogonal transformation, by Cayley; continuants by Sylvester;Wronskians (so called byMuir) byChristoffel andFrobenius; compound determinants by Sylvester, Reiss, and Picquet; Jacobians andHessians by Sylvester; and symmetric gauche determinants byTrudi. Of the textbooks on the subject Spottiswoode's was the first. In America, Hanus (1886), Weld (1893), and Muir/Metzler (1933) published treatises.

Applications

[edit]

Cramer's rule

[edit]

Determinants can be used to describe the solutions of alinear system of equations, written in matrix form asAx=b{\displaystyle Ax=b}. This equation has a unique solutionx{\displaystyle x} if and only ifdet(A){\displaystyle \det(A)} is nonzero. In this case, the solution is given byCramer's rule:

xi=det(Ai)det(A)i=1,2,3,,n{\displaystyle x_{i}={\frac {\det(A_{i})}{\det(A)}}\qquad i=1,2,3,\ldots ,n}

whereAi{\displaystyle A_{i}} is the matrix formed by replacing thei{\displaystyle i}-th column ofA{\displaystyle A} by the column vectorb{\displaystyle b}. This follows immediately by column expansion of the determinant, i.e.

det(Ai)=det[a1ban]{\displaystyle \det(A_{i})=\det {\begin{bmatrix}a_{1}&\ldots &b&\ldots &a_{n}\end{bmatrix}}}

=j=1nxjdet[a1ai1ajai+1an]=xidet(A){\displaystyle =\sum _{j=1}^{n}x_{j}\det {\begin{bmatrix}a_{1}&\ldots &a_{i-1}&a_{j}&a_{i+1}&\ldots &a_{n}\end{bmatrix}}=x_{i}\det(A)}

where the vectorsaj{\displaystyle a_{j}} are the columns ofA. The rule is also implied by the identity

Aadj(A)=adj(A)A=det(A)In.{\displaystyle A\,\operatorname {adj} (A)=\operatorname {adj} (A)\,A=\det(A)\,I_{n}.}

Cramer's rule can be implemented inO(n3){\displaystyle \operatorname {O} (n^{3})} time, which is comparable to more common methods of solving systems of linear equations, such asLU,QR, orsingular value decomposition.[36]

Linear independence

[edit]

Determinants can be used to characterizelinearly dependent vectors:detA{\displaystyle \det A} is zero if and only if the column vectors of the matrixA{\displaystyle A} are linearly dependent.[37] For example, given two linearly independent vectorsv1,v2R3{\displaystyle v_{1},v_{2}\in \mathbf {R} ^{3}}, a third vectorv3{\displaystyle v_{3}} lies in theplanespanned by the former two vectors exactly if the determinant of the3×3{\displaystyle 3\times 3} matrix consisting of the three vectors is zero. The same idea is also used in the theory ofdifferential equations: given functionsf1(x),,fn(x){\displaystyle f_{1}(x),\dots ,f_{n}(x)} (supposed to ben1{\displaystyle n-1} timesdifferentiable), theWronskian is defined to be

W(f1,,fn)(x)=|f1(x)f2(x)fn(x)f1(x)f2(x)fn(x)f1(n1)(x)f2(n1)(x)fn(n1)(x)|.{\displaystyle W(f_{1},\ldots ,f_{n})(x)={\begin{vmatrix}f_{1}(x)&f_{2}(x)&\cdots &f_{n}(x)\\f_{1}'(x)&f_{2}'(x)&\cdots &f_{n}'(x)\\\vdots &\vdots &\ddots &\vdots \\f_{1}^{(n-1)}(x)&f_{2}^{(n-1)}(x)&\cdots &f_{n}^{(n-1)}(x)\end{vmatrix}}.}

It is non-zero (for somex{\displaystyle x}) in a specified interval if and only if the given functions and all their derivatives up to ordern1{\displaystyle n-1} are linearly independent. If it can be shown that the Wronskian is zero everywhere on an interval then, in the case ofanalytic functions, this implies the given functions are linearly dependent. Seethe Wronskian and linear independence. Another such use of the determinant is theresultant, which gives a criterion when twopolynomials have a commonroot.[38]

Orientation of a basis

[edit]
Main article:Orientation (vector space)

The determinant can be thought of as assigning a number to everysequence ofn vectors inRn, by using the square matrix whose columns are the given vectors. The determinant will be nonzero if and only if the sequence of vectors is abasis forRn. In that case, the sign of the determinant determines whether theorientation of the basis is consistent with or opposite to the orientation of thestandard basis. In the case of an orthogonal basis, the magnitude of the determinant is equal to theproduct of the lengths of the basis vectors. For instance, anorthogonal matrix with entries inRn represents anorthonormal basis inEuclidean space, and hence has determinant of ±1 (since all the vectors have length 1). The determinant is +1 if and only if the basis has the same orientation. It is −1 if and only if the basis has the opposite orientation.

More generally, if the determinant ofA is positive,A represents an orientation-preservinglinear transformation (ifA is an orthogonal2 × 2 or3 × 3 matrix, this is arotation), while if it is negative,A switches the orientation of the basis.

Volume and Jacobian determinant

[edit]

As pointed out above, theabsolute value of the determinant of real vectors is equal to the volume of theparallelepiped spanned by those vectors. As a consequence, iff:RnRn{\displaystyle f:\mathbf {R} ^{n}\to \mathbf {R} ^{n}} is the linear map given by multiplication with a matrixA{\displaystyle A}, andSRn{\displaystyle S\subset \mathbf {R} ^{n}} is anymeasurablesubset, then the volume off(S){\displaystyle f(S)} is given by|det(A)|{\displaystyle |\det(A)|} times the volume ofS{\displaystyle S}.[39] More generally, if the linear mapf:RnRm{\displaystyle f:\mathbf {R} ^{n}\to \mathbf {R} ^{m}} is represented by them×n{\displaystyle m\times n} matrixA{\displaystyle A}, then then{\displaystyle n}-dimensional volume off(S){\displaystyle f(S)} is given by:

volume(f(S))=det(ATA)volume(S).{\displaystyle \operatorname {volume} (f(S))={\sqrt {\det \left(A^{\textsf {T}}A\right)}}\operatorname {volume} (S).}

By calculating the volume of thetetrahedron bounded by four points, they can be used to identifyskew lines. The volume of any tetrahedron, given itsverticesa,b,c,d{\displaystyle a,b,c,d},16|det(ab,bc,cd)|{\displaystyle {\frac {1}{6}}\cdot |\det(a-b,b-c,c-d)|}, or any other combination of pairs of vertices that form aspanning tree over the vertices.

A nonlinear mapf:R2R2{\displaystyle f\colon \mathbf {R} ^{2}\to \mathbf {R} ^{2}} sends a small square (left, in red) to a distorted parallelogram (right, in red). The Jacobian at a point gives the best linear approximation of the distorted parallelogram near that point (right, in translucent white), and the Jacobian determinant gives the ratio of the area of the approximating parallelogram to that of the original square.

For a generaldifferentiable function, much of the above carries over by considering theJacobian matrix off. For

f:RnRn,{\displaystyle f:\mathbf {R} ^{n}\rightarrow \mathbf {R} ^{n},}

the Jacobian matrix is then ×n matrix whose entries are given by thepartial derivatives

D(f)=(fixj)1i,jn.{\displaystyle D(f)=\left({\frac {\partial f_{i}}{\partial x_{j}}}\right)_{1\leq i,j\leq n}.}

Its determinant, theJacobian determinant, appears in the higher-dimensional version ofintegration by substitution: for suitable functionsf and anopen subsetU ofRn (the domain off), the integral overf(U) of some other functionφ :RnRm is given by

f(U)ϕ(v)dv=Uϕ(f(u))|det(Df)(u)|du.{\displaystyle \int _{f(U)}\phi (\mathbf {v} )\,d\mathbf {v} =\int _{U}\phi (f(\mathbf {u} ))\left|\det(\operatorname {D} f)(\mathbf {u} )\right|\,d\mathbf {u} .}

The Jacobian also occurs in theinverse function theorem.

When applied to the field ofCartography, the determinant can be used to measure the rate of expansion of a map near the poles.[40]

Abstract algebraic aspects

[edit]

Determinant of an endomorphism

[edit]

The above identities concerning the determinant of products and inverses of matrices imply thatsimilar matrices have the same determinant: two matricesA andB are similar, if there exists an invertible matrixX such thatA =X−1BX. Indeed, repeatedly applying the above identities yields

det(A)=det(X)1det(B)det(X)=det(B)det(X)1det(X)=det(B).{\displaystyle \det(A)=\det(X)^{-1}\det(B)\det(X)=\det(B)\det(X)^{-1}\det(X)=\det(B).}

The determinant is therefore also called asimilarity invariant. The determinant of alinear transformation

T:VV{\displaystyle T:V\to V}

for some finite-dimensionalvector spaceV is defined to be the determinant of the matrix describing it, with respect to an arbitrary choice ofbasis inV. By the similarity invariance, this determinant is independent of the choice of the basis forV and therefore only depends on the endomorphismT.

Square matrices over commutative rings

[edit]

The above definition of the determinant using the Leibniz rule holds works more generally when the entries of the matrix are elements of acommutative ringR{\displaystyle R}, such as the integersZ{\displaystyle \mathbf {Z} }, as opposed to thefield of real or complex numbers. Moreover, the characterization of the determinant as the unique alternating multilinear map that satisfiesdet(I)=1{\displaystyle \det(I)=1} still holds, as do all the properties that result from that characterization.[41]

A matrixAMatn×n(R){\displaystyle A\in \operatorname {Mat} _{n\times n}(R)} is invertible (in the sense that there is an inverse matrix whose entries are inR{\displaystyle R}) if and only if its determinant is aninvertible element inR{\displaystyle R}.[42] ForR=Z{\displaystyle R=\mathbf {Z} }, this means that the determinant is +1 or −1. Such a matrix is calledunimodular.

The determinant being multiplicative, it defines agroup homomorphism

GLn(R)R×,{\displaystyle \operatorname {GL} _{n}(R)\rightarrow R^{\times },}

between thegeneral linear group (the group of invertiblen×n{\displaystyle n\times n}-matrices with entries inR{\displaystyle R}) and themultiplicative group of units inR{\displaystyle R}. Since it respects the multiplication in both groups, this map is agroup homomorphism.

The determinant is a natural transformation.

Given aring homomorphismf:RS{\displaystyle f:R\to S}, there is a mapGLn(f):GLn(R)GLn(S){\displaystyle \operatorname {GL} _{n}(f):\operatorname {GL} _{n}(R)\to \operatorname {GL} _{n}(S)} given by replacing all entries inR{\displaystyle R} by their images underf{\displaystyle f}. The determinant respects these maps, i.e., the identity

f(det((ai,j)))=det((f(ai,j))){\displaystyle f(\det((a_{i,j})))=\det((f(a_{i,j})))}

holds. In other words, the displayed commutative diagram commutes.

For example, the determinant of thecomplex conjugate of a complex matrix (which is also the determinant of its conjugate transpose) is the complex conjugate of its determinant, and for integer matrices: the reduction modulom{\displaystyle m} of the determinant of such a matrix is equal to the determinant of the matrix reduced modulom{\displaystyle m} (the latter determinant being computed usingmodular arithmetic). In the language ofcategory theory, the determinant is anatural transformation between the two functorsGLn{\displaystyle \operatorname {GL} _{n}} and()×{\displaystyle (-)^{\times }}.[43] Adding yet another layer of abstraction, this is captured by saying that the determinant is a morphism ofalgebraic groups, from the general linear group to themultiplicative group,

det:GLnGm.{\displaystyle \det :\operatorname {GL} _{n}\to \mathbb {G} _{m}.}

Exterior algebra

[edit]
See also:Exterior algebra § Linear algebra

The determinant of a linear transformationT:VV{\displaystyle T:V\to V} of ann{\displaystyle n}-dimensional vector spaceV{\displaystyle V} or, more generally afree module of (finite)rankn{\displaystyle n} over a commutative ringR{\displaystyle R} can be formulated in a coordinate-free manner by considering then{\displaystyle n}-thexterior powernV{\displaystyle \bigwedge ^{n}V} ofV{\displaystyle V}.[44] The mapT{\displaystyle T} induces a linear map

nT:nVnVv1v2vnTv1Tv2Tvn.{\displaystyle {\begin{aligned}\bigwedge ^{n}T:\bigwedge ^{n}V&\rightarrow \bigwedge ^{n}V\\v_{1}\wedge v_{2}\wedge \dots \wedge v_{n}&\mapsto Tv_{1}\wedge Tv_{2}\wedge \dots \wedge Tv_{n}.\end{aligned}}}

AsnV{\displaystyle \bigwedge ^{n}V} is one-dimensional, the mapnT{\displaystyle \bigwedge ^{n}T} is given by multiplying with some scalar, i.e., an element inR{\displaystyle R}. Some authors such as (Bourbaki 1998) use this fact todefine the determinant to be the element inR{\displaystyle R} satisfying the following identity (for allviV{\displaystyle v_{i}\in V}):

(nT)(v1vn)=det(T)v1vn.{\displaystyle \left(\bigwedge ^{n}T\right)\left(v_{1}\wedge \dots \wedge v_{n}\right)=\det(T)\cdot v_{1}\wedge \dots \wedge v_{n}.}

This definition agrees with the more concrete coordinate-dependent definition. This can be shown using the uniqueness of a multilinear alternating form onn{\displaystyle n}-tuples of vectors inRn{\displaystyle R^{n}}.For this reason, the highest non-zero exterior powernV{\displaystyle \bigwedge ^{n}V} (as opposed to the determinant associated to an endomorphism) is sometimes also called the determinant ofV{\displaystyle V} and similarly for more involved objects such asvector bundles orchain complexes of vector spaces. Minors of a matrix can also be cast in this setting, by considering lower alternating formskV{\displaystyle \bigwedge ^{k}V} withk<n{\displaystyle k<n}.[45]

Berezin integral

[edit]

The conventional definition of the determinant, as a sum over permutations over a product of matrix elements, can be written using the somewhat surprising notation of theBerezin integral. In this notation, the determinant can be written as

exp[θTAη]dθdη=detA{\displaystyle \int \exp \left[-\theta ^{T}A\eta \right]\,d\theta \,d\eta =\det A}

This holds for anyn×n{\displaystyle n\times n}-dimensional matrixA.{\displaystyle A.} The symbolsθ,η{\displaystyle \theta ,\eta } are twon{\displaystyle n}-dimensional vectors of anti-commutingGrassmann numbers (aka "supernumbers"), taken from theGrassmann algebra. Theexp{\displaystyle \exp } here is theexponential function. The integral sign is meant to be understood as the Berezin integral. Despite the use of the integral symbol, this expression is in fact an entirely finite sum.

This unusual-looking expression can be understood as a notational trick that rewrites the conventional expression for the determinant

detA=σSnsgn(σ)a1,σ(1)an,σ(n).{\displaystyle \det A=\sum _{\sigma \in S_{n}}\operatorname {sgn}(\sigma )a_{1,\sigma (1)}\cdots a_{n,\sigma (n)}.}

by using some novel notation. The anti-commuting property of the Grassmann numbers captures the sign (signature) of the permutation, while the integral combined with theexp{\displaystyle \exp } ensures that all permutations are explored. That is, theTaylor's series forexp{\displaystyle \exp } terminates after exactlyn{\displaystyle n} terms, because the square of a Grassmann number is zero, and there are exactlyn{\displaystyle n} distinct Grassmann variables. Meanwhile, the integral is defined to vanish, if the corresponding Grassmann number doesnot appear in the integrand. Thus, the integral selects out only those terms in theexp{\displaystyle \exp } series that have exactlyn{\displaystyle n} distinct variables; all lower-order terms vanish. Thus, the somewhat magical combination of the integral sign, the use of anti-commuting variables, and the Taylor's series forexp{\displaystyle \exp } just encodes a finite sum, identical to the conventional summation.

This form is popular in physics, where it is often used as a stand-in for the Jacobian determinant. The appeal is that, notationally, the integral takes the form of apath integral, such as in thepath integral formulation for quantizedHamiltonian mechanics. An example can be found in the theory ofFadeev–Popov ghosts; although this theory may seem rather abstruse, it's best to keep in mind that the use of the ghost fields is little more than a notational trick to express a Jacobian determinant.

ThePfaffianPfA{\displaystyle \mathrm {Pf} \,A} of askew-symmetric matrixA{\displaystyle A} is the square-root of the determinant: that is,(PfA)2=detA.{\displaystyle \left(\mathrm {Pf} \,A\right)^{2}=\det A.} The Berezin integral form for the Pfaffian is even more suggestive; it is

exp[12θTAθ]dθ=PfA{\displaystyle \int \exp \left[-{\tfrac {1}{2}}\theta ^{T}A\theta \right]\,d\theta =\mathrm {Pf} \,A}

The integrand has exactly the same formal structure as a normalGaussian distribution, albeit with Grassman numbers, instead of real numbers. This formal resemblance accounts for the occasional appearance of supernumbers in the theory ofstochastic dynamics andstochastic differential equations.

Generalizations and related notions

[edit]

Determinants as treated above admit several variants: thepermanent of a matrix is defined as the determinant, except that the factorssgn(σ){\displaystyle \operatorname {sgn}(\sigma )} occurring in Leibniz's rule are omitted. Theimmanant generalizes both by introducing acharacter of thesymmetric groupSn{\displaystyle S_{n}} in Leibniz's rule.

Determinants for finite-dimensional algebras

[edit]

For anyassociative algebraA{\displaystyle A} that isfinite-dimensional as a vector space over a fieldF{\displaystyle F}, there is a determinant map[46]

det:AF.{\displaystyle \det :A\to F.}

This definition proceeds by establishing the characteristic polynomial independently of the determinant, and defining the determinant as the lowest order term of this polynomial. This general definition recovers the determinant for thematrix algebraA=Matn×n(F){\displaystyle A=\operatorname {Mat} _{n\times n}(F)}, but also includes several further cases including the determinant of aquaternion,

det(a+ib+jc+kd)=a2+b2+c2+d2{\displaystyle \det(a+ib+jc+kd)=a^{2}+b^{2}+c^{2}+d^{2}},

thenormNL/F:LF{\displaystyle N_{L/F}:L\to F} of afield extension, as well as thePfaffian of a skew-symmetric matrix and thereduced norm of acentral simple algebra, also arise as special cases of this construction.

Infinite matrices

[edit]

For matrices with an infinite number of rows and columns, the above definitions of the determinant do not carry over directly. For example, in the Leibniz formula, an infinite sum (all of whose terms are infinite products) would have to be calculated.Functional analysis provides different extensions of the determinant for such infinite-dimensional situations, which however only work for particular kinds of operators.

TheFredholm determinant defines the determinant for operators known astrace class operators by an appropriate generalization of the formula

det(I+A)=exp(tr(log(I+A))).{\displaystyle \det(I+A)=\exp(\operatorname {tr} (\log(I+A))).}

Another infinite-dimensional notion of determinant is thefunctional determinant.

Operators in von Neumann algebras

[edit]

For operators in a finitefactor, one may define a positive real-valued determinant called theFuglede−Kadison determinant using the canonical trace. In fact, corresponding to everytracial state on avon Neumann algebra there is a notion of Fuglede−Kadison determinant.

Related notions for non-commutative rings

[edit]

For matrices over non-commutative rings, multilinearity and alternating properties are incompatible forn ≥ 2,[47] so there is no good definition of the determinant in this setting.

For square matrices with entries in a non-commutative ring, there are various difficulties in defining determinants analogously to that for commutative rings. A meaning can be given to the Leibniz formula provided that the order for the product is specified, and similarly for other definitions of the determinant, but non-commutativity then leads to the loss of many fundamental properties of the determinant, such as the multiplicative property or that the determinant is unchanged under transposition of the matrix. Over non-commutative rings, there is no reasonable notion of a multilinear form (existence of a nonzerobilinear form[clarify] with aregular element ofR as value on some pair of arguments implies thatR is commutative). Nevertheless, various notions of non-commutative determinant have been formulated that preserve some of the properties of determinants, notablyquasideterminants and theDieudonné determinant. For some classes of matrices with non-commutative elements, one can define the determinant and prove linear algebra theorems that are very similar to their commutative analogs. Examples include theq-determinant on quantum groups, theCapelli determinant on Capelli matrices, and theBerezinian onsupermatrices (i.e., matrices whose entries are elements ofZ2{\displaystyle \mathbb {Z} _{2}}-graded rings).[48]Manin matrices form the class closest to matrices with commutative elements.

Calculation

[edit]

Determinants are mainly used as a theoretical tool. They are rarely calculated explicitly innumerical linear algebra, where for applications such as checking invertibility and finding eigenvalues the determinant has largely been supplanted by other techniques.[49]Computational geometry, however, does frequently use calculations related to determinants.[50]

While the determinant can be computed directly using the Leibniz rule this approach is extremely inefficient for large matrices, since that formula requires calculatingn!{\displaystyle n!} (n{\displaystyle n}factorial) products for ann×n{\displaystyle n\times n} matrix. Thus, the number of required operations grows very quickly: it isof ordern!{\displaystyle n!}. The Laplace expansion is similarly inefficient. Therefore, more involved techniques have been developed for calculating determinants.

Gaussian elimination

[edit]

Gaussian elimination consists of left multiplying a matrix byelementary matrices for getting a matrix in arow echelon form. One can restrict the computation to elementary matrices of determinant1. In this case, the determinant of the resulting row echelon form equals the determinant of the initial matrix. As a row echelon form is atriangular matrix, its determinant is the product of the entries of its diagonal.

So, the determinant can be computed for almost free from the result of a Gaussian elimination.

Decomposition methods

[edit]

Some methods computedet(A){\displaystyle \det(A)} by writing the matrix as a product of matrices whose determinants can be more easily computed. Such techniques are referred to as decomposition methods. Examples include theLU decomposition, theQR decomposition or theCholesky decomposition (forpositive definite matrices). These methods are of orderO(n3){\displaystyle \operatorname {O} (n^{3})}, which is a significant improvement overO(n!){\displaystyle \operatorname {O} (n!)}.[51]

For example, LU decomposition expressesA{\displaystyle A} as a product

A=PLU.{\displaystyle A=PLU.}

of apermutation matrixP{\displaystyle P} (which has exactly a single1{\displaystyle 1} in each column, and otherwise zeros), a lower triangular matrixL{\displaystyle L} and an upper triangular matrixU{\displaystyle U}.The determinants of the two triangular matricesL{\displaystyle L} andU{\displaystyle U} can be quickly calculated, since they are the products of the respective diagonal entries. The determinant ofP{\displaystyle P} is just the signε{\displaystyle \varepsilon } of the corresponding permutation (which is+1{\displaystyle +1} for an even number of permutations and is1{\displaystyle -1} for an odd number of permutations). Once such a LU decomposition is known forA{\displaystyle A}, its determinant is readily computed as

det(A)=εdet(L)det(U).{\displaystyle \det(A)=\varepsilon \det(L)\cdot \det(U).}

Further methods

[edit]

The orderO(n3){\displaystyle \operatorname {O} (n^{3})} reached by decomposition methods has been improved by different methods. If two matrices of ordern{\displaystyle n} can be multiplied in timeM(n){\displaystyle M(n)}, whereM(n)na{\displaystyle M(n)\geq n^{a}} for somea>2{\displaystyle a>2}, then there is an algorithm computing the determinant in timeO(M(n)){\displaystyle O(M(n))}.[52] This means, for example, that anO(n2.376){\displaystyle \operatorname {O} (n^{2.376})} algorithm for computing the determinant exists based on theCoppersmith–Winograd algorithm. This exponent has been further lowered, as of 2016, to 2.373.[53]

In addition to the complexity of the algorithm, further criteria can be used to compare algorithms.Especially for applications concerning matrices over rings, algorithms that compute the determinant without any divisions exist. (By contrast, Gauss elimination requires divisions.) One such algorithm, having complexityO(n4){\displaystyle \operatorname {O} (n^{4})} is based on the following idea: one replaces permutations (as in the Leibniz rule) by so-calledclosed ordered walks, in which several items can be repeated. The resulting sum has more terms than in the Leibniz rule, but in the process several of these products can be reused, making it more efficient than naively computing with the Leibniz rule.[54] Algorithms can also be assessed according to theirbit complexity, i.e., how many bits of accuracy are needed to store intermediate values occurring in the computation. For example, theGaussian elimination (or LU decomposition) method is of orderO(n3){\displaystyle \operatorname {O} (n^{3})}, but the bit length of intermediate values can become exponentially long.[55] By comparison, theBareiss Algorithm, is an exact-division method (so it does use division, but only in cases where these divisions can be performed without remainder) is of the same order, but the bit complexity is roughly the bit size of the original entries in the matrix timesn{\displaystyle n}.[56]

If the determinant ofA and the inverse ofA have already been computed, thematrix determinant lemma allows rapid calculation of the determinant ofA +uvT, whereu andv are column vectors.

Charles Dodgson (i.e.Lewis Carroll ofAlice's Adventures in Wonderland fame) invented a method for computing determinants calledDodgson condensation. Unfortunately this interesting method does not always work in its original form.[57]

See also

[edit]

Notes

[edit]
  1. ^Lang 1985, §VII.1
  2. ^"Determinants and Volumes".textbooks.math.gatech.edu. Retrieved16 March 2018.
  3. ^McConnell (1957).Applications of Tensor Analysis. Dover Publications. pp. 10–17.
  4. ^Harris 2014, §4.7
  5. ^Serge Lang,Linear Algebra, 2nd Edition, Addison-Wesley, 1971, pp 173, 191.
  6. ^Lang 1987, §VI.7, Theorem 7.5
  7. ^Alternatively,Bourbaki 1998, §III.8, Proposition 1 proves this result using thefunctoriality of the exterior power.
  8. ^Horn & Johnson 2018, §0.8.7
  9. ^Kung, Rota & Yan 2009, p. 306
  10. ^Horn & Johnson 2018, §0.8.2.
  11. ^Silvester, J. R. (2000)."Determinants of Block Matrices".Math. Gaz.84 (501):460–467.doi:10.2307/3620776.JSTOR 3620776.S2CID 41879675.
  12. ^Sothanaphan, Nat (January 2017). "Determinants of block matrices with noncommuting blocks".Linear Algebra and Its Applications.512:202–218.arXiv:1805.06027.doi:10.1016/j.laa.2016.10.004.S2CID 119272194.
  13. ^Proofs can be found inhttp://www.ee.ic.ac.uk/hp/staff/dmb/matrix/proof003.html
  14. ^Lin, Minghua; Sra, Suvrit (2014). "Completely strong superadditivity of generalized matrix functions".arXiv:1410.1958 [math.FA].
  15. ^Paksoy; Turkmen; Zhang (2014)."Inequalities of Generalized Matrix Functions via Tensor Products".Electronic Journal of Linear Algebra.27:332–341.doi:10.13001/1081-3810.1622.
  16. ^Serre, Denis (Oct 18, 2010)."Concavity of det1/n over HPDn".MathOverflow.
  17. ^Lang 1985, §VIII.2,Horn & Johnson 2018, Def. 1.2.3
  18. ^Horn & Johnson 2018, Observation 7.1.2, Theorem 7.2.5
  19. ^A proof can be found in the Appendix B ofKondratyuk, L. A.; Krivoruchenko, M. I. (1992). "Superconducting quark matter in SU(2) color group".Zeitschrift für Physik A.344 (1):99–115.Bibcode:1992ZPhyA.344...99K.doi:10.1007/BF01291027.S2CID 120467300.
  20. ^Horn & Johnson 2018, § 0.8.10
  21. ^Grattan-Guinness 2003, §6.6
  22. ^Cajori, F.A History of Mathematics p. 80
  23. ^abcCampbell, H: "Linear Algebra With Applications", pages 111–112. Appleton Century Crofts, 1971
  24. ^Eves 1990, p. 405
  25. ^A Brief History of Linear Algebra and Matrix Theory at:"A Brief History of Linear Algebra and Matrix Theory". Archived fromthe original on 10 September 2012. Retrieved24 January 2012.
  26. ^Kleiner 2007, p. 80
  27. ^Bourbaki (1994, p. 59)
  28. ^Muir, Sir Thomas,The Theory of Determinants in the historical Order of Development [London, England: Macmillan and Co., Ltd., 1906].JFM 37.0181.02
  29. ^Kleiner 2007, §5.2
  30. ^The first use of the word "determinant" in the modern sense appeared in: Cauchy, Augustin-Louis "Memoire sur les fonctions qui ne peuvent obtenir que deux valeurs égales et des signes contraires par suite des transpositions operées entre les variables qu'elles renferment," which was first read at the Institute de France in Paris on November 30, 1812, and which was subsequently published in theJournal de l'Ecole Polytechnique, Cahier 17, Tome 10, pages 29–112 (1815).
  31. ^Origins of mathematical terms:http://jeff560.tripod.com/d.html
  32. ^History of matrices and determinants:http://www-history.mcs.st-and.ac.uk/history/HistTopics/Matrices_and_determinants.html
  33. ^Eves 1990, p. 494
  34. ^Cajori 1993, Vol. II, p. 92, no. 462
  35. ^History of matrix notation:http://jeff560.tripod.com/matrices.html
  36. ^Habgood & Arel 2012
  37. ^Lang 1985, §VII.3
  38. ^Lang 2002, §IV.8
  39. ^Lang 1985, §VII.6, Theorem 6.10
  40. ^Lay, David (2021).Linear Algebra and Its Applications 6th Edition. Pearson. p. 172.
  41. ^Dummit & Foote 2004, §11.4
  42. ^Dummit & Foote 2004, §11.4, Theorem 30
  43. ^Mac Lane 1998, §I.4. See alsoNatural transformation § Determinant.
  44. ^Bourbaki 1998, §III.8
  45. ^Lombardi & Quitté 2015, §5.2,Bourbaki 1998, §III.5
  46. ^Garibaldi 2004
  47. ^In a non-commutative setting left-linearity (compatibility with left-multiplication by scalars) should be distinguished from right-linearity. Assuming linearity in the columns is taken to be left-linearity, one would have, for non-commuting scalarsa,b:ab=ab|1001|=a|100b|=|a00b|=b|a001|=ba|1001|=ba,{\displaystyle {\begin{aligned}ab&=ab{\begin{vmatrix}1&0\\0&1\end{vmatrix}}=a{\begin{vmatrix}1&0\\0&b\end{vmatrix}}\\[5mu]&={\begin{vmatrix}a&0\\0&b\end{vmatrix}}=b{\begin{vmatrix}a&0\\0&1\end{vmatrix}}=ba{\begin{vmatrix}1&0\\0&1\end{vmatrix}}=ba,\end{aligned}}}a contradiction. There is no useful notion of multi-linear functions over a non-commutative ring.
  48. ^Varadarajan, V. S (2004),Supersymmetry for mathematicians: An introduction, American Mathematical Soc.,ISBN 978-0-8218-3574-6.
  49. ^"... we mention that the determinant, though a convenient notion theoretically, rarely finds a useful role in numerical algorithms.", seeTrefethen & Bau III 1997, Lecture 1.
  50. ^Fisikopoulos & Peñaranda 2016, §1.1, §4.3
  51. ^Camarero, Cristóbal (2018-12-05). "Simple, Fast and Practicable Algorithms for Cholesky, LU and QR Decomposition Using Fast Rectangular Matrix Multiplication".arXiv:1812.02056 [cs.NA].
  52. ^Bunch & Hopcroft 1974
  53. ^Fisikopoulos & Peñaranda 2016, §1.1
  54. ^Rote 2001
  55. ^Fang, Xin Gui; Havas, George (1997)."On the worst-case complexity of integer Gaussian elimination"(PDF).Proceedings of the 1997 international symposium on Symbolic and algebraic computation. ISSAC '97. Kihei, Maui, Hawaii, United States: ACM. pp. 28–31.doi:10.1145/258726.258740.ISBN 0-89791-875-4. Archived fromthe original(PDF) on 2011-08-07. Retrieved2011-01-22.
  56. ^Fisikopoulos & Peñaranda 2016, §1.1,Bareiss 1968
  57. ^Abeles, Francine F. (2008)."Dodgson condensation: The historical and mathematical development of an experimental method".Linear Algebra and Its Applications.429 (2–3):429–438.doi:10.1016/j.laa.2007.11.022.

References

[edit]
See also:Linear algebra § Further reading

Historical references

[edit]

External links

[edit]
The WikibookLinear Algebra has a page on the topic of:Determinants
Basic concepts
Three dimensional Euclidean space
Matrices
Bilinear
Multilinear algebra
Vector space constructions
Numerical
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Determinant&oldid=1293285851"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp