Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Matrix (mathematics)

This is a good article. Click here for more information.
From Wikipedia, the free encyclopedia
Array of numbers

"Matrix theory" redirects here. For the physics topic, seeMatrix theory (physics).For other uses of "Matrix", seeMatrix (disambiguation).

Two tall square brackets with m-many rows each containing n-many subscripted letter 'a' variables. Each letter 'a' is given a row number and column number as its subscript.
Anm × n matrix: them rows are horizontal and then columns are vertical. Each element of a matrix is often denoted by a variable with twosubscripts. For example,a2,1 represents the element at the second row and first column of the matrix.

Inmathematics, amatrix (pl.:matrices) is arectangular array ofnumbers or othermathematical objects with elements or entries arranged in rows and columns, usually satisfying certain properties ofaddition andmultiplication.

For example,[19132056]{\displaystyle {\begin{bmatrix}1&9&-13\\20&5&-6\end{bmatrix}}}denotes a matrix with two rows and three columns. This is often referred to as a "two-by-three matrix", a2 × 3 matrix, or a matrix of dimension2 × 3.

Inlinear algebra, matrices are used aslinear maps. Ingeometry, matrices are used forgeometric transformations (for examplerotations) andcoordinate changes. Innumerical analysis, many computational problems are solved by reducing them to a matrix computation, and this often involves computing with matrices of huge dimensions. Matrices are used in most areas of mathematics and scientific fields, either directly, or through their use in geometry and numerical analysis.

Square matrices, matrices with the same number of rows and columns, play a major role in matrix theory. Thedeterminant of a square matrix is a number associated with the matrix, which is fundamental for the study of a square matrix; for example, a square matrix isinvertible if and only if it has a nonzero determinant and theeigenvalues of a square matrix are the roots of itscharacteristic polynomial,det(λIA){\displaystyle \det(\lambda I-A)}.

Matrix theory is thebranch of mathematics that focuses on the study of matrices. It was initially a sub-branch of linear algebra, but soon grew to include subjects related tograph theory,algebra,combinatorics andstatistics.

Definition

[edit]

A matrix is a rectangular array ofnumbers (or other mathematical objects), called the "entries" of the matrix. Matrices are subject to standardoperations such asaddition andmultiplication.[1] Most commonly, a matrix over afieldF{\displaystyle F} is a rectangular array ofelements ofF{\displaystyle F}.[2][3] Areal matrix and acomplex matrix are matrices whose entries are respectivelyreal numbers orcomplex numbers. More general types of entries are discussedbelow. For instance, this is a real matrix:A=[1.30.620.45.59.76.2].{\displaystyle \mathbf {A} ={\begin{bmatrix}-1.3&0.6\\20.4&5.5\\9.7&-6.2\end{bmatrix}}.}

The numbers (or other objects) in the matrix are called itsentries or itselements. The horizontal and vertical lines of entries in a matrix are respectively calledrows andcolumns.[4]

Size

[edit]

The size of a matrix is defined by the number of rows and columns it contains. There is no limit to the number of rows and columns that a matrix (in the usual sense) can have as long as they are positive integers. A matrix withm rows andn columns is called anm × n matrix,[4] orm-by-n matrix,[5] wherem andn are called itsdimensions.[6] For example, the matrixA{\displaystyle {\mathbf {A} }} above is a3 × 2 matrix.

Matrices with a single row are calledrow matrices orrow vectors, and those with a single column are calledcolumn matrices orcolumn vectors. A matrix with the same number of rows and columns is called asquare matrix.[7] A matrix with an infinite number of rows or columns (or both) is called an infinite matrix. In some contexts, such ascomputer algebra programs, it is useful to consider a matrix with no rows or no columns, called an empty matrix.[8]

Overview of a matrix size
NameSizeExampleDescription
Row matrix1×n{\displaystyle 1\times n}[372]{\displaystyle {\begin{bmatrix}3&7&2\end{bmatrix}}}A matrix with one row and more than one columns, sometimes used to represent a vector
Column matrixn×1{\displaystyle n\times 1}[418]{\displaystyle {\begin{bmatrix}4\\1\\8\end{bmatrix}}}A matrix with one column and more than one rows, sometimes used to represent a vector
Square matrixn×n{\displaystyle n\times n}[91351117263]{\displaystyle {\begin{bmatrix}9&13&5\\1&11&7\\2&6&3\end{bmatrix}}}A matrix with the same number of rows and columns, sometimes used to represent alinear transformation from a vector space to itself, such asreflection,rotation, orshearing.

Notation

[edit]

The specifics of symbolic matrix notation vary widely, with some prevailing trends. Matrices are commonly written insquare brackets orparentheses,[9] so that anm × n matrixA{\displaystyle \mathbf {A} } is represented asA=[a11a12a1na21a22a2nam1am2amn]=(a11a12a1na21a22a2nam1am2amn).{\displaystyle \mathbf {A} ={\begin{bmatrix}a_{11}&a_{12}&\cdots &a_{1n}\\a_{21}&a_{22}&\cdots &a_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}&a_{m2}&\cdots &a_{mn}\end{bmatrix}}={\begin{pmatrix}a_{11}&a_{12}&\cdots &a_{1n}\\a_{21}&a_{22}&\cdots &a_{2n}\\\vdots &\vdots &\ddots &\vdots \\a_{m1}&a_{m2}&\cdots &a_{mn}\end{pmatrix}}.}This may be abbreviated by writing only a single generic term, possibly along with indices, as inA=(aij),[aij],or(aij)1im,1jn{\displaystyle \mathbf {A} =\left(a_{ij}\right),\quad \left[a_{ij}\right],\quad {\text{or}}\quad \left(a_{ij}\right)_{1\leq i\leq m,\;1\leq j\leq n}}orA=(ai,j)1i,jn{\displaystyle \mathbf {A} =(a_{i,j})_{1\leq i,j\leq n}} in the case thatn=m{\displaystyle n=m}.

Matrices are usually symbolized usingupper-case letters (such asA{\displaystyle {\mathbf {A} }} in the examples above),[10] while the correspondinglower-case letters, with two subscript indices (e.g.,a11{\displaystyle a_{11}}, ora1,1{\displaystyle a_{1,1}}), represent the entries.[11] In addition to using upper-case letters to symbolize matrices, many authors use a specialtypographical style, commonly boldface roman (non-italic), to further distinguish matrices from other mathematical objects. An alternative notation involves the use of a double-underline with the variable name, with or without boldface style, as inA__{\displaystyle {\underline {\underline {A}}}}.[12]

The entry in theith row andjth column of a matrixA is sometimes referred to as thei,j{\displaystyle {i,j}} or(i,j){\displaystyle (i,j)} entry of the matrix, and commonly denoted byai,j{\displaystyle a_{i,j}} oraij{\displaystyle a_{ij}}.[13] Alternative notations for that entry areA[i,j]{\displaystyle {\mathbf {A} [i,j]}} andAi,j{\displaystyle \mathbf {A} _{i,j}}. For example, the(1,3){\displaystyle (1,3)} entry of the following matrixA{\displaystyle \mathbf {A} } is5 (also denoteda13{\displaystyle a_{13}},a1,3{\displaystyle a_{1,3}},A[1,3]{\displaystyle \mathbf {A} [1,3]} orA1,3{\displaystyle {\mathbf {A} }_{1,3}}):A=[475020118191312]{\displaystyle \mathbf {A} ={\begin{bmatrix}4&-7&\color {red}{5}&0\\-2&0&11&8\\19&1&-3&12\end{bmatrix}}}

Sometimes, the entries of a matrix can be defined by a formula such asai,j=f(i,j){\displaystyle a_{i,j}=f(i,j)}. For example, each of the entries of the following matrixA{\displaystyle \mathbf {A} } is determined by the formulaaij=ij{\displaystyle a_{ij}=i-j}.A=[012310122101]{\displaystyle \mathbf {A} ={\begin{bmatrix}0&-1&-2&-3\\1&0&-1&-2\\2&1&0&-1\end{bmatrix}}}In this case, the matrix itself is sometimes defined by that formula, within square brackets or double parentheses. For example, the matrix above is defined asA=[ij]{\displaystyle {\mathbf {A} }=[i-j]} orA=((ij)){\displaystyle \mathbf {A} =((i-j))}. If matrix size ism × n, the above-mentioned formulaf(i,j){\displaystyle f(i,j)} is valid for anyi=1,,m{\displaystyle i=1,\dots ,m} and anyj=1,,n{\displaystyle j=1,\dots ,n}. This can be specified separately or indicated usingm × n as a subscript. For instance, the matrixA{\displaystyle \mathbf {A} } above is3 × 4, and can be defined asA=[ij](i=1,2,3;j=1,,4){\displaystyle {\mathbf {A} }=[i-j](i=1,2,3;j=1,\dots ,4)} orA=[ij]3×4{\displaystyle \mathbf {A} =[i-j]_{3\times 4}}.

Some programming languages utilize doubly subscripted arrays (or arrays of arrays) to represent an {m-by-n matrix. Some programming languages start the numbering of array indexes at zero, in which case the entries of anm × n matrix are indexed by0im1{\displaystyle 0\leq i\leq m-1} and0jn1{\displaystyle 0\leq j\leq n-1}.[14] This article follows the more common convention in mathematical writing where enumeration starts from1.

Theset of allm-by-n real matrices is often denotedM(m,n){\displaystyle {\mathcal {M}}(m,n)}, orMm×n(R){\displaystyle {\mathcal {M}}_{m\times n}(\mathbb {R} )}. The set of allm × n matrices over anotherfield, or over aringR, is similarly denotedM(m,n,R){\displaystyle {\mathcal {M}}(m,n,R)}, orMm×n(R){\displaystyle {\mathcal {M}}_{m\times n}(R)}. Ifm =n, such as in the case ofsquare matrices, one does not repeat the dimension:M(n,R){\displaystyle {\mathcal {M}}(n,R)}, orMn(R){\displaystyle {\mathcal {M}}_{n}(R)}.[15] Often,M{\displaystyle M}, orMat{\displaystyle \operatorname {Mat} }, is used in place ofM{\displaystyle {\mathcal {M}}}.[16]

Basic operations

[edit]

Several basic operations can be applied to matrices. Some, such astransposition andsubmatrix do not depend on the nature of the entries. Others, such asmatrix addition,scalar multiplication,matrix multiplication, androw operations involve operations on matrix entries and therefore require that matrix entries are numbers or belong to afield or aring.[17]

In this section, it is supposed that matrix entries belong to a fixed ring, which is typically a field of numbers.

Addition

[edit]
Main article:Matrix addition
Illustration of the addition of two matrices.

Matrix addition and subtraction require matrices of a consistent size, and are calculated entrywise. ThesumA +B and the differenceAB of twom × n matrices are:[18]

(A+B)i,j=Ai,j+Bi,j,1im,1jn.(AB)i,j=Ai,jBi,j,1im,1jn.{\displaystyle {\begin{aligned}({\mathbf {A}}+{\mathbf {B}})_{i,j}={\mathbf {A}}_{i,j}+{\mathbf {B}}_{i,j},\quad 1\leq i\leq m,\quad 1\leq j\leq n.\\({\mathbf {A}}-{\mathbf {B}})_{i,j}={\mathbf {A}}_{i,j}-{\mathbf {B}}_{i,j},\quad 1\leq i\leq m,\quad 1\leq j\leq n.\end{aligned}}}

For example,

[131100]+[005750]=[1+03+01+51+70+50+0]=[136850]{\displaystyle {\begin{bmatrix}1&3&1\\1&0&0\end{bmatrix}}+{\begin{bmatrix}0&0&5\\7&5&0\end{bmatrix}}={\begin{bmatrix}1+0&3+0&1+5\\1+7&0+5&0+0\end{bmatrix}}={\begin{bmatrix}1&3&6\\8&5&0\end{bmatrix}}}

Familiar properties of numbers extend to these operations on matrices: for example, addition iscommutative, that is, the matrix sum does not depend on the order of the summands:A +B =B +A.[19]

Scalar multiplication

[edit]
Main article:Scalar multiplication

The productcA of a numberc (also called ascalar in this context) and a matrixA is computed by multiplying each entry ofA byc:[20](cA)i,j=cAi,j{\displaystyle (c{\mathbf {A}})_{i,j}=c\cdot {\mathbf {A}}_{i,j}}This operation is calledscalar multiplication, but its result is not named "scalar product" to avoid confusion, since "scalar product" is often used as a synonym for "inner product".[21] For example:

2[183425]=[212823242225]=[21668410]{\displaystyle 2\cdot {\begin{bmatrix}1&8&-3\\4&-2&5\end{bmatrix}}={\begin{bmatrix}2\cdot 1&2\cdot 8&2\cdot -3\\2\cdot 4&2\cdot -2&2\cdot 5\end{bmatrix}}={\begin{bmatrix}2&16&-6\\8&-4&10\end{bmatrix}}}

Matrix subtraction is consistent with composition of matrix addition with scalar multiplication by–1:[22]

AB=A+(1)B{\displaystyle \mathbf {A} -\mathbf {B} =\mathbf {A} +(-1)\cdot \mathbf {B} }

Transpose

[edit]
Main article:Transpose

Thetranspose of anm × n matrixA is then × m matrixAT (also denotedAtr ortA) formed by turning rows into columns and vice versa:(AT)i,j=Aj,i.{\displaystyle \left({\mathbf {A}}^{\rm {T}}\right)_{i,j}={\mathbf {A}}_{j,i}.}For example:[123067]T=[102637]{\displaystyle {\begin{bmatrix}1&2&3\\0&-6&7\end{bmatrix}}^{\mathrm {T} }={\begin{bmatrix}1&0\\2&-6\\3&7\end{bmatrix}}}

The transpose is compatible with addition and scalar multiplication, as expressed by(cA)T =c(AT) and(A +B)T =AT +BT. Finally,(AT)T =A.[23]

Matrix multiplication

[edit]
Main article:Matrix multiplication
Schematic depiction of the matrix productAB of two matricesA andB

Multiplication of two matrices corresponds to the composition oflinear transformations represented by each matrix. It is defined if and only if the number of columns of the left matrix is the same as the number of rows of the right matrix. IfA is anm × n matrix andB is ann × p matrix, then theirmatrix productAB is them × p matrix whose entries are given by thedot product of the corresponding row ofA and the corresponding column ofB:[24][AB]i,j=ai,1b1,j+ai,2b2,j++ai,nbn,j=r=1nai,rbr,j,{\displaystyle [\mathbf {AB} ]_{i,j}=a_{i,1}b_{1,j}+a_{i,2}b_{2,j}+\cdots +a_{i,n}b_{n,j}=\sum _{r=1}^{n}a_{i,r}b_{r,j},}where1 ≤im and1 ≤jp.[25] For example, the underlined entry 2340 in the product is calculated as(2 × 1000) + (3 × 100) + (4 × 10) = 2340:[2_3_4_100][01000_1100_010_]=[32340_01000].{\displaystyle {\begin{aligned}{\begin{bmatrix}{\underline {2}}&{\underline {3}}&{\underline {4}}\\1&0&0\\\end{bmatrix}}{\begin{bmatrix}0&{\underline {1000}}\\1&{\underline {100}}\\0&{\underline {10}}\\\end{bmatrix}}&={\begin{bmatrix}3&{\underline {2340}}\\0&1000\\\end{bmatrix}}.\end{aligned}}}

Matrix multiplication satisfies the rules(AB)C =A(BC) (associativity), and(A +B)C =AC +BC as well asC(A +B) =CA +CB (left and rightdistributivity), whenever the size of the matrices is such that the various products are defined.[26] The productAB may be defined withoutBA being defined, namely ifA andB arem × n andn × k matrices, respectively, andmk. Even if both products are defined, they generally need not be equal, that is:[27]ABBA.{\displaystyle {\mathbf {AB}}\neq {\mathbf {BA}}.}

In other words,matrix multiplication is notcommutative, in marked contrast to (rational, real, or complex) numbers, whose product is independent of the order of the factors.[24] An example of two matrices not commuting with each other is:[1234][0100]=[0103],{\displaystyle {\begin{bmatrix}1&2\\3&4\\\end{bmatrix}}{\begin{bmatrix}0&1\\0&0\\\end{bmatrix}}={\begin{bmatrix}0&1\\0&3\\\end{bmatrix}},}whereas[0100][1234]=[3400].{\displaystyle {\begin{bmatrix}0&1\\0&0\\\end{bmatrix}}{\begin{bmatrix}1&2\\3&4\\\end{bmatrix}}={\begin{bmatrix}3&4\\0&0\\\end{bmatrix}}.}

Besides the ordinary matrix multiplication just described, other less frequently used operations on matrices that can be considered forms of multiplication also exist, such as theHadamard product and theKronecker product.[28] They arise in solving matrix equations such as theSylvester equation.[29]

Row operations

[edit]
Main article:Row operations

There are three types of row operations:[30][31]

  1. row addition, that is, adding a row to another.
  2. row multiplication, that is, multiplying all entries of a row by a non-zero constant;
  3. row switching, that is, interchanging two rows of a matrix;

These operations are used in several ways, including solvinglinear equations and findingmatrix inverses withGauss elimination and Gauss–Jordan elimination, respectively.[32]

Submatrix

[edit]

Asubmatrix of a matrix is a matrix obtained by deleting any collection of rows or columns or both.[33][34][35] For example, from the following3 × 4 matrix, we can construct a2 × 3 submatrix by removing row 3 and column 2:A=[123456789101112][134578].{\displaystyle \mathbf {A} ={\begin{bmatrix}1&\color {red}{2}&3&4\\5&\color {red}{6}&7&8\\\color {red}{9}&\color {red}{10}&\color {red}{11}&\color {red}{12}\end{bmatrix}}\rightarrow {\begin{bmatrix}1&3&4\\5&7&8\end{bmatrix}}.}

Theminors and cofactors of a matrix are found by computing thedeterminant of certain submatrices.[35][36]

Aprincipal submatrix is a square submatrix obtained by removing certain rows and columns. The definition varies from author to author. According to some authors, a principal submatrix is a submatrix in which the set of row indices that remain is the same as the set of column indices that remain.[37][38] Other authors define a principal submatrix as one in which the firstk rows and columns, for some numberk, are the ones that remain;[39] this type of submatrix has also been called aleading principal submatrix.[40]

Linear equations

[edit]
Main articles:Linear equation,System of linear equations, andCoefficient matrix

Matrices can be used to compactly write and work with multiple linear equations, that is, systems of linear equations. For example, ifA is anm × n matrix,x designates a column vector (that is,n × 1 matrix) ofn variablesx1,x2, ...,xn, andb is anm × 1 column vector, then the matrix equationAx=b{\displaystyle \mathbf {Ax} =\mathbf {b} }is equivalent to the system of linear equations[41]a1,1x1+a1,2x2++a1,nxn=b1  am,1x1+am,2x2++am,nxn=bm{\displaystyle {\begin{aligned}a_{1,1}x_{1}+a_{1,2}x_{2}+&\cdots +a_{1,n}x_{n}=b_{1}\\&\ \ \vdots \\a_{m,1}x_{1}+a_{m,2}x_{2}+&\cdots +a_{m,n}x_{n}=b_{m}\end{aligned}}}

Using matrices, this can be solved more compactly than would be possible by writing out all the equations separately. Ifn =m and the equations areindependent, then this can be done by writing[42]x=A1b{\displaystyle \mathbf {x} =\mathbf {A} ^{-1}\mathbf {b} }whereA−1 is theinverse matrix ofA. IfA has no inverse, solutions—if any—can be found using itsgeneralized inverse.[43]

Linear transformations

[edit]
Main articles:Linear transformation andTransformation matrix
The vectors represented by a2 × 2 matrix correspond to the sides of a unit square transformed into a parallelogram.

Matrices and matrix multiplication reveal their essential features when related tolinear transformations, also known aslinear maps.A realm-by-n matrixA gives rise to a linear transformationRnRm{\displaystyle \mathbb {R} ^{n}\to \mathbb {R} ^{m}} mapping each vectorx inRn{\displaystyle \mathbb {R} ^{n}} to the (matrix) productAx, which is a vector inRm.{\displaystyle \mathbb {R} ^{m}.} Conversely, each linear transformationf:RnRm{\displaystyle f:\mathbb {R} ^{n}\to \mathbb {R} ^{m}} arises from a uniquem-by-n matrixA: explicitly, the(i,j)-entry ofA is theith coordinate off (ej), whereej = (0, ..., 0, 1, 0, ..., 0) is theunit vector with1 in thejth position and0 elsewhere. The matrixA is said to represent the linear mapf, andA is called thetransformation matrix off.[44]

For example, the2 × 2 matrixA=[acbd]{\displaystyle \mathbf {A} ={\begin{bmatrix}a&c\\b&d\end{bmatrix}}}can be viewed as the transform of theunit square into aparallelogram with vertices at(0, 0),(a,b),(a +c,b +d), and(c,d). The parallelogram pictured at the right is obtained by multiplyingA with each of the column vectors[00]{\displaystyle \left[{\begin{smallmatrix}0\\0\end{smallmatrix}}\right]},[10]{\displaystyle \left[{\begin{smallmatrix}1\\0\end{smallmatrix}}\right]},[11]{\displaystyle \left[{\begin{smallmatrix}1\\1\end{smallmatrix}}\right]}, and[01]{\displaystyle \left[{\begin{smallmatrix}0\\1\end{smallmatrix}}\right]} in turn. These vectors define the vertices of the unit square.[45] The following table shows several2 × 2 real matrices with the associated linear maps ofR2{\displaystyle \mathbb {R} ^{2}}. Theblue original is mapped to thegreen grid and shapes. The origin(0, 0) is marked with a black point.

Horizontal shear[46]
withm = 1.25.
Reflection[47] through the vertical axisSqueeze mapping[48]
withr = 3/2
Scaling[49]
by a factor of 3/2
Rotation[48]
byπ/6 = 30°
[11.2501]{\displaystyle {\begin{bmatrix}1&1.25\\0&1\end{bmatrix}}}[1001]{\displaystyle {\begin{bmatrix}-1&0\\0&1\end{bmatrix}}}[320023]{\displaystyle {\begin{bmatrix}{\frac {3}{2}}&0\\0&{\frac {2}{3}}\end{bmatrix}}}[320032]{\displaystyle {\begin{bmatrix}{\frac {3}{2}}&0\\0&{\frac {3}{2}}\end{bmatrix}}}[cos(π6)sin(π6)sin(π6)cos(π6)]{\displaystyle {\begin{bmatrix}\cos \left({\frac {\pi }{6}}\right)&-\sin \left({\frac {\pi }{6}}\right)\\\sin \left({\frac {\pi }{6}}\right)&\cos \left({\frac {\pi }{6}}\right)\end{bmatrix}}}

Under the1-to-1 correspondence between matrices and linear maps, matrix multiplication corresponds tocomposition of maps:[50] if ak-by-m matrixB represents another linear mapg:RmRk{\displaystyle g:\mathbb {R} ^{m}\to \mathbb {R} ^{k}}, then the compositiongf is represented byBA since[51](gf)(x)=g(f(x))=g(Ax)=B(Ax)=(BA)x.{\displaystyle (g\circ f)({\mathbf {x}})=g(f({\mathbf {x}}))=g({\mathbf {Ax}})={\mathbf {B}}({\mathbf {Ax}})=({\mathbf {BA}}){\mathbf {x}}.}

The last equality follows from the above-mentioned associativity of matrix multiplication.

Therank of a matrixA is the maximum number oflinearly independent row vectors of the matrix, which is the same as the maximum number of linearly independent column vectors.[52] Equivalently it is thedimension of theimage of the linear map represented byA.[53] Therank–nullity theorem states that the dimension of thekernel of a matrix plus the rank equals the number of columns of the matrix.[54]

Square matrix

[edit]
Main article:Square matrix

Asquare matrix is a matrix with the same number of rows and columns. Ann-by-n matrix is known as a square matrix of ordern. Any two square matrices of the same order can be added and multiplied.The entriesaii form themain diagonal of a square matrix. They lie on the imaginary line running from the top left corner to the bottom right corner of the matrix.[55]

Square matrices of a given dimension form anoncommutative ring, which is one of the most common examples of a noncommutative ring.[56]

Main types

[edit]
NameExample withn = 3
Diagonal matrix[a11000a22000a33]{\displaystyle {\begin{bmatrix}a_{11}&0&0\\0&a_{22}&0\\0&0&a_{33}\\\end{bmatrix}}}
Lower triangular matrix[a1100a21a220a31a32a33]{\displaystyle {\begin{bmatrix}a_{11}&0&0\\a_{21}&a_{22}&0\\a_{31}&a_{32}&a_{33}\\\end{bmatrix}}}
Upper triangular matrix[a11a12a130a22a2300a33]{\displaystyle {\begin{bmatrix}a_{11}&a_{12}&a_{13}\\0&a_{22}&a_{23}\\0&0&a_{33}\\\end{bmatrix}}}

Diagonal and triangular matrix

[edit]

If all entries ofA below the main diagonal are zero,A is called anuppertriangular matrix. Similarly, if all entries ofA above the main diagonal are zero,A is called alower triangular matrix.[57] If all entries outside the main diagonal are zero,A is called adiagonal matrix.[58]

Identity matrix

[edit]
Main article:Identity matrix

Theidentity matrixIn of sizen is then-by-n matrix in which all the elements on themain diagonal are equal to1 and all other elements are equal to0,[59] for example,I1=[1],I2=[1001],In=[100010001]{\displaystyle {\begin{aligned}\mathbf {I} _{1}&={\begin{bmatrix}1\end{bmatrix}},\\[4pt]\mathbf {I} _{2}&={\begin{bmatrix}1&0\\0&1\end{bmatrix}},\\[4pt]\vdots &\\[4pt]\mathbf {I} _{n}&={\begin{bmatrix}1&0&\cdots &0\\0&1&\cdots &0\\\vdots &\vdots &\ddots &\vdots \\0&0&\cdots &1\end{bmatrix}}\end{aligned}}}It is a square matrix of ordern, and also a special kind ofdiagonal matrix. It is called an identity matrix because multiplication with it leaves a matrix unchanged:[59]AIn=ImA=A{\displaystyle {\mathbf {AI}}_{n}={\mathbf {I}}_{m}{\mathbf {A}}={\mathbf {A}}}for anym-by-n matrixA.

A scalar multiple of an identity matrix is called ascalar matrix.[60]

Symmetric or skew-symmetric matrix

[edit]

A square matrixA that is equal to its transpose, that is,A =AT, is asymmetric matrix. If instead,A is equal to the negative of its transpose, that is,A = −AT, thenA is askew-symmetric matrix. In complex matrices, symmetry is often replaced by the concept ofHermitian matrices, which satisfiesA =A, where the star orasterisk denotes theconjugate transpose of the matrix, that is, the transpose of thecomplex conjugate ofA.[61]

By thespectral theorem, real symmetric matrices and complex Hermitian matrices have aneigenbasis; that is, every vector is expressible as alinear combination of eigenvectors. In both cases, all eigenvalues are real.[62] This theorem can be generalized to infinite-dimensional situations related to matrices with infinitely many rows and columns.[63]

Invertible matrix and its inverse

[edit]

A square matrixA is calledinvertible ornon-singular if there exists a matrixB such that[64][65]AB=BA=In,{\displaystyle {\mathbf {AB}}={\mathbf {BA}}={\mathbf {I}}_{n},}whereIn is then × nidentity matrix with1 for each entry on themain diagonal and0 elsewhere. IfB exists, it is unique and is called theinverse matrix ofA, denotedA−1.[66]

There are manyalgorithms for testing whether a square matrix is invertible, and, if it is, computing its inverse. One of the oldest, which is still in common use isGaussian elimination.[67]

Definite matrix

[edit]
Positive definite matrixIndefinite matrix
[14001]{\displaystyle {\begin{bmatrix}{\frac {1}{4}}&0\\0&1\\\end{bmatrix}}}[140014]{\displaystyle {\begin{bmatrix}{\frac {1}{4}}&0\\0&-{\frac {1}{4}}\end{bmatrix}}}
Q(x,y)=14x2+y2{\displaystyle Q(x,y)={\frac {1}{4}}x^{2}+y^{2}}Q(x,y)=14x214y2{\displaystyle Q(x,y)={\frac {1}{4}}x^{2}-{\frac {1}{4}}y^{2}}

Points such thatQ(x,y)=1{\textstyle Q(x,y)=1}
(Ellipse)

Points such thatQ(x,y)=1{\textstyle Q(x,y)=1}
(Hyperbola)

A symmetric real matrixA is calledpositive-definite if the associatedquadratic formf(x)=xTAx{\displaystyle f({\mathbf {x}})={\mathbf {x}}^{\rm {T}}{\mathbf {Ax}}}has a positive value for every nonzero vectorx inRn{\displaystyle \mathbb {R} ^{n}}. Iff(x) yields only negative values thenA isnegative-definite; iff does produce both negative and positive values thenA is indefinite.[68] If the quadratic formf yields only non-negative values (positive or zero), the symmetric matrix is calledpositive-semidefinite (or if only non-positive values, then negative-semidefinite); hence the matrix is indefinite precisely when it is neither positive-semidefinite nor negative-semidefinite.[69]

A symmetric matrix is positive-definite if and only if all its eigenvalues are positive, that is, the matrix is positive-semidefinite and it is invertible.[70] The table at the right shows two possibilities for 2-by-2 matrices. The eigenvalues of a diagonal matrix are simply the entries along the diagonal,[71] and so in these examples, the eigenvalues can be read directly from the matrices themselves. The first matrix has two eigenvalues that are both positive, while the second has one that is positive and another that is negative.

Allowing as input two different vectors instead yields thebilinear form associated toA:[72]BA(x,y)=xTAy.{\displaystyle B_{\mathbf {A}}({\mathbf {x}},{\mathbf {y}})={\mathbf {x}}^{\rm {T}}{\mathbf {Ay}}.}

In the case of complex matrices, the same terminology and results apply, withsymmetric matrix,quadratic form,bilinear form, andtransposexT replaced respectively byHermitian matrix,Hermitian form,sesquilinear form, andconjugate transposexH.[73]

Orthogonal matrix

[edit]
Main article:Orthogonal matrix

Anorthogonal matrix is a square matrix withreal entries whose columns and rows areorthogonalunit vectors (that is,orthonormal vectors).[74] Equivalently, a matrixA is orthogonal if itstranspose is equal to itsinverse:AT=A1,{\displaystyle \mathbf {A} ^{\mathrm {T} }=\mathbf {A} ^{-1},\,}which entailsATA=AAT=In,{\displaystyle \mathbf {A} ^{\mathrm {T} }\mathbf {A} =\mathbf {A} \mathbf {A} ^{\mathrm {T} }=\mathbf {I} _{n},}whereIn is theidentity matrix of sizen.[75]

An orthogonal matrixA is necessarilyinvertible (with inverseA−1 =AT),unitary (A−1 =A*), andnormal (A*A =AA*). Thedeterminant of any orthogonal matrix is either+1 or−1. Aspecial orthogonal matrix is an orthogonal matrix withdeterminant+1. As alinear transformation, every orthogonal matrix with determinant+1 is a purerotation without reflection, i.e., the transformation preserves the orientation of the transformed structure, while every orthogonal matrix with determinant−1 reverses the orientation, i.e., is a composition of a purereflection and a (possibly null) rotation. The identity matrices have determinant1 and are pure rotations by an angle zero.[76]

Thecomplex analog of an orthogonal matrix is aunitary matrix.[77]

Main operations

[edit]

Trace

[edit]

Thetrace,tr(A) of a square matrixA is the sum of its diagonal entries. While matrix multiplication is not commutative as mentionedabove, the trace of the product of two matrices is independent of the order of the factors:[78]tr(AB)=tr(BA).{\displaystyle \operatorname {tr} (\mathbf {AB} )=\operatorname {tr} (\mathbf {BA} ).}This is immediate from the definition of matrix multiplication:[79]tr(AB)=i=1mj=1naijbji=tr(BA).{\displaystyle \operatorname {tr} (\mathbf {AB} )=\sum _{i=1}^{m}\sum _{j=1}^{n}a_{ij}b_{ji}=\operatorname {tr} (\mathbf {BA} ).}It follows that the trace of the product of more than two matrices is independent ofcyclic permutations of the matrices; however, this does not in general apply for arbitrary permutations. For example,tr(ABC) ≠ tr(BAC), in general.[80] Also, the trace of a matrix is equal to that of its transpose,[81] that is,tr(A)=tr(AT).{\displaystyle \operatorname {tr} ({\mathbf {A}})=\operatorname {tr} ({\mathbf {A}}^{\rm {T}}).}

Determinant

[edit]
Main article:Determinant
A linear transformation onR2{\displaystyle \mathbb {R} ^{2}} given by the indicated matrix. The determinant of this matrix is−1, as the area of the green parallelogram at the right is1, but the map reverses theorientation, since it turns the counterclockwise orientation of the vectors to a clockwise one.

Thedeterminant of a square matrixA (denoteddet(A) or|A|) is a number encoding certain properties of the matrix. A matrix is invertibleif and only if its determinant is nonzero.[82] Itsabsolute value equals the area (inR2{\displaystyle \mathbb {R} ^{2}}) or volume (inR3{\displaystyle \mathbb {R} ^{3}}) of the image of the unit square (or cube), while its sign corresponds to the orientation of the corresponding linear map: the determinant is positive if and only if the orientation is preserved.[83]

The determinant of2 × 2 matrices is given by[84]det[abcd]=adbc.{\displaystyle \det {\begin{bmatrix}a&b\\c&d\end{bmatrix}}=ad-bc.}The determinant of3 × 3 matrices involves six terms (rule of Sarrus). The more lengthyLeibniz formula generalizes these two formulae to all dimensions.[85]

The determinant of a product of square matrices equals the product of their determinants:det(AB)=det(A)det(B),{\displaystyle \det({\mathbf {AB}})=\det({\mathbf {A}})\cdot \det({\mathbf {B}}),}or using alternate notation:[86]|AB|=|A||B|.{\displaystyle |{\mathbf {AB}}|=|{\mathbf {A}}|\cdot |{\mathbf {B}}|.}Adding a multiple of any row to another row, or a multiple of any column to another column, does not change the determinant. Interchanging two rows or two columns affects the determinant by multiplying it by−1.[87] Using these operations, any matrix can be transformed to a lower (or upper) triangular matrix, and for such matrices, the determinant equals the product of the entries on the main diagonal; this provides a method to calculate the determinant of any matrix. Finally, theLaplace expansion expresses the determinant in terms ofminors, that is, determinants of smaller matrices.[88] This expansion can be used for a recursive definition of determinants (taking as starting case the determinant of a1 × 1 matrix, which is its unique entry, or even the determinant of a0 × 0 matrix, which is1), that can be seen to be equivalent to the Leibniz formula. Determinants can be used to solvelinear systems usingCramer's rule, where the division of the determinants of two related square matrices equates to the value of each of the system's variables.[89]

Eigenvalues and eigenvectors

[edit]
Main article:Eigenvalues and eigenvectors

A numberλ{\textstyle \lambda } and a nonzero vectorv satisfyingAv=λv{\displaystyle \mathbf {A} \mathbf {v} =\lambda \mathbf {v} }are called aneigenvalue and aneigenvector ofA, respectively.[90][91] The numberλ is an eigenvalue of ann × n matrixA if and only if(AλIn) is not invertible, which isequivalent to[92]det(AλI)=0.{\displaystyle \det(\mathbf {A} -\lambda \mathbf {I} )=0.}The polynomialpA in anindeterminateX given by evaluation of the determinantdet(XInA) is called thecharacteristic polynomial ofA. It is amonic polynomial ofdegreen. Therefore the polynomial equationpA(λ) = 0 has at mostn different solutions, that is, eigenvalues of the matrix.[93] They may be complex even if the entries ofA are real.[94] According to theCayley–Hamilton theorem,pA(A) =0, that is, the result of substituting the matrix itself into its characteristic polynomial yields thezero matrix.[95]

Computational aspects

[edit]

Matrix calculations can be often performed with different techniques. Many problems can be solved by both direct algorithms and iterative approaches. For example, the eigenvectors of a square matrix can be obtained by finding asequence of vectorsxnconverging to an eigenvector whenn tends toinfinity.[96]

To choose the most appropriate algorithm for each specific problem, it is important to determine both the effectiveness and precision of all the available algorithms. The domain studying these matters is callednumerical linear algebra.[97] As with other numerical situations, two main aspects are thecomplexity of algorithms and theirnumerical stability.

Determining the complexity of an algorithm means findingupper bounds or estimates of how many elementary operations such as additions and multiplications of scalars are necessary to perform some algorithm, for example, multiplication of matrices. Calculating the matrix product of twon-by-n matrices using the definition given above needsn3 multiplications, since for any of then2 entries of the product,n multiplications are necessary. TheStrassen algorithm outperforms this "naive" algorithm; it needs onlyn2.807 multiplications.[98] Theoretically faster but impracticalmatrix multiplication algorithms have been developed,[99] as have speedups to this problem usingparallel algorithms ordistributed computation systems such asMapReduce.[100]

In many practical situations, additional information about the matrices involved is known. An important case concernssparse matrices, that is, matrices whose entries are mostly zero. There are specifically adapted algorithms for, say, solving linear systemsAx =b for sparse matricesA, such as theconjugate gradient method.[101]

An algorithm is, roughly speaking, numerically stable if little deviations in the input values do not lead to big deviations in the result. For example, one can calculate the inverse of a matrix by computing itsadjugate matrix:A1=adj(A)/det(A).{\displaystyle {\mathbf {A}}^{-1}=\operatorname {adj} ({\mathbf {A}})/\det({\mathbf {A}}).}However, this may lead to significant rounding errors if the determinant of the matrix is very small. Thenorm of a matrix can be used to capture theconditioning of linear algebraic problems, such as computing a matrix's inverse.[102]

Decomposition

[edit]
Main articles:Matrix decomposition,Matrix diagonalization,Gaussian elimination, andBareiss algorithm

There are several methods to render matrices into a more easily accessible form. They are generally referred to asmatrix decomposition ormatrix factorization techniques. These techniques are of interest because they can make computations easier.

TheLU decomposition factors matrices as a product of lower (L) and an uppertriangular matrices (U).[103] Once this decomposition is calculated, linear systems can be solved more efficiently by a simple technique calledforward and back substitution. Likewise, inverses of triangular matrices are algorithmically easier to calculate. TheGaussian elimination is a similar algorithm; it transforms any matrix torow echelon form.[104] Both methods proceed by multiplying the matrix by suitableelementary matrices, which correspond topermuting rows or columns and adding multiples of one row to another row.Singular value decomposition (SVD) expresses any matrixA as a productUDV, whereU andV areunitary matrices andD is a diagonal matrix.[105]

An example of a matrix in Jordan normal form. The grey blocks are called Jordan blocks.

Theeigendecomposition ordiagonalization expressesA as a productVDV−1, whereD is a diagonal matrix andV is a suitable invertible matrix.[106] IfA can be written in this form, it is calleddiagonalizable. More generally, and applicable to all matrices, the Jordan decomposition transforms a matrix intoJordan normal form, that is to say matrices whose only nonzero entries are the eigenvaluesλ1 toλn ofA, placed on the main diagonal and possibly entries equal to one directly above the main diagonal, as shown at the right.[107] Given the eigendecomposition, thenth power ofA (that is,n-fold iterated matrix multiplication) can be calculated viaAn=(VDV1)n=VDV1VDV1VDV1=VDnV1{\displaystyle {\mathbf {A}}^{n}=({\mathbf {VDV}}^{-1})^{n}={\mathbf {VDV}}^{-1}{\mathbf {VDV}}^{-1}\ldots {\mathbf {VDV}}^{-1}={\mathbf {VD}}^{n}{\mathbf {V}}^{-1}}and the power of a diagonal matrix can be calculated by taking the corresponding powers of the diagonal entries, which is much easier than doing the exponentiation forA instead. This can be used to compute thematrix exponentialeA, a need frequently arising in solvinglinear differential equations,matrix logarithms andsquare roots of matrices.[108] To avoid numericallyill-conditioned situations, further algorithms such as theSchur decomposition can be employed.[109]

Abstract algebraic aspects and generalizations

[edit]

Matrices can be generalized in different ways. Abstract algebra uses matrices with entries in more generalfields or evenrings, while linear algebra codifies properties of matrices in the notion of linear maps. It is possible to consider matrices with infinitely many columns and rows. Another extension istensors, which can be seen as higher-dimensional arrays of numbers, as opposed to vectors, which can often be realized as sequences of numbers, while matrices are rectangular or two-dimensional arrays of numbers.[110] Matrices, subject to certain requirements tend to formgroups known as matrix groups.[111] Similarly under certain conditions matrices formrings known asmatrix rings.[112] Though the product of matrices is not in general commutative, certain matrices formfields sometimes called matrix fields.[113] (However the term "matrix field" is ambiguous, also referring to certain forms of physicalfields that continuously map points of some space to matrices.[114]) In general, matrices over any ring and theirmultiplication can be represented as the arrows and composition of arrows in acategory, thecategory of matrices over that ring. The objects of this category are natural numbers, representing the dimensions of the matrices.[115]

Matrices with entries in a field or ring

[edit]

This article focuses on matrices whose entries are real or complex numbers.However, matrices can be considered with much more general types of entries than real or complex numbers. As a first step of generalization, anyfield, that is, aset whereaddition,subtraction,multiplication, anddivision operations are defined and well-behaved, may be used instead ofR{\displaystyle \mathbb {R} } orC{\displaystyle \mathbb {C} }, for examplerational numbers orfinite fields. For example,coding theory makes use of matrices over finite fields.[116] Wherevereigenvalues are considered, as these are roots of a polynomial, they may exist only in a larger field than that of the entries of the matrix. For instance, they may be complex in the case of a matrix with real entries. The possibility to reinterpret the entries of a matrix as elements of a larger field (for example, to view a real matrix as a complex matrix whose entries happen to be all real) then allows considering each square matrix to possess a full set of eigenvalues.[117] Alternatively one can consider only matrices with entries in analgebraically closed field, such asC,{\displaystyle \mathbb {C} ,} from the outset.[118]

Matrices whose entries arepolynomials,[119] and more generally, matrices with entries in aringR are widely used in mathematics.[1] Rings are a more general notion than fields in that a division operation need not exist. The very same addition and multiplication operations of matrices extend to this setting, too. The setM(n,R) (also denotedMn(R)[15]) of all squaren-by-n matrices overR is a ring calledmatrix ring, isomorphic to theendomorphism ring of the leftR-moduleRn.[120] If the ringR iscommutative, that is, its multiplication is commutative, then the ringM(n,R) is also anassociative algebra overR. Thedeterminant of square matrices over a commutative ringR can still be defined using theLeibniz formula; such a matrix is invertible if and only if its determinant isinvertible inR, generalizing the situation over a fieldF, where every nonzero element is invertible.[121] Matrices oversuperrings are calledsupermatrices.[122]

Matrices do not always have all their entries in the same ring – or even in any ring at all. One special but common case isblock matrices, which may be considered as matrices whose entries themselves are matrices. The entries need not be square matrices, and thus need not be members of anyring; but in order to multiply them, their sizes must fulfill certain conditions: each pair of submatrices that are multiplied in forming the overall product must have compatible sizes.[123]

Relationship to linear maps

[edit]

Linear mapsRnRm{\displaystyle \mathbb {R} ^{n}\to \mathbb {R} ^{m}} are equivalent tom-by-n matrices, as describedabove. More generally, any linear mapf :VW between finite-dimensionalvector spaces can be described by a matrixA = (aij), after choosingbasesv1, ...,vn ofV, andw1, ...,wm ofW (son is the dimension ofV andm is the dimension ofW), which is such thatf(vj)=i=1mai,jwifor j=1,,n.{\displaystyle f(\mathbf {v} _{j})=\sum _{i=1}^{m}a_{i,j}\mathbf {w} _{i}\qquad {\mbox{for}}\ j=1,\ldots ,n.}In other words, columnj ofA expresses the image ofvj in terms of the basis vectorswi ofW; thus this relation uniquely determines the entries of the matrixA. The matrix depends on the choice of the bases: different choices of bases give rise to different, butequivalent matrices.[124] Many of the above concrete notions can be reinterpreted in this light, for example, the transpose matrixAT describes thetranspose of the linear map given byA, concerning thedual bases.[125]

These properties can be restated more naturally: thecategory of matrices with entries in a fieldk{\displaystyle k} with multiplication as composition isequivalent to the category of finite-dimensionalvector spaces and linear maps over this field.[126]

More generally, the set ofm × n matrices can be used to represent theR-linear maps between the free modulesRm andRn for an arbitrary ringR with unity. Whenn =m composition of these maps is possible, and this gives rise to thematrix ring ofn × n matrices representing theendomorphism ring ofRn.[127]

Matrix groups

[edit]
Main article:Matrix group

Agroup is a mathematical structure consisting of a set of objects together with abinary operation, that is, an operation combining any two objects to a third, subject to certain requirements.[128] A group in which the objects areinvertiblen×n{\displaystyle n\times n} matrices and the group operation is matrix multiplication is called amatrix group of degreen{\displaystyle n}.[129] Every such matrix group is asubgroup of (that is, a smaller group contained within) the group ofall invertiblen×n{\displaystyle n\times n} matrices, thegeneral linear group of degreen{\displaystyle n}.[130]

Any property of square matrices that is preserved under matrix products and inverses can be used to define a matrix group. For example, the set of alln×n{\displaystyle n\times n} matrices whose determinant is1 form a group called thespecial linear group of degreen{\displaystyle n}.[131] The set oforthogonal matrices, determined by the conditionMTM=I,{\displaystyle {\mathbf {M}}^{\rm {T}}{\mathbf {M}}={\mathbf {I}},}form theorthogonal group.[132] Every orthogonal matrix hasdeterminant1 or−1. Orthogonal matrices with determinant1 form a group called thespecial orthogonal group.[133]

Everyfinite group isisomorphic to a matrix group, as one can see by considering theregular representation of thesymmetric group.[134] General groups can be studied using matrix groups, which are comparatively well understood, usingrepresentation theory.[135]

Infinite matrices

[edit]

It is also possible to consider matrices with infinitely many rows and columns.[136] The basic operations introduced above are defined the same way in this case. Matrix multiplication, however, and all operations stemming therefrom are only meaningful when restricted to certain matrices, since thesum featuring in the above definition of the matrix product will contain an infinity of summands.[137] An easy way to circumvent this issue is to restrict tofinitary matrices all of whose rows (or columns) contain only finitely many nonzero terms.[138] As in the finite case (seeabove), where matrices describe linear maps, infinite matrices can be used to describeoperators on Hilbert spaces, where convergence andcontinuity questions arise. However, the explicit point of view of matrices tends to obfuscate the matter,[139] and the abstract and more powerful tools offunctional analysis are used instead, by relating matrices to linear maps (as in the finite caseabove), but imposing additional convergence and continuity constraints.

Empty matrix

[edit]

Anempty matrix is a matrix in which the number of rows or columns (or both) is zero.[140][8] Empty matrices can be a usefulbase case for certainrecursive constructions,[141] and can help to deal with maps involving thezero vector space.[142] For example, ifA is a3 × 0 matrix andB is a0 × 3 matrix, thenAB is the3 × 3zero matrix corresponding to the null map from a 3-dimensional spaceV to itself, whileBA is a0 × 0 matrix. There is no common notation for empty matrices, but mostcomputer algebra systems allow creating and computing with them.[143] The determinant of the0 × 0 matrix is conventionally defined to be 1, consistent with theempty product occurring in the Leibniz formula for the determinant.[144] This value is also needed for consistency with the2 × 2 case of theDesnanot–Jacobi identity relating determinants to the determinants of smaller matrices.[145]

Matrices with entries in a semiring

[edit]

Asemiring is similar to a ring, but elements need not haveadditive inverses, therefore one cannot do subtraction freely there. The definition of addition and multiplication of matrices with entries in a ring applies to matrices with entries in a semiring without modification. Matrices of fixed size with entries in a semiring form acommutative monoidMat(m,n;R){\displaystyle \operatorname {Mat} (m,n;R)} under addition.[146] Square matrices of fixed size with entries in a semiring form a semiringMat(n;R){\displaystyle \operatorname {Mat} (n;R)} under addition and multiplication.[146]

The determinant of ann × n square matrixM{\displaystyle M} with entries in acommutative semiringR{\displaystyle R} cannot be defined in general because the definition would involve additive inverses of semiring elements. What plays its role instead is the pair of positive and negative determinants

det+M=σAlt(n)M1σ(1)Mnσ(n){\displaystyle \det \nolimits _{+}M=\sum _{\sigma \in \operatorname {Alt} (n)}M_{1\sigma (1)}\cdots M_{n\sigma (n)}}
detM=σSym(n)Alt(n)M1σ(1)Mnσ(n){\displaystyle \det \nolimits _{-}M=\sum _{\sigma \in \operatorname {Sym} (n)\setminus \operatorname {Alt} (n)}M_{1\sigma (1)}\cdots M_{n\sigma (n)}}

where the sums are taken overeven permutations and odd permutations, respectively.[147][148]

Matrices with entries in a category

[edit]

Matrices and their multiplication can be defined with entries objects of acategory equipped with a "tensor product" similar to multiplication in a ring, havingcoproducts similar to addition in a ring, in that the former isdistributive over the latter.[149] However, the multiplication thus defined may be only associative in a sense weaker than usual. These are part of a bigger structure called thebicategory of matrices. The complete description of the above summary for interested readers follows.

Let(C,,I){\displaystyle ({\mathcal {C}},\otimes ,I)} be amonoidal category satisfying the following two conditions:

Then, thebicategory ofC{\displaystyle {\mathcal {C}}}-matricesMat(C){\displaystyle \operatorname {Mat} ({\mathcal {C}})} is as follows:[149]

In general, the bicategory of matrices need not be a strict2-category. For example, the composition of 1-morphisms may not be associative in the usual strict sense, but only up tocoherent isomorphism.

Applications

[edit]

There are numerous applications of matrices, both in mathematics and other sciences. Some of them merely take advantage of the compact representation of a set of numbers in a matrix. For example,Text mining and automatedthesaurus compilation makes use ofdocument-term matrices such astf-idf to track frequencies of certain words in several documents.[150]

Complex numbers can be represented by particular real 2-by-2 matrices viaa+ib[abba],{\displaystyle a+ib\leftrightarrow {\begin{bmatrix}a&-b\\b&a\end{bmatrix}},}under which addition and multiplication of complex numbers and matrices correspond to each other. For example, 2-by-2 rotation matrices represent the multiplication with some complex number ofabsolute value 1, asabove. A similar interpretation is possible forquaternions[151] andClifford algebras in general.[152]

Ingame theory andeconomics, thepayoff matrix encodes the payoff for two players, depending on which out of a given (finite) set of strategies the players choose.[153] The expected outcome of the game, when both players playmixed strategies, is obtained by multiplying this matrix on both sides by vectors representing the strategies.[154] Theminimax theorem central to game theory is closely related to theduality theory of linear programs, which are often formulated in terms of matrix-vector products.[155]

Earlyencryption techniques such as theHill cipher also used matrices. However, due to the linear nature of matrices, these codes are comparatively easy to break.[156]Computer graphics uses matrices to represent objects; to calculate transformations of objects using affinerotation matrices to accomplish tasks such as projecting a three-dimensional object onto a two-dimensional screen, corresponding to a theoretical camera observation; and to apply image convolutions such as sharpening, blurring, edge detection, and more.[157] Matrices over apolynomial ring are important in the study ofcontrol theory.[158]

Chemistry makes use of matrices in various ways, particularly since the use ofquantum theory to discussmolecular bonding andspectroscopy. Examples are theoverlap matrix and theFock matrix used in solving theRoothaan equations to obtain themolecular orbitals of theHartree–Fock method.[159]

Graph theory

[edit]
An undirected graph with adjacency matrix:[110101010].{\displaystyle {\begin{bmatrix}1&1&0\\1&0&1\\0&1&0\end{bmatrix}}.}

Theadjacency matrix of afinite graph is a basic notion ofgraph theory.[160] It records which vertices of the graph are connected by an edge. Matrices containing just two different values (1 and0 meaning for example "yes" and "no", respectively) are calledlogical matrices. Thedistance (or cost) matrix contains information about the distances of the edges.[161] These concepts can be applied towebsites connected byhyperlinks,[162] or cities connected by roads etc., in which case (unless the connection network is extremely dense) the matrices tend to besparse, that is, contain few nonzero entries. Therefore, specifically tailored matrix algorithms can be used innetwork theory.[163]

Analysis and geometry

[edit]

TheHessian matrix of adifferentiable functionf:RnR{\displaystyle f:\mathbb {R} ^{n}\to \mathbb {R} } consists of thesecond derivatives ofƒ concerning the several coordinate directions, that is,[164]H(f)=[2fxixj].{\displaystyle H(f)=\left[{\frac {\partial ^{2}f}{\partial x_{i}\,\partial x_{j}}}\right].}

At thesaddle point(x = 0,y = 0) (red) of the functionf (x,−y) =x2y2, the Hessian matrix[2002]{\displaystyle {\begin{bmatrix}2&0\\0&-2\end{bmatrix}}} isindefinite.

It encodes information about the local growth behavior of the function: given acritical pointx = (x1, ...,xn), that is, a point where the firstpartial derivativesf/xi{\displaystyle \partial f/\partial x_{i}} off vanish, the function has alocal minimum if the Hessian matrix ispositive definite.Quadratic programming can be used to find global minima or maxima of quadratic functions closely related to the ones attached to matrices (seeabove).[165]

Another matrix frequently used in geometrical situations is theJacobi matrix of a differentiable mapf:RnRm{\displaystyle f:\mathbb {R} ^{n}\to \mathbb {R} ^{m}}. Iff1, ...,fm denote the components off, then the Jacobi matrix is defined as[166]J(f)=[fixj]1im,1jn.{\displaystyle J(f)=\left[{\frac {\partial f_{i}}{\partial x_{j}}}\right]_{1\leq i\leq m,1\leq j\leq n}.}Ifn >m, and if the rank of the Jacobi matrix attains its maximal valuem,f is locally invertible at that point, by theimplicit function theorem.[167]

Partial differential equations can be classified by considering the matrix of coefficients of the highest-order differential operators of the equation. Forelliptic partial differential equations this matrix is positive definite, which has a decisive influence on the set of possible solutions of the equation in question.[168]

Thefinite element method is an important numerical method to solve partial differential equations, widely applied in simulating complex physical systems. It attempts to approximate the solution to some equation by piecewise linear functions, where the pieces are chosen concerning a sufficiently fine grid, which in turn can be recast as a matrix equation.[169]

Probability theory and statistics

[edit]
Two different Markov chains. The chart depicts the number of particles (of a total of 1000) in state "2". Both limiting values can be determined from the transition matrices, which are given by[0.700.31]{\displaystyle \left[{\begin{smallmatrix}0.7&0\\0.3&1\end{smallmatrix}}\right]} (red) and[0.70.20.30.8]{\displaystyle \left[{\begin{smallmatrix}0.7&0.2\\0.3&0.8\end{smallmatrix}}\right]} (black).

Stochastic matrices are square matrices whose rows areprobability vectors, that is, whose entries are non-negative and sum up to one. Stochastic matrices are used to defineMarkov chains with finitely many states.[170] A row of the stochastic matrix gives the probability distribution for the next position of some particle currently in the state that corresponds to the row. Properties of the Markov chain—likeabsorbing states, that is, states that any particle attains eventually—can be read off the eigenvectors of the transition matrices.[171]

Statistics also makes use of matrices in many different forms.[172]Descriptive statistics is concerned with describing data sets, which can often be represented asdata matrices, which may then be subjected todimensionality reduction techniques. Thecovariance matrix encodes the mutualvariance of severalrandom variables.[173] Another technique using matrices arelinear least squares, a method that approximates a finite set of pairs(x1,y1), (x2,y2), ..., (xN,yN), by a linear functionyiaxi+b,i=1,,N{\displaystyle y_{i}\approx ax_{i}+b,\quad i=1,\ldots ,N}which can be formulated in terms of matrices, related to thesingular value decomposition of matrices.[174]

Random matrices are matrices whose entries are random numbers, subject to suitableprobability distributions, such asmatrix normal distribution. Beyond probability theory, they are applied in domains ranging fromnumber theory tophysics.[175][176]

Quantum mechanics and particle physics

[edit]

The first model ofquantum mechanics (Heisenberg, 1925) used infinite-dimensional matrices to define the operators that took over the role of variables like position, momentum and energy from classical physics.[177] (This is sometimes referred to asmatrix mechanics.[178]) Matrices, both finite and infinite-dimensional, have since been employed for many purposes in quantum mechanics. One particular example is thedensity matrix, a tool used in calculating theprobabilities of the outcomes ofmeasurements performed onphysical systems.[179][180]

Linear transformations and the associatedsymmetries play akey role in modern physics. For example,elementary particles inquantum field theory are classified as representations of theLorentz group of special relativity and, more specifically, by their behavior under thespin group. Concrete representations involving thePauli matrices and more generalgamma matrices are an integral part of the physical description offermions, which behave asspinors.[181] For the three lightestquarks, there is a group-theoretical representation involving thespecial unitary group SU(3); for their calculations, physicists use a convenient matrix representation known as theGell-Mann matrices, which are also used for the SU(3)gauge group that forms the basis of the modern description of strong nuclear interactions,quantum chromodynamics. TheCabibbo–Kobayashi–Maskawa matrix, in turn, expresses the fact that the basic quark states that are important forweak interactions are not the same as, but linearly related to the basic quark states that define particles with specific and distinctmasses.[182]

Another matrix serves as a key tool for describing the scattering experiments that form the cornerstone of experimental particle physics: Collision reactions such as occur inparticle accelerators, where non-interacting particles head towards each other and collide in a small interaction zone, with a new set of non-interacting particles as the result, can be described as the scalar product of outgoing particle states and a linear combination of ingoing particle states. The linear combination is given by a matrix known as theS-matrix, which encodes all information about the possible interactions between particles.[183]

Normal modes

[edit]

A general application of matrices in physics is the description of linearly coupled harmonic systems. Theequations of motion of such systems can be described in matrix form, with a mass matrix multiplying a generalized velocity to give the kinetic term, and aforce matrix multiplying a displacement vector to characterize the interactions. The best way to obtain solutions is to determine the system'seigenvectors, itsnormal modes, by diagonalizing the matrix equation. Techniques like this are crucial when it comes to the internal dynamics ofmolecules: the internal vibrations of systems consisting of mutually bound component atoms.[184] They are also needed for describing mechanical vibrations, and oscillations in electrical circuits.[185]

Geometrical optics

[edit]

Geometrical optics provides further matrix applications. In this approximative theory, thewave nature of light is neglected. The result is a model in whichlight rays are indeedgeometrical rays. If the deflection of light rays by optical elements is small, the action of alens or reflective element on a given light ray can be expressed as multiplication of a two-component vector with a two-by-two matrix calledray transfer matrix analysis: the vector's components are the light ray's slope and its distance from the optical axis, while the matrix encodes the properties of the optical element. There are two kinds of matrices, viz. arefraction matrix describing the refraction at a lens surface, and atranslation matrix, describing the translation of the plane of reference to the next refracting surface, where another refraction matrix applies.The optical system, consisting of a combination of lenses and reflective elements, is simply described by the matrix resulting from the product of the components' matrices.[186]

TheJones calculus models thepolarization of a light source as a2×2{\displaystyle 2\times 2} vector, and the effects ofoptical filters on this polarization vector as a matrix.[48]

Electronics

[edit]

Electronic circuits that are composed of linear components (such as resistors, inductors and capacitors) obeyKirchhoff's circuit laws, which leads to a system of linear equations, which can be described with a matrix equation that relates the source currents and voltages to the resultant currents and voltages at each point in the circuit, and where the matrix entries are determined by the circuit.[187]

History

[edit]

Matrices have a long history of application in solvinglinear equations but they were known as arrays until the 1800s. TheChinese textThe Nine Chapters on the Mathematical Art written in the 10th–2nd century BCE is the first example of the use of array methods to solvesimultaneous equations,[188] including the concept ofdeterminants. In 1545 Italian mathematicianGerolamo Cardano introduced the method to Europe when he publishedArs Magna.[189] TheJapanese mathematicianSeki used the same array methods to solve simultaneous equations in 1683.[190] The Dutch mathematicianJan de Witt represented transformations using arrays in his 1659 bookElements of Curves (1659).[191] Between 1700 and 1710Gottfried Wilhelm Leibniz publicized the use of arrays for recording information or solutions and experimented with over 50 different systems of arrays.[189]Cramer presentedhis rule in 1750.[192][193]

This use of the termmatrix in mathematics (an English word for "womb" in the 19th century, from Latin, as well as a jargon wordin printing,in biology andin geology[194]) was coined byJames Joseph Sylvester in 1850,[195] who understood a matrix as an object giving rise to several determinants today calledminors, that is to say, determinants of smaller matrices that derive from the original one by removing columns and rows. In an 1851 paper, Sylvester explains:[196]

I have in previous papers defined a "Matrix" as a rectangular array of terms, out of which different systems of determinants may be engendered from the womb of a common parent.

Arthur Cayley published a treatise on geometric transformations using matrices that were not rotated versions of the coefficients being investigated as had previously been done. Instead, he defined operations such as addition, subtraction, multiplication, and division as transformations of those matrices and showed the associative and distributive properties held. Cayley investigated and demonstrated the non-commutative property of matrix multiplication as well as the commutative property of matrix addition.[189] Early matrix theory had limited the use of arrays almost exclusively to determinants and Cayley's abstract matrix operations were revolutionary. He was instrumental in proposing a matrix concept independent of equation systems. In 1858, Cayley published hisA memoir on the theory of matrices[197][198] in which he proposed and demonstrated theCayley–Hamilton theorem.[189]

The English mathematicianCuthbert Edmund Cullis was the first to use modern bracket notation for matrices in 1913 and he simultaneously demonstrated the first significant use of the notationA = [ai,j] to represent a matrix whereai,j refers to theith row and thejth column.[189]

The modern study of determinants sprang from several sources.[199]Number-theoretical problems ledGauss to relate coefficients ofquadratic forms, that is, expressions such asx2 +xy − 2y2, andlinear maps in three dimensions to matrices.Eisenstein further developed these notions, including the remark that, in modern parlance,matrix products arenon-commutative.Cauchy was the first to prove general statements about determinants, using as the definition of the determinant of a matrixA = [ai,j] the following: replace the powersajk byaj,k in thepolynomiala1a2ani<j(ajai),{\displaystyle a_{1}a_{2}\cdots a_{n}\prod _{i<j}(a_{j}-a_{i}),}where{\displaystyle \textstyle \prod } denotes theproduct of the indicated terms. He also showed, in 1829, that theeigenvalues of symmetric matrices are real.[200]Jacobi studied "functional determinants"—later calledJacobi determinants by Sylvester—which can be used to describe geometric transformations at a local (orinfinitesimal) level, seeabove.Kronecker'sVorlesungen über die Theorie der Determinanten[201] andWeierstrass'sZur Determinantentheorie,[202] both published in 1903, first treated determinantsaxiomatically, as opposed to previous more concrete approaches such as the mentioned formula of Cauchy. At that point, determinants were firmly established.[203][199]

Many theorems were first established for small matrices only, for example, theCayley–Hamilton theorem was proved for2 × 2 matrices by Cayley in the aforementioned memoir, and byHamilton for4 × 4 matrices.Frobenius, working onbilinear forms, generalized the theorem to all dimensions (1898). Also at the end of the 19th century, theGauss–Jordan elimination (generalizing a special case now known asGauss elimination) was established byWilhelm Jordan. In the early 20th century, matrices attained a central role in linear algebra,[204] partially due to their use in the classification of thehypercomplex number systems of the previous century.[205]

The inception ofmatrix mechanics byHeisenberg,Born andJordan led to studying matrices with infinitely many rows and columns.[206] Later,von Neumann carried out themathematical formulation of quantum mechanics, by further developingfunctional analytic notions such aslinear operators onHilbert spaces, which, very roughly speaking, correspond toEuclidean space, but with an infinity ofindependent directions.[207]

Other historical usages of the word "matrix" in mathematics

[edit]

The word has been used in unusual ways by at least two authors of historical importance.

Bertrand Russell andAlfred North Whitehead in theirPrincipia Mathematica (1910–1913) use the word "matrix" in the context of theiraxiom of reducibility. They proposed this axiom as a means to reduce any function to one of lower type, successively, so that at the "bottom" (0 order) the function is identical to itsextension:[208]

Let us give the name ofmatrix to any function, of however many variables, that does not involve anyapparent variables. Then, any possible function other than a matrix derives from a matrix using generalization, that is, by considering the proposition that the function in question is true with all possible values or with some value of one of the arguments, the other argument or arguments remaining undetermined.

For example, a functionΦ(x,y) of two variablesx andy can be reduced to acollection of functions of a single variable, such asy, by "considering" the function for all possible values of "individuals"ai substituted in place of a variablex. And then the resulting collection of functions of the single variabley, that is,ai: Φ(ai,y), can be reduced to a "matrix" of values by "considering" the function for all possible values of "individuals"bi substituted in place of variabley:bjai:ϕ(ai,bj).{\displaystyle \forall b_{j}\forall a_{i}\colon \phi (a_{i},b_{j}).}

Alfred Tarski in his 1941Introduction to Logic used the word "matrix" synonymously with the notion oftruth table as used in mathematical logic.[209]

See also

[edit]

Notes

[edit]
  1. ^abLang (2002), Chapter XIII.
  2. ^Fraleigh (1976), p. 209.
  3. ^Nering (1970), p. 37.
  4. ^abBrown (1991), p. 1.
  5. ^Golub & Van Loan (1996), p. 3.
  6. ^Horn & Johnson (1985), p. 5.
  7. ^Gbur (2011), p. 89.
  8. ^ab"A matrix having at least one dimension equal to zero is called an empty matrix",MATLAB Data StructuresArchived 2009-12-28 at theWayback Machine
  9. ^Ramachandra Rao & Bhimasankaram (2000), p. 71.
  10. ^Hamilton (1987), p. 29.
  11. ^Gentle (1998), pp. 52–53.
  12. ^Bauchau & Craig (2009), p. 915.
  13. ^Johnston (2021), p. 21.
  14. ^Oualline (2003), Ch. 5.
  15. ^abPop & Furdui (2017).
  16. ^For example, forM{\displaystyle M}, seeMello (2017),p. 48; forMat{\displaystyle \operatorname {Mat} }, seeAxler (1997),p. 50.
  17. ^Brown (1991), Definition I.2.1 (addition), Definition I.2.4 (scalar multiplication), and Definition I.2.33 (transpose).
  18. ^Whitelaw (1991), p. 29.
  19. ^Brown (1991), Theorem I.2.6.
  20. ^Whitelaw (1991), p. 30.
  21. ^Maxwell (1969), p. 46.
  22. ^Lancaster & Tismenetsky (1985), pp. 6–7.
  23. ^Andrilli & Hecker (2022), p. 38, The transpose of a matrix and its properties.
  24. ^abLancaster & Tismenetsky (1985), p. 9.
  25. ^Brown (1991), Definition I.2.20.
  26. ^Brown (1991), Theorem I.2.24.
  27. ^Boas (2005), p. 117.
  28. ^Horn & Johnson (1985), Ch. 4 and 5.
  29. ^Van Loan (2000).
  30. ^Perrone (2024), p. 119–120.
  31. ^Lang (1986), p. 71.
  32. ^Watkins (2002), p. 102.
  33. ^Bronson (1970), p. 16.
  34. ^Kreyszig (1972), p. 220.
  35. ^abProtter & Morrey (1970), p. 869.
  36. ^Kreyszig (1972), pp. 241, 244.
  37. ^Schneider & Barker (2012).
  38. ^Perlis (1991).
  39. ^Anton (2010).
  40. ^Horn, Roger A.; Johnson, Charles R. (2012),Matrix Analysis (2nd ed.), Cambridge University Press, p. 17,ISBN 978-0-521-83940-2.
  41. ^Brown (1991), I.2.21 and 22.
  42. ^Gbur (2011), p. 95.
  43. ^Ben-Israel & Greville (2003), pp. 1–2.
  44. ^Grossman (1994), pp. 494–495.
  45. ^Bierens (2004), p. 263.
  46. ^Johnston (2021), p. 56.
  47. ^Pettofrezzo (1978), p. 60.
  48. ^abcHan, Kim & Noz (1997).
  49. ^Jeffrey (2010), p. 264.
  50. ^Greub (1975, p. 90). Note however that Greub follows a transposed convention of representing a transformation by multiplying a row vector by a matrix, rather than multiplying a matrix by a column vector, leading to the reversed order for the two matrices in the product that represents a composition.
  51. ^Lang (1986), §VI.1.
  52. ^Brown (1991), Definition II.3.3.
  53. ^Greub (1975), Section III.1.
  54. ^Brown (1991), Theorem II.3.22.
  55. ^Anton (2010), p. 27.
  56. ^Reyes (2025).
  57. ^Anton (2010), p. 68.
  58. ^Gbur (2011), p. 91.
  59. ^abBoas (2005), p. 118.
  60. ^Horn & Johnson (1985), §0.9.1 Diagonal matrices.
  61. ^Boas (2005), p. 138.
  62. ^Horn & Johnson (1985), Theorem 2.5.6.
  63. ^Conway (1990), pp. 262–263.
  64. ^Brown (1991), Definition I.2.28.
  65. ^Brown (1991), Definition I.5.13.
  66. ^Anton (2010), p. 62.
  67. ^Gbur (2011), pp. 99–100.
  68. ^Horn & Johnson (1985), Chapter 7.
  69. ^Anton (2010), Thm. 7.3.2.
  70. ^Horn & Johnson (1985), Theorem 7.2.1.
  71. ^Boas (2005), p. 150.
  72. ^Horn & Johnson (1985), p. 169, Example 4.0.6.
  73. ^Lang (1986), Appendix. Complex numbers.
  74. ^Horn & Johnson (1985), pp. 66–67.
  75. ^Gbur (2011), pp. 102–103.
  76. ^Boas (2005), pp. 127, 153–154.
  77. ^Boas (2005), p. 141.
  78. ^Horn & Johnson (1985), pp. 40, 42.
  79. ^Lang (1986), p. 281.
  80. ^Tang (2006), p. 226.
  81. ^Bernstein (2009), p. 94.
  82. ^Horn & Johnson (1985), §0.5 Nonsingularity.
  83. ^Margalit & Rabinoff (2019).
  84. ^"Matrix | mathematics",Encyclopedia Britannica, retrieved2020-08-19
  85. ^Brown (1991), Definition III.2.1.
  86. ^Brown (1991), Theorem III.2.12.
  87. ^Brown (1991), Corollary III.2.16.
  88. ^Mirsky (1990), Theorem 1.4.1.
  89. ^Brown (1991), Theorem III.3.18.
  90. ^Eigen means "own" inGerman and inDutch. SeeWiktionary.
  91. ^Brown (1991), Definition III.4.1.
  92. ^Brown (1991), Definition III.4.9.
  93. ^Brown (1991), Corollary III.4.10.
  94. ^Anton (2010), pp. 317–319.
  95. ^Bernstein (2009), p. 265.
  96. ^Householder (1975), Ch. 7.
  97. ^Bau III & Trefethen (1997).
  98. ^Golub & Van Loan (1996), Algorithm 1.3.1.
  99. ^Vassilevska Williams et al. (2024).
  100. ^Misra, Bhattacharya & Ghosh (2022).
  101. ^Golub & Van Loan (1996), Chapters 9 and 10, esp. section 10.2.
  102. ^Golub & Van Loan (1996), Chapter 2.3.
  103. ^Press et al. (1992).
  104. ^Stoer & Bulirsch (2002), Section 4.1.
  105. ^Gbur (2011), pp. 146–153.
  106. ^Horn & Johnson (1985), Theorem 2.5.4.
  107. ^Horn & Johnson (1985), Ch. 3.1, 3.2.
  108. ^Arnold (1992), Sections 14.5, 7, 8.
  109. ^Bronson (1989), Ch. 15.
  110. ^Coburn (1955), Ch. V.
  111. ^Tapp (2016).
  112. ^Lam (1999), pp. 461–470, Chapter 7, §17 Matrix Rings, §17A Characterization and Examples.
  113. ^Hachenberger & Jungnickel (2020), p. 302, Definition 7.2.1.
  114. ^Ydri (2016).
  115. ^Riehl (2016), pp. 4-6.
  116. ^Roth (2006), p. 27.
  117. ^Chahal (2018), pp. 115–116.
  118. ^Meckes & Meckes (2018), pp. 360–361.
  119. ^Edwards (2004), p. 80.
  120. ^Lang (2002), p. 643, XVII.1.
  121. ^Lang (2002), Proposition XIII.4.16.
  122. ^Reichl (2004), Section L.2.
  123. ^Jeffrey (2010), pp. 54ff, 3.7 Partitioning of matrices.
  124. ^Greub (1975), Section III.3.
  125. ^Greub (1975), Section III.3.13.
  126. ^Perrone (2024), pp. 99–100.
  127. ^Hungerford (1980), pp. 328–335, VII.1: Matrices and maps.
  128. ^Horn & Johnson (1985), p. 69.
  129. ^Baker (2003), Def. 1.30.
  130. ^Cameron (2014).
  131. ^Baker (2003), Theorem 1.2.
  132. ^Artin (1991), Chapter 4.5.
  133. ^Serre (2007), p. 20.
  134. ^Rowen (2008), p. 198, Example 19.2.
  135. ^See any reference in representation theory orgroup representation.
  136. ^See the item "Matrix" in Itô 1987.
  137. ^Boos (2000), pp. 34–39, 2.2 Dealing with infinite matrices.
  138. ^Grillet (2007), p. 334.
  139. ^"Not much of matrix theory carries over to infinite-dimensional spaces, and what does is not so useful, but it sometimes helps." Halmos 1982, p. 23, Chapter 5.
  140. ^"Empty Matrix: A matrix is empty if either its row or column dimension is zero",GlossaryArchived 2009-04-29 at theWayback Machine, O-Matrix v6 User Guide
  141. ^Coleman & Van Loan (1988), p. 213.
  142. ^Hazewinkel & Gubareni (2017), p. 151.
  143. ^The notation of empty matrix is used differently from some sources likeBernstein (2009), p. 90 use00×n{\displaystyle 0_{0\times n}}, resembling thezero matrix;Hazewinkel & Gubareni (2017), p. 151 useI0×n{\displaystyle {\mathfrak {I}}_{0\times n}}.
  144. ^West (2020), p. 750.
  145. ^Brualdi et al. (2018), p. 19.
  146. ^abFarid, Khan & Wang (2013), 2087.
  147. ^Reutenauer & Straubing (1984), 351.
  148. ^Ghosh (1996), 222.
  149. ^abCarboni, Kasangian & Walters (1987), 137.
  150. ^Manning & Schütze (1999), Section 15.3.4.
  151. ^Ward (1997), Ch. 2.8.
  152. ^Abłamowicz (2000), p. 436.
  153. ^Fudenberg & Tirole (1983), Section 1.1.1.
  154. ^McHugh (2025), p. 390, 11.2.3 The expected payoff as a vector–matrix–vector product.
  155. ^Matoušek & Gärtner (2007), pp. 136–137.
  156. ^Stinson (2005), Ch. 1.1.5 and 1.2.4.
  157. ^ISRD Group (2005), Ch. 7.
  158. ^Bhaya & Kaszkurewicz (2006), p. 230.
  159. ^Jensen (1999), p. 65–69.
  160. ^Godsil & Royle (2004), Ch. 8.1.
  161. ^Punnen & Gutin (2002).
  162. ^Zhang, Yu & Hou (2006), p. 7.
  163. ^Scott & Tůma (2023).
  164. ^Lang (1987), Ch. XVI.6.
  165. ^Nocedal & Wright (2006), Ch. 16.
  166. ^Lang (1987), Ch. XVI.1.
  167. ^Lang 1987, Ch. XVI.5. For a more advanced, and more general statement see Lang 1969, Ch. VI.2.
  168. ^Gilbarg & Trudinger (2001).
  169. ^Šolin 2005, Ch. 2.5. See alsostiffness method.
  170. ^Latouche & Ramaswami (1999).
  171. ^Mehata & Srinivasan (1978), Ch. 2.8.
  172. ^Healy, Michael (1986),Matrices for Statistics,Oxford University Press,ISBN 978-0-19-850702-4
  173. ^Krzanowski (1988), p. 60, Ch. 2.2.
  174. ^Krzanowski (1988), Ch. 4.1.
  175. ^Conrey 2007
  176. ^Zabrodin, Brézin & Kazakov et al. 2006
  177. ^Schiff (1968), Ch. 6.
  178. ^Peres (1993), p. 20.
  179. ^Bohm (2001), sections I.8, II.4, and II.8.
  180. ^Peres (1993), p. 73.
  181. ^Itzykson & Zuber (1980), Ch. 2.
  182. ^Burgess & Moore (2007), section 1.6.3. (SU(3)), section 2.4.3.2. (Kobayashi–Maskawa matrix).
  183. ^Weinberg (1995), Ch. 3.
  184. ^Wherrett (1987), part II.
  185. ^Riley, Hobson & Bence (1997), 7.17.
  186. ^Guenther (1990), Ch. 5.
  187. ^Suresh Kumar (2009), pp. 747–749.
  188. ^Shen, Crossley & Lun 1999 cited by Bretscher 2005, p. 1
  189. ^abcdeDossey (2002), pp. 564–565.
  190. ^Needham, Joseph;Wang Ling (1959),Science and Civilisation in China, vol. III, Cambridge: Cambridge University Press, p. 117,ISBN 978-0-521-05801-8{{cite book}}: CS1 maint: ignored ISBN errors (link)
  191. ^Dossey (2002), p. 564.
  192. ^Cramer (1750).
  193. ^Kosinski (2001).
  194. ^Murray, James;Bradley, Henry, eds. (1908),"Matrix",A New English Dictionary on Historical Principles, vol. 6, pt. 2 (M–N), Oxford: Clarendon Press, p. 238
  195. ^The earliest published example is J. J. Sylvester (1850) "Additions to the articles in the September number of this journal, 'On a new class of theorems,' and on Pascal's theorem,"The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science,37: 363-370.From page 369: "For this purpose, we must commence, not with a square, but with an oblong arrangement of terms consisting, suppose, of m lines and n columns. This does not in itself represent a determinant, but is, as it were, a Matrix out of which we may form various systems of determinants ... "
  196. ^Sylvester (1904), p. 247,Paper 37.
  197. ^Cayley (1858).
  198. ^Dieudonné (1978), Vol. 1, Ch. III, p. 96.
  199. ^abKnobloch (1994).
  200. ^Hawkins (1975).
  201. ^Kronecker 1897
  202. ^Weierstrass 1915, pp. 271–286
  203. ^ & Miller (1930).
  204. ^Bôcher (2004).
  205. ^Hawkins (1972).
  206. ^van der Waerden (2007), pp. 28–40.
  207. ^Peres (1993), pp. 79, 106–107.
  208. ^Whitehead, Alfred North; and Russell, Bertrand (1913)Principia Mathematica to *56, Cambridge at the University Press, Cambridge UK (republished 1962) cf page 162ff.
  209. ^Tarski (1941), p. 40.

References

[edit]

Mathematical references

[edit]

Physics references

[edit]

Historical references

[edit]

Further reading

[edit]

External links

[edit]
Matrix (mathematics) at Wikipedia'ssister projects
Linear equations
Three dimensional Euclidean space
Matrices
Matrix decompositions
Relations and computations
Vector spaces
Structures
Multilinear algebra
Affine and projective
Numerical linear algebra
Scope
Mathematics
Notation
Tensor
definitions
Operations
Related
abstractions
Notable tensors
Mathematics
Physics
Mathematicians
International
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Matrix_(mathematics)&oldid=1320124175"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp