Kernel (linear algebra)

From Wikipedia, the free encyclopedia

Vectors mapped to 0 by a linear map

For other uses, see Kernel (disambiguation).

An example of a kernel: the linear operator $L:(x,y)\longrightarrow (x,x)$ maps every point on the line $x=0$ to the zero point $(0,0)$, so these points form the kernel of the operator.

In mathematics, the kernel of a linear map, also known as the null space or nullspace, is the part of the domain which is mapped to the zero vector of the co-domain; the kernel is always a linear subspace of the domain.^[1] That is, given a linear map L : V → W between two vector spaces V and W, the kernel of L is the vector space of all elements v of V such that L(v) = 0, where 0 denotes the zero vector in W,^[2] or more symbolically: $\ker(L)=\left\{\mathbf {v} \in V\mid L(\mathbf {v} )=\mathbf {0} \right\}=L^{-1}(\mathbf {0} ).$
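
As a minimal sketch of the definition, the operator L(x, y) = (x, x) from the figure caption can be coded directly and tested on a few sample points. The helper name `in_kernel` and the chosen sample points are illustrative assumptions.

```python
def L(v):
    """The linear operator L(x, y) = (x, x)."""
    x, y = v
    return (x, x)

def in_kernel(v):
    """A vector is in the kernel exactly when L sends it to the zero vector."""
    return L(v) == (0, 0)

# Hypothetical sample points: three on the line x = 0, two off it.
points = [(0, -2), (0, 0), (0, 5), (1, 0), (3, 3)]
kernel_points = [p for p in points if in_kernel(p)]
print(kernel_points)
```

Only the points with x = 0 pass the test, in line with the description of the kernel as the line x = 0.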

Properties

Kernel and image of a linear map L from V to W

The kernel of L is a linear subspace of the domain V.^[3]^[2]

For a linear map $L:V\to W,$ two elements of V have the same image in W if and only if their difference lies in the kernel of L, that is,

$L\left(\mathbf {v} _{1}\right)=L\left(\mathbf {v} _{2}\right)\quad \iff \quad L\left(\mathbf {v} _{1}-\mathbf {v} _{2}\right)=\mathbf {0} .$

From this, it follows by the first isomorphism theorem that the image of L is isomorphic to the quotient of V by the kernel: $\operatorname {im} (L)\cong V/\ker(L).$ In the case where V is finite-dimensional, this implies the rank–nullity theorem: $\dim(\ker L)+\dim(\operatorname {im} L)=\dim(V),$ where the term rank refers to the dimension of the image of L, $\dim(\operatorname {im} L),$ while nullity refers to the dimension of the kernel of L, $\dim(\ker L).$ ^[4] That is, $\operatorname {Rank} (L)=\dim(\operatorname {im} L)\qquad {\text{ and }}\qquad \operatorname {Nullity} (L)=\dim(\ker L),$ so that the rank–nullity theorem can be restated as $\operatorname {Rank} (L)+\operatorname {Nullity} (L)=\dim \left(\operatorname {domain} L\right).$
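
The rank–nullity theorem can be checked on a small matrix. The following is a sketch under stated assumptions: the 3 × 3 matrix `A` is a hypothetical example, and the helpers `rref`, `kernel_basis` and `transpose` are illustrative names. The rank is computed as the dimension of the column space (by row-reducing the transpose), independently of the kernel computation, so that the final identity is a genuine check.

```python
from fractions import Fraction

def rref(rows):
    """Reduced row echelon form with exact fractions; returns (matrix, pivot columns)."""
    m = [[Fraction(x) for x in row] for row in rows]
    pivots, r = [], 0
    for c in range(len(m[0])):
        p = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if p is None:
            continue  # no pivot available in this column
        m[r], m[p] = m[p], m[r]
        pivot = m[r][c]
        m[r] = [x / pivot for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    return m, pivots

def kernel_basis(rows):
    """One basis vector of the kernel per free (non-pivot) column."""
    m, pivots = rref(rows)
    n = len(rows[0])
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, pc in enumerate(pivots):
            v[pc] = -m[row][free]
        basis.append(v)
    return basis

def transpose(rows):
    return [list(col) for col in zip(*rows)]

A = [[1, 2, 3], [2, 4, 6], [1, 0, 1]]   # hypothetical example matrix
rank = len(rref(transpose(A))[1])        # dimension of the image (column space)
kernel = kernel_basis(A)
nullity = len(kernel)
print(rank, nullity, len(A[0]))
```

For this matrix the second row is twice the first, so the image is two-dimensional and the kernel is one-dimensional.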

When V is an inner product space, the quotient $V/\ker(L)$ can be identified with the orthogonal complement in V of $\ker(L)$. This is the generalization to linear operators of the row space, or coimage, of a matrix.

Generalization to modules

Main article: Module (mathematics)

The notion of kernel also makes sense for homomorphisms of modules, which are generalizations of vector spaces where the scalars are elements of a ring, rather than a field. The domain of the mapping is a module, with the kernel constituting a submodule. Here, the concepts of rank and nullity do not necessarily apply.

In functional analysis

Main article: Topological vector space

If V and W are topological vector spaces such that W is finite-dimensional, then a linear operator L: V → W is continuous if and only if the kernel of L is a closed subspace of V.

Representation as matrix multiplication

Consider a linear map represented as an m × n matrix A with coefficients in a field K (typically $\mathbb {R}$ or $\mathbb {C}$), operating on column vectors x with n components over K. The kernel of this linear map is the set of solutions to the equation Ax = 0, where 0 is understood as the zero vector. The dimension of the kernel of A is called the nullity of A. In set-builder notation, $\operatorname {N} (A)=\operatorname {Null} (A)=\operatorname {ker} (A)=\left\{\mathbf {x} \in K^{n}\mid A\mathbf {x} =\mathbf {0} \right\}.$ The matrix equation is equivalent to a homogeneous system of linear equations: $A\mathbf {x} =\mathbf {0} \;\;\Leftrightarrow \;\;{\begin{alignedat}{7}a_{11}x_{1}&&\;+\;&&a_{12}x_{2}&&\;+\;\cdots \;+\;&&a_{1n}x_{n}&&\;=\;&&&0\\a_{21}x_{1}&&\;+\;&&a_{22}x_{2}&&\;+\;\cdots \;+\;&&a_{2n}x_{n}&&\;=\;&&&0\\&&&&&&&&&&\vdots \ \;&&&\\a_{m1}x_{1}&&\;+\;&&a_{m2}x_{2}&&\;+\;\cdots \;+\;&&a_{mn}x_{n}&&\;=\;&&&0{\text{.}}\\\end{alignedat}}$ Thus the kernel of A is the same as the solution set to the above homogeneous equations.

Subspace properties

The kernel of an m × n matrix A over a field K is a linear subspace of Kⁿ. That is, the kernel of A, the set Null(A), has the following three properties:

Null(A) always contains the zero vector, since A0 = 0.
If x ∈ Null(A) and y ∈ Null(A), then x + y ∈ Null(A). This follows from the distributivity of matrix multiplication over addition.
If x ∈ Null(A) and c is a scalar c ∈ K, then cx ∈ Null(A), since A(cx) = c(Ax) = c0 = 0.
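
The three properties can be spot-checked numerically. This sketch uses the 2 × 3 matrix and the kernel vector (−1, −26, 16) from the Illustration section of this article; the second kernel vector `y`, the scalar `c`, and the helper names `matvec` and `in_null` are illustrative assumptions. A spot check of this kind is not a proof.

```python
from fractions import Fraction

A = [[2, 3, 5], [-4, 2, 3]]

def matvec(A, x):
    """Matrix-vector product A x."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def in_null(A, x):
    """True when A x is the zero vector."""
    return all(entry == 0 for entry in matvec(A, x))

x = [-1, -26, 16]          # kernel vector from the Illustration section
y = [2, 52, -32]           # another kernel vector (a multiple of x)
c = Fraction(3, 7)         # an arbitrary scalar

zero = [0, 0, 0]
x_plus_y = [a + b for a, b in zip(x, y)]
c_times_x = [c * a for a in x]

print(in_null(A, zero), in_null(A, x_plus_y), in_null(A, c_times_x))
```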

The row space of a matrix

Main article: Rank–nullity theorem

The product Ax can be written in terms of the dot product of vectors as follows: $A\mathbf {x} ={\begin{bmatrix}\mathbf {a} _{1}\cdot \mathbf {x} \\\mathbf {a} _{2}\cdot \mathbf {x} \\\vdots \\\mathbf {a} _{m}\cdot \mathbf {x} \end{bmatrix}}.$

Here, a₁, ... , a_m denote the rows of the matrix A. It follows that x is in the kernel of A if and only if x is orthogonal (or perpendicular) to each of the row vectors of A (since orthogonality is defined as having a dot product of 0).

The row space, or coimage, of a matrix A is the span of the row vectors of A. By the above reasoning, the kernel of A is the orthogonal complement to the row space. That is, a vector x lies in the kernel of A if and only if it is perpendicular to every vector in the row space of A.

The dimension of the row space of A is called the rank of A, and the dimension of the kernel of A is called the nullity of A. These quantities are related by the rank–nullity theorem^[4] $\operatorname {rank} (A)+\operatorname {nullity} (A)=n.$

Left null space

The left null space, or cokernel, of a matrix A consists of all column vectors x such that x^T A = 0^T, where T denotes the transpose of a matrix. The left null space of A is the same as the kernel of A^T. The left null space of A is the orthogonal complement to the column space of A, and is dual to the cokernel of the associated linear transformation. The kernel, the row space, the column space, and the left null space of A are the four fundamental subspaces associated with the matrix A.
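
Since the left null space of A is the kernel of A^T, any kernel routine can compute it. The sketch below assumes a hypothetical 3 × 2 matrix of rank 1 and an illustrative helper `kernel_basis` based on exact Gauss–Jordan elimination; it then confirms that each basis vector is orthogonal to every column of A.

```python
from fractions import Fraction

def kernel_basis(rows):
    """Basis of {x : M x = 0}, computed by exact Gauss-Jordan elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    n, pivots, r = len(m[0]), [], 0
    for c in range(n):
        p = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if p is None:
            continue  # free column
        m[r], m[p] = m[p], m[r]
        pivot = m[r][c]
        m[r] = [x / pivot for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, pc in enumerate(pivots):
            v[pc] = -m[row][free]
        basis.append(v)
    return basis

A = [[1, 2], [2, 4], [3, 6]]                 # hypothetical 3 x 2 matrix of rank 1
A_T = [list(col) for col in zip(*A)]         # transpose of A
left_null = kernel_basis(A_T)                # vectors x with x^T A = 0^T

# Each left-null vector should be orthogonal to every column of A.
columns = A_T
dots = [[sum(a * b for a, b in zip(x, col)) for col in columns] for x in left_null]
print(left_null, dots)
```

Here the column space is one-dimensional inside a three-dimensional space, so its orthogonal complement, the left null space, is two-dimensional.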

Nonhomogeneous systems of linear equations

The kernel also plays a role in the solution to a nonhomogeneous system of linear equations: $A\mathbf {x} =\mathbf {b} \quad {\text{or}}\quad {\begin{alignedat}{7}a_{11}x_{1}&&\;+\;&&a_{12}x_{2}&&\;+\;\cdots \;+\;&&a_{1n}x_{n}&&\;=\;&&&b_{1}\\a_{21}x_{1}&&\;+\;&&a_{22}x_{2}&&\;+\;\cdots \;+\;&&a_{2n}x_{n}&&\;=\;&&&b_{2}\\&&&&&&&&&&\vdots \ \;&&&\\a_{m1}x_{1}&&\;+\;&&a_{m2}x_{2}&&\;+\;\cdots \;+\;&&a_{mn}x_{n}&&\;=\;&&&b_{m}\\\end{alignedat}}$ If u and v are two possible solutions to the above equation, then $A(\mathbf {u} -\mathbf {v} )=A\mathbf {u} -A\mathbf {v} =\mathbf {b} -\mathbf {b} =\mathbf {0} .$ Thus, the difference of any two solutions to the equation Ax = b lies in the kernel of A.

It follows that any solution to the equation Ax = b can be expressed as the sum of a fixed solution v and an arbitrary element of the kernel. That is, the solution set to the equation Ax = b is $\left\{\mathbf {v} +\mathbf {x} \mid A\mathbf {v} =\mathbf {b} \land \mathbf {x} \in \operatorname {Null} (A)\right\}.$ Geometrically, this says that the solution set to Ax = b is the translation of the kernel of A by the vector v. See also Fredholm alternative and flat (geometry).
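
The "fixed solution plus kernel element" description can be illustrated as follows. This sketch reuses the 2 × 3 matrix and the kernel vector (−1, −26, 16) from the Illustration section of this article; the particular solution `v = (1, 1, 1)`, the resulting right-hand side `b`, and the range of multipliers are assumptions chosen for the example.

```python
A = [[2, 3, 5], [-4, 2, 3]]

def matvec(A, x):
    """Matrix-vector product A x."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]

v = [1, 1, 1]              # a fixed (particular) solution, chosen for illustration
b = matvec(A, v)           # the right-hand side it solves: A v = b
k = [-1, -26, 16]          # kernel vector: A k = 0

# Every translate v + c*k of the particular solution also solves A x = b.
solutions = [[vi + c * ki for vi, ki in zip(v, k)] for c in range(-3, 4)]
all_solve = all(matvec(A, x) == b for x in solutions)

# The difference of any two solutions lies in the kernel.
u = solutions[0]
difference = [ui - vi for ui, vi in zip(u, v)]
print(b, all_solve, matvec(A, difference))
```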

Illustration

The following is a simple illustration of the computation of the kernel of a matrix (see § Computation by Gaussian elimination, below, for methods better suited to more complex calculations). The illustration also touches on the row space and its relation to the kernel.

Consider the matrix $A={\begin{bmatrix}2&3&5\\-4&2&3\end{bmatrix}}.$ The kernel of this matrix consists of all vectors (x, y, z) ∈ R³ for which ${\begin{bmatrix}2&3&5\\-4&2&3\end{bmatrix}}{\begin{bmatrix}x\\y\\z\end{bmatrix}}={\begin{bmatrix}0\\0\end{bmatrix}},$ which can be expressed as a homogeneous system of linear equations involving x, y, and z: ${\begin{aligned}2x+3y+5z&=0,\\-4x+2y+3z&=0.\end{aligned}}$

The same linear equations can also be written in matrix form as: $\left[{\begin{array}{ccc|c}2&3&5&0\\-4&2&3&0\end{array}}\right].$

Through Gauss–Jordan elimination, the matrix can be reduced to: $\left[{\begin{array}{ccc|c}1&0&1/16&0\\0&1&13/8&0\end{array}}\right].$

Rewriting the matrix in equation form yields: ${\begin{aligned}x&=-{\frac {1}{16}}z\\y&=-{\frac {13}{8}}z.\end{aligned}}$

The elements of the kernel can be further expressed in parametric vector form, as follows: ${\begin{bmatrix}x\\y\\z\end{bmatrix}}=c{\begin{bmatrix}-1/16\\-13/8\\1\end{bmatrix}}\quad ({\text{where }}c\in \mathbb {R} )$

Since c is a free variable ranging over all real numbers, this can be expressed equally well as: ${\begin{bmatrix}x\\y\\z\end{bmatrix}}=c{\begin{bmatrix}-1\\-26\\16\end{bmatrix}}.$ The kernel of A is precisely the solution set to these equations (in this case, a line through the origin in R³). Here, the vector (−1,−26,16)^T constitutes a basis of the kernel of A. The nullity of A is therefore 1, as the kernel is spanned by a single vector.

The following dot products are zero: ${\begin{bmatrix}2&3&5\end{bmatrix}}{\begin{bmatrix}-1\\-26\\16\end{bmatrix}}=0\quad \mathrm {and} \quad {\begin{bmatrix}-4&2&3\end{bmatrix}}{\begin{bmatrix}-1\\-26\\16\end{bmatrix}}=0,$ which illustrates that vectors in the kernel of A are orthogonal to each of the row vectors of A.

These two (linearly independent) row vectors span the row space of A, a plane orthogonal to the vector (−1,−26,16)^T.

With the rank of A equal to 2, the nullity of A equal to 1, and the number of columns of A (the dimension of its domain) equal to 3, we have an illustration of the rank–nullity theorem: 2 + 1 = 3.
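
The hand computation above can be reproduced with exact rational arithmetic. This is a minimal sketch, not a production routine: the helper names `rref` and `kernel_basis` are illustrative, and Python's `fractions.Fraction` is used so that entries such as 1/16 and 13/8 are not rounded.

```python
from fractions import Fraction

def rref(rows):
    """Gauss-Jordan elimination with exact fractions; returns (matrix, pivot columns)."""
    m = [[Fraction(x) for x in row] for row in rows]
    pivots, r = [], 0
    for c in range(len(m[0])):
        p = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if p is None:
            continue  # free column
        m[r], m[p] = m[p], m[r]
        pivot = m[r][c]
        m[r] = [x / pivot for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    return m, pivots

def kernel_basis(rows):
    """Read one kernel basis vector off the reduced matrix per free column."""
    m, pivots = rref(rows)
    n = len(rows[0])
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, pc in enumerate(pivots):
            v[pc] = -m[row][free]
        basis.append(v)
    return basis

A = [[2, 3, 5], [-4, 2, 3]]
reduced, pivots = rref(A)
basis = kernel_basis(A)
scaled = [16 * x for x in basis[0]]   # clear denominators
print(reduced, basis, scaled)
```

The reduced matrix has rows (1, 0, 1/16) and (0, 1, 13/8), the single basis vector is (−1/16, −13/8, 1), and scaling by 16 gives (−1, −26, 16), matching the values derived above.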

Examples

If L: R^m → Rⁿ, then the kernel of L is the solution set to a homogeneous system of linear equations. As in the above illustration, if L is the operator: $L(x_{1},x_{2},x_{3})=(2x_{1}+3x_{2}+5x_{3},\;-4x_{1}+2x_{2}+3x_{3})$ then the kernel of L is the set of solutions to the equations ${\begin{alignedat}{7}2x_{1}&\;+\;&3x_{2}&\;+\;&5x_{3}&\;=\;&0\\-4x_{1}&\;+\;&2x_{2}&\;+\;&3x_{3}&\;=\;&0\end{alignedat}}$
Let C[0,1] denote the vector space of all continuous real-valued functions on the interval [0,1], and define L: C[0,1] → R by the rule $L(f)=f(0.3).$ Then the kernel of L consists of all functions f ∈ C[0,1] for which f(0.3) = 0.
Let C^∞(R) be the vector space of all infinitely differentiable functions R → R, and let D: C^∞(R) → C^∞(R) be the differentiation operator: $D(f)={\frac {df}{dx}}.$ Then the kernel of D consists of all functions in C^∞(R) whose derivatives are zero, i.e. the set of all constant functions.
Let R^∞ be the direct product of infinitely many copies of R, and let s: R^∞ → R^∞ be the shift operator $s(x_{1},x_{2},x_{3},x_{4},\ldots )=(x_{2},x_{3},x_{4},\ldots ).$ Then the kernel of s is the one-dimensional subspace consisting of all vectors (x₁, 0, 0, 0, ...).
If V is an inner product space and W is a subspace, the kernel of the orthogonal projection V → W is the orthogonal complement to W in V.
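
Two of the infinite-dimensional examples above can be imitated in finite dimensions. The sketch below makes simplifying assumptions that are not part of the examples themselves: differentiation is restricted to polynomials of degree at most 3, stored as coefficient lists [a0, a1, a2, a3], and the shift operator acts on finite tuples standing in for truncated sequences. The function names are illustrative.

```python
def derivative(coeffs):
    """Differentiate a0 + a1*x + a2*x^2 + a3*x^3, given as [a0, a1, a2, a3]."""
    return [k * coeffs[k] for k in range(1, len(coeffs))] + [0]

def shift(seq):
    """Shift operator on a finite tuple: drop the first entry."""
    return tuple(seq[1:])

constant = [7, 0, 0, 0]        # the constant polynomial 7
linear = [0, 1, 0, 0]          # the polynomial x

d_constant = derivative(constant)   # zero polynomial: constants are in the kernel
d_linear = derivative(linear)       # the polynomial 1: x is not in the kernel

s_first_only = shift((5, 0, 0, 0))  # only the first entry is nonzero: mapped to zero
s_other = shift((5, 1, 0, 0))       # a later nonzero entry survives the shift
print(d_constant, d_linear, s_first_only, s_other)
```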

Computation by Gaussian elimination

A basis of the kernel of a matrix may be computed by Gaussian elimination.

For this purpose, given an m × n matrix A, we first construct the row-augmented matrix ${\begin{bmatrix}A\\\hline I\end{bmatrix}},$ where I is the n × n identity matrix.

Computing its column echelon form by Gaussian elimination (or any other suitable method), we get a matrix ${\begin{bmatrix}B\\\hline C\end{bmatrix}}.$ A basis of the kernel of A consists of the non-zero columns of C such that the corresponding column of B is a zero column.

In fact, the computation may be stopped as soon as the upper matrix is in column echelon form: the remainder of the computation consists in changing the basis of the vector space generated by the columns whose upper part is zero.

For example, suppose that $A={\begin{bmatrix}1&0&-3&0&2&-8\\0&1&5&0&-1&4\\0&0&0&1&7&-9\\0&0&0&0&0&0\end{bmatrix}}.$ Then ${\begin{bmatrix}A\\\hline I\end{bmatrix}}={\begin{bmatrix}1&0&-3&0&2&-8\\0&1&5&0&-1&4\\0&0&0&1&7&-9\\0&0&0&0&0&0\\\hline 1&0&0&0&0&0\\0&1&0&0&0&0\\0&0&1&0&0&0\\0&0&0&1&0&0\\0&0&0&0&1&0\\0&0&0&0&0&1\end{bmatrix}}.$

Putting the upper part in column echelon form by column operations on the whole matrix gives ${\begin{bmatrix}B\\\hline C\end{bmatrix}}={\begin{bmatrix}1&0&0&0&0&0\\0&1&0&0&0&0\\0&0&1&0&0&0\\0&0&0&0&0&0\\\hline 1&0&0&3&-2&8\\0&1&0&-5&1&-4\\0&0&0&1&0&0\\0&0&1&0&-7&9\\0&0&0&0&1&0\\0&0&0&0&0&1\end{bmatrix}}.$

The last three columns of B are zero columns. Therefore, the last three columns of C, $\left[\!\!{\begin{array}{r}3\\-5\\1\\0\\0\\0\end{array}}\right],\;\left[\!\!{\begin{array}{r}-2\\1\\0\\-7\\1\\0\end{array}}\right],\;\left[\!\!{\begin{array}{r}8\\-4\\0\\9\\0\\1\end{array}}\right]$ are a basis of the kernel of A.

Proof that the method computes the kernel: Since column operations correspond to post-multiplication by invertible matrices, the fact that ${\begin{bmatrix}A\\\hline I\end{bmatrix}}$ reduces to ${\begin{bmatrix}B\\\hline C\end{bmatrix}}$ means that there exists an invertible matrix $P$ such that ${\begin{bmatrix}A\\\hline I\end{bmatrix}}P={\begin{bmatrix}B\\\hline C\end{bmatrix}},$ with $B$ in column echelon form. Thus $AP=B$, $IP=C$, and $AC=B$. A column vector $\mathbf {v}$ belongs to the kernel of $A$ (that is, $A\mathbf {v} =\mathbf {0}$) if and only if $B\mathbf {w} =\mathbf {0} ,$ where $\mathbf {w} =P^{-1}\mathbf {v} =C^{-1}\mathbf {v}$. As $B$ is in column echelon form, $B\mathbf {w} =\mathbf {0}$ if and only if the nonzero entries of $\mathbf {w}$ correspond to the zero columns of $B$. By multiplying by $C$, one may deduce that this is the case if and only if $\mathbf {v} =C\mathbf {w}$ is a linear combination of the corresponding columns of $C$.
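
The column-reduction method can be sketched in a few lines. The code below is an illustrative implementation under stated assumptions, not a reference one: the function name `kernel_by_column_reduction` is hypothetical, exact `Fraction` arithmetic is used, and the reduction stops once the upper block is in column echelon form, as permitted by the remark above. It is applied to the 4 × 6 example matrix of this section.

```python
from fractions import Fraction

def kernel_by_column_reduction(A):
    """Column-reduce the stacked matrix [A; I]; return the lower parts of the
    columns whose upper part has become zero."""
    m, n = len(A), len(A[0])
    # Store [A; I] column by column; each column has m + n entries.
    cols = [[Fraction(A[i][j]) for i in range(m)] +
            [Fraction(int(i == j)) for i in range(n)] for j in range(n)]
    lead = 0  # position of the next pivot column
    for r in range(m):
        p = next((j for j in range(lead, n) if cols[j][r] != 0), None)
        if p is None:
            continue  # row r has no pivot among the remaining columns
        cols[lead], cols[p] = cols[p], cols[lead]
        # Clear row r in every column to the right of the pivot column.
        for j in range(lead + 1, n):
            f = cols[j][r] / cols[lead][r]
            if f != 0:
                cols[j] = [a - f * b for a, b in zip(cols[j], cols[lead])]
        lead += 1
    return [c[m:] for c in cols if all(x == 0 for x in c[:m])]

A = [[1, 0, -3, 0, 2, -8],
     [0, 1, 5, 0, -1, 4],
     [0, 0, 0, 1, 7, -9],
     [0, 0, 0, 0, 0, 0]]

basis = kernel_by_column_reduction(A)
products = [[sum(a * x for a, x in zip(row, v)) for row in A] for v in basis]
print(basis, products)
```

On this input the sketch returns the same three vectors as the worked example, and each product Av is the zero vector.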

Numerical computation

The problem of computing the kernel on a computer depends on the nature of the coefficients.

Exact coefficients

If the coefficients of the matrix are exactly given numbers, the column echelon form of the matrix may be computed with the Bareiss algorithm more efficiently than with Gaussian elimination. It is even more efficient to use modular arithmetic and the Chinese remainder theorem, which reduces the problem to several similar ones over finite fields (this avoids the overhead induced by the non-linearity of the computational complexity of integer multiplication).^{[citation needed]}

For coefficients in a finite field, Gaussian elimination works well, but for the large matrices that occur in cryptography and Gröbner basis computation, better algorithms are known, which have roughly the same computational complexity, but are faster and behave better with modern computer hardware.^{[citation needed]}
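
For a small matrix over a prime field, plain Gaussian elimination modulo p is enough. The following sketch assumes p is prime and Python 3.8 or later (for the modular inverse `pow(x, -1, p)`); the function name `kernel_mod_p` and the 2 × 2 matrix are hypothetical. It does not implement the Bareiss algorithm or Chinese-remainder reconstruction mentioned above.

```python
def kernel_mod_p(A, p):
    """Basis of the kernel of A over the field of integers modulo a prime p."""
    m = [[x % p for x in row] for row in A]
    n, pivots, r = len(m[0]), [], 0
    for c in range(n):
        piv = next((i for i in range(r, len(m)) if m[i][c]), None)
        if piv is None:
            continue  # free column
        m[r], m[piv] = m[piv], m[r]
        inv = pow(m[r][c], -1, p)          # modular inverse of the pivot
        m[r] = [x * inv % p for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c]:
                f = m[i][c]
                m[i] = [(a - f * b) % p for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [0] * n
        v[free] = 1
        for row, pc in enumerate(pivots):
            v[pc] = (-m[row][free]) % p
        basis.append(v)
    return basis

A = [[1, 2], [3, 1]]            # determinant -5: singular modulo 5 only
kernel_mod_5 = kernel_mod_p(A, 5)
kernel_mod_7 = kernel_mod_p(A, 7)
print(kernel_mod_5, kernel_mod_7)
```

The same integer matrix has a one-dimensional kernel modulo 5 and a trivial kernel modulo 7, which shows that the kernel depends on the field of coefficients.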

Floating point computation

For matrices whose entries are floating-point numbers, the problem of computing the kernel makes sense only for matrices such that the number of rows is equal to their rank: because of rounding errors, a floating-point matrix almost always has full rank, even when it is an approximation of a matrix of a much smaller rank. Even for a full-rank matrix, it is possible to compute its kernel only if it is well conditioned, i.e. it has a low condition number.^[5]^{[citation needed]}
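
The effect of rounding on rank can be seen on a tiny example. In this sketch, the 2 × 2 matrix is a hypothetical one whose two rows are equal in exact arithmetic (since 1/10 + 2/10 = 3/10), so its exact rank is 1 and its determinant is 0. The same matrix written with binary floating-point numbers has a small nonzero determinant and therefore full rank.

```python
from fractions import Fraction

def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

# Exact arithmetic: both rows equal (3/10, 1), so the matrix is singular.
exact = [[Fraction(1, 10) + Fraction(2, 10), Fraction(1)],
         [Fraction(3, 10), Fraction(1)]]

# Floating point: 0.1 + 0.2 is not equal to 0.3, so the rows differ slightly.
floating = [[0.1 + 0.2, 1.0],
            [0.3, 1.0]]

det_exact = det2(exact)
det_float = det2(floating)
print(det_exact, det_float)
```

The floating-point determinant is tiny but nonzero, so an algorithm that tests for exact zeros would report a trivial kernel for a matrix that was intended to be singular.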

Even for a well-conditioned full-rank matrix, Gaussian elimination does not behave correctly: it introduces rounding errors that are too large to give a significant result. As the computation of the kernel of a matrix is a special instance of solving a homogeneous system of linear equations, the kernel may be computed with any of the various algorithms designed to solve homogeneous systems. State-of-the-art software for this purpose is the LAPACK library.^{[citation needed]}

Notes and references

[1] Weisstein, Eric W. "Kernel". mathworld.wolfram.com. Retrieved 2019-12-09.
[2] "Kernel (Nullspace) | Brilliant Math & Science Wiki". brilliant.org. Retrieved 2019-12-09.
[3] Linear algebra, as discussed in this article, is a very well established mathematical discipline for which there are many sources. Almost all of the material in this article can be found in Lay 2005, Meyer 2001, and Strang's lectures.
[4] Weisstein, Eric W. "Rank-Nullity Theorem". mathworld.wolfram.com. Retrieved 2019-12-09.
[5] "Archived copy" (PDF). Archived from the original (PDF) on 2017-08-29. Retrieved 2015-04-14.

Bibliography

Axler, Sheldon Jay (1997), Linear Algebra Done Right (2nd ed.), Springer-Verlag, ISBN 0-387-98259-0.
Lay, David C. (2005), Linear Algebra and Its Applications (3rd ed.), Addison Wesley, ISBN 978-0-321-28713-7.
Meyer, Carl D. (2001), Matrix Analysis and Applied Linear Algebra, Society for Industrial and Applied Mathematics (SIAM), ISBN 978-0-89871-454-8, archived from the original on 2009-10-31.
Poole, David (2006), Linear Algebra: A Modern Introduction (2nd ed.), Brooks/Cole, ISBN 0-534-99845-3.
Anton, Howard (2005), Elementary Linear Algebra (Applications Version) (9th ed.), Wiley International.
Leon, Steven J. (2006), Linear Algebra With Applications (7th ed.), Pearson Prentice Hall.
Lang, Serge (1987), Linear Algebra, Springer, ISBN 9780387964126.
Trefethen, Lloyd N.; Bau, David III (1997), Numerical Linear Algebra, SIAM, ISBN 978-0-89871-361-9.

External links

Wikibooks has a book on the topic of: Linear Algebra/Null Spaces

Retrieved from "https://en.wikipedia.org/w/index.php?title=Kernel_(linear_algebra)&oldid=1323774073"