Kernel (linear algebra)
from Wikipedia
An example of a kernel: the linear operator maps every point on the line to the zero vector, so these points form the kernel of the linear operator.

In mathematics, the kernel of a linear map, also known as the null space or nullspace, is the part of the domain which is mapped to the zero vector of the co-domain; the kernel is always a linear subspace of the domain.[1] That is, given a linear map L : V → W between two vector spaces V and W, the kernel of L is the vector space of all elements v of V such that L(v) = 0, where 0 denotes the zero vector in W,[2] or more symbolically:
$$\ker(L) = \{\mathbf{v} \in V \mid L(\mathbf{v}) = \mathbf{0}\} = L^{-1}(\mathbf{0}).$$

Properties

Kernel and image of a linear map L from V to W

The kernel of L is a linear subspace of the domain V.[3][2] In the linear map L : V → W, two elements of V have the same image in W if and only if their difference lies in the kernel of L, that is,
$$L(\mathbf{v}_1) = L(\mathbf{v}_2) \iff L(\mathbf{v}_1 - \mathbf{v}_2) = \mathbf{0}.$$

From this, it follows by the first isomorphism theorem that the image of L is isomorphic to the quotient of V by the kernel:
$$\operatorname{im}(L) \cong V / \ker(L).$$
In the case where V is finite-dimensional, this implies the rank–nullity theorem:
$$\dim(\ker L) + \dim(\operatorname{im} L) = \dim(V),$$
where the term rank refers to the dimension of the image of L, while nullity refers to the dimension of the kernel of L.[4] That is,
$$\operatorname{rank}(L) = \dim(\operatorname{im} L) \quad \text{and} \quad \operatorname{nullity}(L) = \dim(\ker L),$$
so that the rank–nullity theorem can be restated as
$$\operatorname{rank}(L) + \operatorname{nullity}(L) = \dim(V).$$

When V is an inner product space, the quotient $V / \ker(L)$ can be identified with the orthogonal complement in V of $\ker(L)$. This is the generalization to linear operators of the row space, or coimage, of a matrix.

Generalization to modules


The notion of kernel also makes sense for homomorphisms of modules, which are generalizations of vector spaces where the scalars are elements of a ring, rather than a field. The domain of the mapping is a module, with the kernel constituting a submodule. Here, the concepts of rank and nullity do not necessarily apply.

In functional analysis


If V and W are topological vector spaces such that W is finite-dimensional, then a linear operator L : V → W is continuous if and only if the kernel of L is a closed subspace of V.

Representation as matrix multiplication


Consider a linear map represented as an m × n matrix A with coefficients in a field K (typically the field of real or complex numbers), operating on column vectors x with n components over K. The kernel of this linear map is the set of solutions to the equation Ax = 0, where 0 is understood as the zero vector. The dimension of the kernel of A is called the nullity of A. In set-builder notation,
$$\operatorname{Null}(A) = \ker(A) = \{\mathbf{x} \in K^n \mid A\mathbf{x} = \mathbf{0}\}.$$
The matrix equation is equivalent to a homogeneous system of linear equations:
$$A\mathbf{x} = \mathbf{0} \;\Leftrightarrow\; \begin{cases} a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = 0 \\ a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = 0 \\ \qquad\vdots \\ a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n = 0. \end{cases}$$
Thus the kernel of A is the same as the solution set to the above homogeneous equations.
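As a concrete aid, here is a minimal sketch (assuming SymPy is available; the matrix is an arbitrary example, not one taken from the text) of computing this solution set in exact arithmetic:

```python
# Minimal sketch (assumes SymPy): the kernel of A is the solution set of A x = 0,
# which SymPy returns as a list of basis vectors.
from sympy import Matrix

A = Matrix([[1, 2, 3],
            [2, 4, 6]])              # rank 1, so the nullity is 3 - 1 = 2

basis = A.nullspace()                # basis of {x : A x = 0}
for v in basis:
    assert A * v == Matrix([0, 0])   # every basis vector is mapped to the zero vector
print(len(basis))                    # 2: two basis vectors, (-2, 1, 0) and (-3, 0, 1)
```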

Subspace properties


The kernel of an m × n matrix A over a field K is a linear subspace of $K^n$. That is, the kernel of A, the set Null(A), has the following three properties:

  1. Null(A) always contains the zero vector, since A0 = 0.
  2. If x ∈ Null(A) and y ∈ Null(A), then x + y ∈ Null(A), since A(x + y) = Ax + Ay = 0 + 0 = 0. This follows from the distributivity of matrix multiplication over addition.
  3. If x ∈ Null(A) and c is a scalar cK, then cx ∈ Null(A), since A(cx) = c(Ax) = c0 = 0.

The row space of a matrix


The product Ax can be written in terms of the dot product of vectors as follows:
$$A\mathbf{x} = \begin{bmatrix} \mathbf{a}_1 \cdot \mathbf{x} \\ \mathbf{a}_2 \cdot \mathbf{x} \\ \vdots \\ \mathbf{a}_m \cdot \mathbf{x} \end{bmatrix}.$$

Here, a1, ... , am denote the rows of the matrix A. It follows that x is in the kernel of A, if and only if x is orthogonal (or perpendicular) to each of the row vectors of A (since orthogonality is defined as having a dot product of 0).

The row space, or coimage, of a matrix A is the span of the row vectors of A. By the above reasoning, the kernel of A is the orthogonal complement to the row space. That is, a vector x lies in the kernel of A, if and only if it is perpendicular to every vector in the row space of A.

The dimension of the row space of A is called the rank of A, and the dimension of the kernel of A is called the nullity of A. These quantities are related by the rank–nullity theorem[4]
$$\operatorname{rank}(A) + \operatorname{nullity}(A) = n.$$
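A minimal numerical sketch of this relation (assuming NumPy; the matrix is an arbitrary rank-2 example):

```python
# Minimal sketch (assumes NumPy): rank(A) + nullity(A) = n for an m x n matrix.
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0],
              [7.0, 8.0, 9.0]])       # third row = 2 * second - first, so rank 2

rank = np.linalg.matrix_rank(A)       # dimension of the row (or column) space
nullity = A.shape[1] - rank           # dimension of the kernel
print(rank, nullity)                  # 2 1
```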

Left null space


The left null space, or cokernel, of a matrix A consists of all column vectors x such that x^T A = 0^T, where ^T denotes the transpose of a matrix. The left null space of A is the same as the kernel of A^T. The left null space of A is the orthogonal complement to the column space of A, and is dual to the cokernel of the associated linear transformation. The kernel, the row space, the column space, and the left null space of A are the four fundamental subspaces associated with the matrix A.

Nonhomogeneous systems of linear equations


The kernel also plays a role in the solution to a nonhomogeneous system of linear equations:
$$A\mathbf{x} = \mathbf{b}.$$
If u and v are two possible solutions to the above equation, then
$$A(\mathbf{u} - \mathbf{v}) = A\mathbf{u} - A\mathbf{v} = \mathbf{b} - \mathbf{b} = \mathbf{0}.$$
Thus, the difference of any two solutions to the equation Ax = b lies in the kernel of A.

It follows that any solution to the equation Ax = b can be expressed as the sum of a fixed solution v and an arbitrary element of the kernel. That is, the solution set to the equation Ax = b is
$$\{\mathbf{v} + \mathbf{x} \mid A\mathbf{v} = \mathbf{b} \;\text{and}\; \mathbf{x} \in \operatorname{Null}(A)\}.$$
Geometrically, this says that the solution set to Ax = b is the translation of the kernel of A by the vector v. See also Fredholm alternative and flat (geometry).
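A minimal sketch of this structure of the solution set (assuming NumPy and SciPy; the particular system is hypothetical):

```python
# Minimal sketch (assumes NumPy/SciPy): every solution of A x = b is a fixed
# particular solution plus an element of the kernel of A.
import numpy as np
from scipy.linalg import null_space

A = np.array([[1.0, 2.0, 3.0],
              [0.0, 1.0, 1.0]])
b = np.array([6.0, 2.0])

v, *_ = np.linalg.lstsq(A, b, rcond=None)   # one particular solution of A v = b
N = null_space(A)                           # orthonormal basis of ker(A), shape (3, 1)

x = v + 5.0 * N[:, 0]                       # translate by an arbitrary kernel element
print(np.allclose(A @ x, b))                # True: x still solves A x = b
```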

Illustration


The following is a simple illustration of the computation of the kernel of a matrix (see § Computation by Gaussian elimination, below for methods better suited to more complex calculations). The illustration also touches on the row space and its relation to the kernel.

Consider the matrix
$$A = \begin{bmatrix} 2 & 3 & 5 \\ -4 & 2 & 3 \end{bmatrix}.$$
The kernel of this matrix consists of all vectors (x, y, z) ∈ $\mathbb{R}^3$ for which
$$\begin{bmatrix} 2 & 3 & 5 \\ -4 & 2 & 3 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix},$$
which can be expressed as a homogeneous system of linear equations involving x, y, and z:
$$\begin{aligned} 2x + 3y + 5z &= 0, \\ -4x + 2y + 3z &= 0. \end{aligned}$$

The same linear equations can also be written in matrix form as:
$$\left[\begin{array}{ccc|c} 2 & 3 & 5 & 0 \\ -4 & 2 & 3 & 0 \end{array}\right].$$

Through Gauss–Jordan elimination, the matrix can be reduced to:
$$\left[\begin{array}{ccc|c} 1 & 0 & 1/16 & 0 \\ 0 & 1 & 13/8 & 0 \end{array}\right].$$

Rewriting the matrix in equation form yields:
$$x = -\tfrac{1}{16}z, \qquad y = -\tfrac{13}{8}z.$$

The elements of the kernel can be further expressed in parametric vector form, as follows:
$$\begin{bmatrix} x \\ y \\ z \end{bmatrix} = c \begin{bmatrix} -1/16 \\ -13/8 \\ 1 \end{bmatrix}.$$

Since c is a free variable ranging over all real numbers, this can be expressed equally well as:
$$\begin{bmatrix} x \\ y \\ z \end{bmatrix} = c \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix}.$$
The kernel of A is precisely the solution set to these equations (in this case, a line through the origin in $\mathbb{R}^3$). Here, the vector (−1, −26, 16)^T constitutes a basis of the kernel of A. The nullity of A is therefore 1, as the kernel is spanned by a single vector.

The following dot products are zero:
$$\begin{bmatrix} 2 & 3 & 5 \end{bmatrix} \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix} = 0, \qquad \begin{bmatrix} -4 & 2 & 3 \end{bmatrix} \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix} = 0,$$
which illustrates that vectors in the kernel of A are orthogonal to each of the row vectors of A.

These two (linearly independent) row vectors span the row space of A—a plane orthogonal to the vector (−1, −26, 16)^T.

With the rank of A equal to 2, the nullity of A equal to 1, and the number of columns of A equal to 3, we have an illustration of the rank–nullity theorem.
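Assuming SymPy is available, a minimal check of the computations in this illustration in exact arithmetic:

```python
# Minimal sketch (assumes SymPy): rank 2, nullity 1, and a kernel basis
# proportional to (-1, -26, 16) for the matrix of the illustration above.
from sympy import Matrix

A = Matrix([[2, 3, 5],
            [-4, 2, 3]])

print(A.rank())                      # 2
print(A.nullspace())                 # one basis vector, proportional to (-1, -26, 16)
v = Matrix([-1, -26, 16])
print(A * v)                         # the zero vector: v lies in the kernel
```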

Examples

  • If $L : \mathbb{R}^m \to \mathbb{R}^n$, then the kernel of L is the solution set to a homogeneous system of linear equations. As in the above illustration, if L is the operator $L(x_1, x_2, x_3) = (2x_1 + 3x_2 + 5x_3,\; -4x_1 + 2x_2 + 3x_3)$, then the kernel of L is the set of solutions to the equations $2x_1 + 3x_2 + 5x_3 = 0$ and $-4x_1 + 2x_2 + 3x_3 = 0$.
  • Let C[0,1] denote the vector space of all continuous real-valued functions on the interval [0,1], and define $L : C[0,1] \to \mathbb{R}$ by the rule $L(f) = f(0.3)$. Then the kernel of L consists of all functions f ∈ C[0,1] for which f(0.3) = 0.
  • Let $C^\infty(\mathbb{R})$ be the vector space of all infinitely differentiable functions $\mathbb{R} \to \mathbb{R}$, and let $D : C^\infty(\mathbb{R}) \to C^\infty(\mathbb{R})$ be the differentiation operator $D(f) = \frac{df}{dx}$. Then the kernel of D consists of all functions in $C^\infty(\mathbb{R})$ whose derivatives are zero, i.e. the set of all constant functions.
  • Let $\mathbb{R}^\infty$ be the direct product of infinitely many copies of $\mathbb{R}$, and let $s : \mathbb{R}^\infty \to \mathbb{R}^\infty$ be the shift operator $s(x_1, x_2, x_3, x_4, \ldots) = (x_2, x_3, x_4, \ldots)$. Then the kernel of s is the one-dimensional subspace consisting of all vectors $(x_1, 0, 0, 0, \ldots)$.
  • If V is an inner product space and W is a subspace, the kernel of the orthogonal projection V → W is the orthogonal complement to W in V.

Computation by Gaussian elimination


A basis of the kernel of a matrix may be computed by Gaussian elimination.

For this purpose, given an m × n matrix A, we construct first the row augmented matrix
$$\begin{bmatrix} A \\ \hline I \end{bmatrix},$$
where I is the n × n identity matrix.

Computing its column echelon form by Gaussian elimination (or any other suitable method), we get a matrix
$$\begin{bmatrix} B \\ \hline C \end{bmatrix}.$$
A basis of the kernel of A consists of the nonzero columns of C such that the corresponding column of B is a zero column.

In fact, the computation may be stopped as soon as the upper matrix is in column echelon form: the remainder of the computation consists in changing the basis of the vector space generated by the columns whose upper part is zero.

For example, suppose that
$$A = \begin{bmatrix} 1 & 0 & -3 & 0 & 2 & -8 \\ 0 & 1 & 5 & 0 & -1 & 4 \\ 0 & 0 & 0 & 1 & 7 & -9 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{bmatrix}.$$
Then
$$\begin{bmatrix} A \\ \hline I \end{bmatrix} = \begin{bmatrix} 1 & 0 & -3 & 0 & 2 & -8 \\ 0 & 1 & 5 & 0 & -1 & 4 \\ 0 & 0 & 0 & 1 & 7 & -9 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \hline 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \end{bmatrix}.$$

Putting the upper part in column echelon form by column operations on the whole matrix gives
$$\begin{bmatrix} B \\ \hline C \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ \hline 1 & 0 & 0 & 3 & -2 & 8 \\ 0 & 1 & 0 & -5 & 1 & -4 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 & -7 & 9 \\ 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \end{bmatrix}.$$

The last three columns of B are zero columns. Therefore, the three last vectors of C,
$$\begin{bmatrix} 3 \\ -5 \\ 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}, \quad \begin{bmatrix} -2 \\ 1 \\ 0 \\ -7 \\ 1 \\ 0 \end{bmatrix}, \quad \begin{bmatrix} 8 \\ -4 \\ 0 \\ 9 \\ 0 \\ 1 \end{bmatrix},$$
are a basis of the kernel of A.

Proof that the method computes the kernel: Since column operations correspond to post-multiplication by invertible matrices, the fact that $\begin{bmatrix} A \\ \hline I \end{bmatrix}$ reduces to $\begin{bmatrix} B \\ \hline C \end{bmatrix}$ means that there exists an invertible matrix $P$ such that $\begin{bmatrix} A \\ \hline I \end{bmatrix} P = \begin{bmatrix} B \\ \hline C \end{bmatrix}$, with $B$ in column echelon form. Thus $AP = B$, $IP = C$, and $AC = B$. A column vector $\mathbf{v}$ belongs to the kernel of $A$ (that is, $A\mathbf{v} = \mathbf{0}$) if and only if $B\mathbf{w} = \mathbf{0}$, where $\mathbf{w} = P^{-1}\mathbf{v}$. As $B$ is in column echelon form, $B\mathbf{w} = \mathbf{0}$ if and only if the nonzero entries of $\mathbf{w}$ correspond to the zero columns of $B$. By multiplying by $C$, one may deduce that this is the case if and only if $\mathbf{v} = C\mathbf{w}$ is a linear combination of the corresponding columns of $C$.
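This procedure can be sketched in a few lines of plain Python with exact fractions (a minimal illustration, not a production implementation); the columns of the stacked matrix [A; I] are manipulated directly, and the example matrix is the one used above:

```python
# Minimal sketch (standard library only): kernel basis via the augmented-identity
# column-elimination method, using exact Fraction arithmetic.
from fractions import Fraction


def kernel_basis(A):
    m, n = len(A), len(A[0])
    # Column j of the stacked matrix [A; I] is (column j of A) followed by e_j.
    cols = [[Fraction(A[i][j]) for i in range(m)] + [Fraction(i == j) for i in range(n)]
            for j in range(n)]
    next_col = 0
    for row in range(m):                        # eliminate using the upper block only
        pivot = next((j for j in range(next_col, n) if cols[j][row] != 0), None)
        if pivot is None:
            continue
        cols[next_col], cols[pivot] = cols[pivot], cols[next_col]
        for j in range(n):
            if j != next_col and cols[j][row] != 0:
                factor = cols[j][row] / cols[next_col][row]
                cols[j] = [a - factor * b for a, b in zip(cols[j], cols[next_col])]
        next_col += 1
    # Columns whose upper part became zero carry kernel vectors in their lower part.
    return [c[m:] for c in cols if all(x == 0 for x in c[:m])]


A = [[1, 0, -3, 0, 2, -8],
     [0, 1, 5, 0, -1, 4],
     [0, 0, 0, 1, 7, -9],
     [0, 0, 0, 0, 0, 0]]
for v in kernel_basis(A):
    print(v)   # (3, -5, 1, 0, 0, 0), (-2, 1, 0, -7, 1, 0), (8, -4, 0, 9, 0, 1)
```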

Numerical computation


The problem of computing the kernel on a computer depends on the nature of the coefficients.

Exact coefficients


If the coefficients of the matrix are exactly given numbers, the column echelon form of the matrix may be computed with the Bareiss algorithm more efficiently than with Gaussian elimination. It is even more efficient to use modular arithmetic and the Chinese remainder theorem, which reduce the problem to several similar ones over finite fields (this avoids the overhead induced by the non-linearity of the computational complexity of integer multiplication).[citation needed]

For coefficients in a finite field, Gaussian elimination works well, but for the large matrices that occur in cryptography and Gröbner basis computation, better algorithms are known, which have roughly the same computational complexity, but are faster and behave better with modern computer hardware.[citation needed]

Floating point computation


For matrices whose entries are floating-point numbers, the problem of computing the kernel makes sense only for matrices whose number of rows equals their rank: because of rounding errors, a floating-point matrix almost always has full rank, even when it is an approximation of a matrix of much smaller rank. Even for a full-rank matrix, it is possible to compute its kernel only if it is well conditioned, i.e. it has a low condition number.[5][citation needed]

Even for a well-conditioned full-rank matrix, Gaussian elimination does not behave correctly: it introduces rounding errors that are too large for getting a significant result. As the computation of the kernel of a matrix is a special instance of solving a homogeneous system of linear equations, the kernel may be computed with any of the various algorithms designed to solve homogeneous systems. State-of-the-art software for this purpose is the LAPACK library.[citation needed]
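A minimal floating-point sketch (assuming NumPy and SciPy, both of which call LAPACK underneath; the matrix and the tolerance rule are illustrative) of computing an approximate kernel via the SVD with a rank tolerance:

```python
# Minimal sketch (assumes NumPy/SciPy): in floating point the kernel is
# approximated from an SVD, treating singular values below a tolerance as zero.
import numpy as np
from scipy.linalg import null_space

A = np.array([[2.0, 3.0, 5.0],
              [-4.0, 2.0, 3.0]])           # the matrix from the illustration above

U, s, Vt = np.linalg.svd(A)                # full SVD: Vt is 3 x 3
tol = max(A.shape) * np.finfo(float).eps * s[0]
rank = int(np.sum(s > tol))                # numerical rank
kernel = Vt[rank:].T                       # right singular vectors for "zero" sigma
print(kernel.ravel())                      # proportional to (-1, -26, 16)
print(null_space(A).ravel())               # SciPy's convenience routine agrees
```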

from Grokipedia
In linear algebra, the kernel of a linear transformation $T: V \to W$ between vector spaces, also known as the null space, is defined as the set of all vectors $\mathbf{v} \in V$ such that $T(\mathbf{v}) = \mathbf{0}$, where $\mathbf{0}$ is the zero vector in $W$. The kernel is always a vector subspace of the domain $V$, containing the zero vector and closed under addition and scalar multiplication. For a linear transformation represented by an $m \times n$ matrix $A$, the kernel consists of all vectors $\mathbf{x} \in \mathbb{R}^n$ (or the appropriate field) satisfying $A\mathbf{x} = \mathbf{0}$, and its dimension, called the nullity, equals $n - r$, where $r$ is the rank of $A$ (the dimension of the image). This relationship is formalized in the rank–nullity theorem, which states that for any linear transformation $T: V \to W$, $\dim(\ker(T)) + \dim(\operatorname{im}(T)) = \dim(V)$. The kernel plays a central role in understanding linear dependence, solvability of systems of equations, and the structure of the four fundamental subspaces associated with a matrix, where it is orthogonal to the row space. If the kernel is trivial (containing only the zero vector), the transformation is injective; otherwise, it reveals the redundancies in the mapping.

Definition and Fundamentals

Definition for linear transformations

In linear algebra, the kernel of a linear transformation captures the set of vectors in the domain that map to the zero vector in the codomain, serving as a fundamental concept for understanding the behavior of linear maps. Formally, let $T: V \to W$ be a linear transformation between vector spaces $V$ and $W$ over a field $F$, where $V$ and $W$ are finite-dimensional unless otherwise specified. The kernel of $T$, denoted $\ker(T)$, is defined as the set
$$\ker(T) = \{ v \in V \mid T(v) = 0_W \},$$
with $0_W$ denoting the zero vector in $W$. This set consists precisely of all vectors in $V$ that are sent to the origin under $T$. Alternative notations for the kernel include $N(T)$ and $\operatorname{null}(T)$, reflecting its equivalence to the null space of the transformation. The kernel is always non-empty, as it contains at least the zero vector of the domain $V$. To see this, note that linearity of $T$ implies $T(0_V) = 0_W$, where $0_V$ is the zero vector in $V$. Thus, $0_V \in \ker(T)$. The kernel forms a vector subspace of $V$.

Kernel as a vector subspace

The kernel of a linear transformation $T: V \to W$ between vector spaces is a subspace of the domain $V$. To establish this, verify the subspace properties. The zero vector $\mathbf{0}_V$ belongs to $\ker(T)$ because linearity implies $T(\mathbf{0}_V) = \mathbf{0}_W$. For closure under addition, if $\mathbf{u}, \mathbf{v} \in \ker(T)$, then $T(\mathbf{u} + \mathbf{v}) = T(\mathbf{u}) + T(\mathbf{v}) = \mathbf{0}_W + \mathbf{0}_W = \mathbf{0}_W$, so $\mathbf{u} + \mathbf{v} \in \ker(T)$. Similarly, for scalar multiplication, if $c$ is a scalar and $\mathbf{u} \in \ker(T)$, then $T(c\mathbf{u}) = c\,T(\mathbf{u}) = c\,\mathbf{0}_W = \mathbf{0}_W$, so $c\mathbf{u} \in \ker(T)$. Thus, $\ker(T)$ satisfies the axioms of a vector subspace of $V$. As a subspace, $\ker(T) \subseteq V$ and comprises precisely the solution set to the homogeneous equation $T(\mathbf{v}) = \mathbf{0}_W$. The kernel is trivial, $\ker(T) = \{\mathbf{0}_V\}$, if and only if $T$ is injective, as injectivity ensures only the zero vector maps to $\mathbf{0}_W$, and conversely, a trivial kernel implies distinct inputs yield distinct outputs. Additionally, $\ker(T) = V$ if $T$ is the zero transformation, since every vector in $V$ maps to $\mathbf{0}_W$.

Algebraic Properties

Dimension and rank-nullity theorem

The dimension of the kernel of a linear transformation $T: V \to W$ between finite-dimensional vector spaces is called the nullity of $T$, denoted $\operatorname{nullity}(T) = \dim(\ker(T))$. The rank–nullity theorem establishes a fundamental relationship between the nullity and the dimension of the image of $T$. Specifically, if $V$ and $W$ are finite-dimensional vector spaces and $T: V \to W$ is a linear transformation, then
$$\dim(V) = \dim(\ker(T)) + \dim(\operatorname{im}(T)).$$
Equivalently, denoting $\operatorname{rank}(T) = \dim(\operatorname{im}(T))$, the theorem states that
$$\operatorname{nullity}(T) + \operatorname{rank}(T) = \dim(V).$$
To sketch the proof, suppose $\{v_1, \dots, v_k\}$ is a basis for $\ker(T)$. Extend this to a basis $\{v_1, \dots, v_k, u_1, \dots, u_m\}$ for $V$. The set $\{T(u_1), \dots, T(u_m)\}$ is linearly independent (since if $\sum c_i T(u_i) = 0$, then $\sum c_i u_i \in \ker(T)$, so the $c_i = 0$) and spans $\operatorname{im}(T)$ (since any $T(v)$ with $v \in V$ equals $\sum a_i T(u_i)$). Thus $\dim(\operatorname{im}(T)) = m$, and $\dim(V) = k + m = \dim(\ker(T)) + \dim(\operatorname{im}(T))$. As corollaries, $T$ is injective if and only if $\ker(T) = \{0\}$ (equivalently, $\operatorname{nullity}(T) = 0$). Moreover, assuming $\dim(W)$ is finite, $T$ is surjective if and only if $\dim(\ker(T)) = \dim(V) - \dim(W)$ (equivalently, $\operatorname{rank}(T) = \dim(W)$).

Invariance under change of basis

The kernel of a linear transformation $T: V \to W$ between finite-dimensional vector spaces is defined as $\ker(T) = \{ \mathbf{v} \in V \mid T(\mathbf{v}) = \mathbf{0}_W \}$, which forms a subspace of the domain $V$. This definition relies solely on the abstract properties of the transformation and the vectors themselves, without reference to coordinates or specific bases for $V$ or $W$. Consequently, $\ker(T)$ is an intrinsic geometric object associated with $T$, independent of any choice of bases.

To illustrate this invariance, consider matrix representations of $T$. Let $A$ be the matrix of $T$ with respect to bases $\mathcal{B}$ for $V$ and $\mathcal{C}$ for $W$. A vector $\mathbf{v} \in V$ satisfies $T(\mathbf{v}) = \mathbf{0}_W$ if and only if the coordinate vector $[\mathbf{v}]_{\mathcal{B}}$ satisfies $A [\mathbf{v}]_{\mathcal{B}} = \mathbf{0}$. Now suppose the bases are changed to $\mathcal{B}'$ for $V$ and $\mathcal{C}'$ for $W$, with change-of-basis matrices $P$ (such that $[\mathbf{v}]_{\mathcal{B}} = P [\mathbf{v}]_{\mathcal{B}'}$) and $Q$ (such that coordinates in $\mathcal{C}$ equal $Q$ times coordinates in $\mathcal{C}'$). The new matrix representation is $A' = Q^{-1} A P$. Then $T(\mathbf{v}) = \mathbf{0}_W$ if and only if $A [\mathbf{v}]_{\mathcal{B}} = \mathbf{0}$, which, after substituting, becomes $A P [\mathbf{v}]_{\mathcal{B}'} = \mathbf{0}$, or equivalently $Q^{-1} A P [\mathbf{v}]_{\mathcal{B}'} = \mathbf{0}$, hence $A' [\mathbf{v}]_{\mathcal{B}'} = \mathbf{0}$ since $Q$ is invertible. This equivalence confirms that the condition $T(\mathbf{v}) = \mathbf{0}_W$ holds for precisely the same vectors $\mathbf{v} \in V$, regardless of the bases chosen, so $\ker(T)$ remains unchanged as a subspace. This basis independence underscores the kernel's role as a fundamental invariant of the linear transformation, distinct from its matrix representations, which vary with the choice of bases.

In applications, such as the analysis of invariant subspaces, the kernel serves as the prototypical example of a $T$-invariant subspace (where $T(\ker(T)) = \{\mathbf{0}\}$), enabling the decomposition of $V$ into invariant components that preserve structural properties under $T$. This property facilitates the study of decompositions like the rational canonical form, where kernels of powers of $T$ help identify cyclic subspaces invariant under $T$.
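The following is a minimal numerical sketch of these two facts (assuming NumPy and SciPy; the matrices are hypothetical, generated at random): the rank and nullity sum to the dimension of the domain, and the kernel, as a subspace, survives a change of bases.

```python
# Minimal sketch (assumes NumPy/SciPy; the matrices are illustrative only).
import numpy as np
from scipy.linalg import null_space

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))
A[2] = A[0] + A[1]                     # force rank 2, so the kernel is nontrivial

rank = np.linalg.matrix_rank(A)
nullity = null_space(A).shape[1]
print(rank + nullity == A.shape[1])    # True: rank + nullity = dim(domain)

# Change of bases: A' = Q^{-1} A P represents the same map; x lies in ker(A')
# exactly when P x lies in ker(A), so the kernel is the same subspace of V.
P = rng.standard_normal((4, 4))        # generic random matrices are invertible
Q = rng.standard_normal((3, 3))
A_prime = np.linalg.inv(Q) @ A @ P
x = null_space(A_prime)[:, 0]
print(np.allclose(A @ (P @ x), 0))     # True
```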
Matrix Representation

Kernel via matrix multiplication

In the context of matrix representations of linear transformations, the kernel of an $m \times n$ matrix $A$ over a field $F$, denoted $\ker(A)$, is defined as the set of all column vectors $x \in F^n$ such that $Ax = 0_m$, where $0_m$ is the zero vector in $F^m$. This set consists of all vectors in the domain that are mapped to the zero vector in the codomain under the linear transformation represented by $A$. The equation $Ax = 0$ forms a homogeneous system of $m$ linear equations in $n$ unknowns, whose solution set is precisely $\ker(A)$. As established in the fundamentals of linear algebra, this solution set is always a vector subspace of $F^n$. The columns of $A$ are linearly independent if and only if $\ker(A) = \{0\}$, meaning the only solution to $Ax = 0$ is the zero vector. For a square $n \times n$ matrix $A$, the kernel is trivial (i.e., $\ker(A) = \{0\}$) if and only if $A$ is invertible.

Relation to row space and nullity

The kernel of a matrix $A \in \mathbb{R}^{m \times n}$, denoted $\ker(A)$, is intimately related to the row space of $A$ through the rank–nullity theorem, which states that the dimension of the kernel equals the number of columns minus the rank of $A$: $\dim(\ker(A)) = n - \operatorname{rank}(A)$, where $\operatorname{rank}(A) = \dim(\operatorname{Row}(A))$. The row space of $A$, denoted $\operatorname{Row}(A)$, is the subspace of $\mathbb{R}^n$ spanned by the row vectors of $A$. Equivalently, $\operatorname{Row}(A)$ is the column space of the transpose $A^T$, so $\operatorname{Row}(A) = \operatorname{Col}(A^T)$. The nullity of $A$, defined as $\operatorname{nullity}(A) = \dim(\ker(A))$, corresponds to the number of free variables in the solution set of the homogeneous equation $Ax = 0$ after performing row reduction on $A$. In the row echelon form of $A$, the number of free variables is $n$ minus the number of pivot positions, directly yielding the nullity. In inner product spaces equipped with the standard dot product, the kernel $\ker(A)$ is orthogonal to the row space $\operatorname{Row}(A)$, meaning every vector in $\ker(A)$ is perpendicular to every vector in $\operatorname{Row}(A)$. This orthogonality follows from the fact that $Ax = 0$ implies the dot product of $x$ with each row of $A$ is zero.

Left kernel and its distinctions

The left kernel of an $m \times n$ matrix $A$ over a field $F$ is defined as the set of all vectors $y \in F^m$ satisfying $y^T A = 0_n^T$. This subspace consists of vectors in the codomain that, when multiplied from the left, yield the zero vector. Unlike the standard (right) kernel, which captures vectors in the domain mapped to zero by $A$, the left kernel operates on the codomain and is equivalently the kernel of the transpose $A^T$, denoted $\ker(A^T)$. This equivalence arises because $y^T A = 0$ if and only if $A^T y = 0$. The dimension of the left kernel is $m - \operatorname{rank}(A)$, following from the rank–nullity theorem applied to $A^T$ and the fact that $\operatorname{rank}(A^T) = \operatorname{rank}(A)$. Over fields supporting an inner product, such as the reals or complexes, the left kernel forms the orthogonal complement to the column space of $A$. Thus, any vector in the left kernel is perpendicular to every column of $A$, providing a dual perspective to the row space relations for the standard kernel. In applications like least squares problems, the left kernel characterizes the residuals, as the error vector must lie in this space to ensure orthogonality to the column space.
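As a rough illustration (assuming NumPy and SciPy; the matrix is an arbitrary rank-2 example), the kernel and the left kernel can both be obtained from `scipy.linalg.null_space`, applied to $A$ and to $A^T$:

```python
# Minimal sketch (assumes NumPy/SciPy): kernel vs. left kernel of a rank-2 matrix.
import numpy as np
from scipy.linalg import null_space

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],          # = 2 * first row, so rank(A) = 2
              [1.0, 0.0, 1.0]])

N = null_space(A)                       # kernel: orthogonal to every row of A
LN = null_space(A.T)                    # left kernel: orthogonal to every column

print(np.allclose(A @ N, 0))            # True
print(np.allclose(LN.T @ A, 0))         # True: y^T A = 0 for y in the left kernel
print(N.shape[1], LN.shape[1])          # 1 1  (both nullities equal 3 - rank = 1)
```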
Computational Methods

Exact computation using Gaussian elimination

To compute the kernel of an $m \times n$ matrix $A$ over a field supporting exact arithmetic, such as the rational numbers, Gaussian elimination is applied to solve the homogeneous equation $Ax = 0$. This method transforms $A$ into an equivalent row echelon form $R$ through elementary row operations—interchanging rows, scaling a row by a nonzero scalar from the field, and adding a scalar multiple of one row to another—which do not alter the solution set of the system. The row echelon form $R$ features leading nonzero entries (pivots) that are staggered to the right in successive nonzero rows, with zeros below each pivot, facilitating the identification of dependent and independent variables. The columns of $R$ without pivots correspond to free variables, while those with pivots correspond to pivot variables (or basic variables). The number of free variables equals the nullity of $A$, which is $n$ minus the rank (the number of pivots). To find a basis for the kernel, introduce parameters for the free variables, expressing each pivot variable as a linear combination of these parameters via back-substitution starting from the bottom row of $R$. A basis consists of $n - \operatorname{rank}(A)$ vectors, each obtained by setting one free variable to 1 and all others to 0, then computing the corresponding values for the pivot variables; these special solutions span the kernel.

Consider a workflow for a $3 \times 4$ matrix $A$. After Gaussian elimination, suppose $R$ has pivots in columns 1 and 3, leaving columns 2 and 4 free. The system $Rx = 0$ then involves two equations with parameters $s$ for $x_2$ and $t$ for $x_4$, allowing expression of $x_1$ and $x_3$ in terms of $s$ and $t$. The basis vectors are constructed by first setting $s = 1$, $t = 0$ to get one vector with 1 in the second position and the solved values elsewhere, then setting $s = 0$, $t = 1$ for the second vector with 1 in the fourth position. This parametric representation yields a basis of dimension 2 for the kernel.

The computational complexity of Gaussian elimination for this purpose is $O(m n^2)$ arithmetic operations when solving a system with $m$ equations in $n$ variables, dominated by the elimination steps across columns. This assumes exact arithmetic to avoid rounding issues, ensuring the kernel basis is computed precisely without approximation.
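A minimal sketch of this special-solutions construction (assuming SymPy; the $3 \times 4$ matrix below is a hypothetical example whose pivots land in columns 1 and 3, as in the workflow just described):

```python
# Minimal sketch (assumes SymPy): special solutions of A x = 0 read off the RREF.
from sympy import Matrix, zeros

A = Matrix([[1, 2, 0, 1],
            [0, 0, 1, 3],
            [1, 2, 1, 4]])               # third row = first + second, rank 2

R, pivots = A.rref()                     # pivots == (0, 2): columns 1 and 3
free = [j for j in range(A.cols) if j not in pivots]

basis = []
for f in free:                           # one special solution per free variable
    x = zeros(A.cols, 1)
    x[f] = 1
    for row, p in enumerate(pivots):     # pivot variables come from the RREF rows
        x[p] = -R[row, f]
    basis.append(x)

for v in basis:
    print(v.T, (A * v).T)                # the second factor is always the zero row
```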
Numerical computation with floating-point arithmetic

In floating-point arithmetic, computing the kernel of a matrix introduces challenges due to round-off errors, which can result in small non-zero residuals for vectors that should ideally satisfy $Ax = 0$. These errors arise because finite-precision representations, such as IEEE 754 double precision, approximate real numbers, leading to perturbations in matrix entries during operations like subtraction or multiplication. To address this, numerical algorithms employ a tolerance parameter, often set to $\tau = \max(m,n) \cdot \epsilon \cdot \|A\|$, where $\epsilon$ is machine epsilon and $\|A\|$ is a matrix norm, to identify near-zero pivots or singular values as effectively zero, thus determining the numerical rank and the kernel dimension.

Robust methods for approximating the kernel in floating-point settings include QR decomposition and singular value decomposition (SVD), both of which provide numerically stable ways to reveal the kernel structure. In QR decomposition with column pivoting, the kernel basis is obtained from the orthogonal factor $Q$ corresponding to the zero (or near-zero) diagonal entries in the upper triangular factor $R$; specifically, for a matrix $A \in \mathbb{R}^{m \times n}$ with $m \geq n$, the last $n - r$ columns of $Q$ from the decomposition of $A^T$ form an orthonormal basis for the kernel, where $r$ is the numerical rank. This approach is backward stable, meaning the computed factorization satisfies $A + \Delta A = QR$ with $\|\Delta A\| = O(\epsilon \|A\|)$, avoiding significant error amplification. SVD offers even higher reliability for kernel approximation, as the right singular vectors associated with singular values below the tolerance span the kernel; the decomposition $A = U \Sigma V^T$ ensures that the columns of $V$ corresponding to zero singular values provide an orthonormal kernel basis, with stability guaranteed under the same perturbation bounds. Rank determination via SVD is particularly effective for ill-conditioned matrices, as singular values decay gradually, allowing precise separation using the tolerance.

To enhance stability in QR-based kernel computation, algorithms often use Householder reflections for orthogonalization, which introduce minimal round-off errors compared to classical Gram–Schmidt. A Householder reflection $H = I - 2vv^T/(v^T v)$ zeros out subdiagonal elements in a column while preserving the Euclidean norm, and successive applications yield the QR factors without squaring the condition number, unlike LU decomposition. This method ensures that the computed kernel vectors satisfy $\|Ax\|/\|A\| \leq O(\epsilon)$ for unit vectors $x$ in the approximate kernel, mitigating error growth in ill-conditioned cases.

Implementations in numerical software libraries handle these computations efficiently. In MATLAB, the `null(A)` function computes an orthonormal basis for the kernel using SVD, applying a default tolerance of `max(size(A)) * eps(norm(A,2))` to account for floating-point precision and returning vectors with residuals below this threshold. Similarly, LAPACK provides routines like `dgeqp3` for rank-revealing QR and `dgesvd` for SVD, which users combine to extract kernel bases; these are designed for high-performance floating-point operations on dense matrices, ensuring stability across hardware.
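A minimal sketch of the QR-of-the-transpose approach described above (assuming NumPy and SciPy; the matrix and the tolerance rule are illustrative): the trailing columns of $Q$ from a pivoted QR factorization of $A^T$ give an orthonormal basis for the kernel.

```python
# Minimal sketch (assumes NumPy/SciPy): approximate kernel from a pivoted QR of A^T.
import numpy as np
from scipy.linalg import qr

A = np.array([[2.0, 3.0, 5.0],
              [-4.0, 2.0, 3.0],
              [-2.0, 5.0, 8.0]])              # third row = first + second, rank 2

Q, R, piv = qr(A.T, pivoting=True)            # A^T P = Q R, with Q an n x n orthogonal factor
tol = max(A.shape) * np.finfo(float).eps * abs(R[0, 0])
r = int(np.sum(np.abs(np.diag(R)) > tol))     # numerical rank from the diagonal of R
kernel = Q[:, r:]                             # last n - r columns span ker(A)

print(np.allclose(A @ kernel, 0))             # True
```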
Examples and Illustrations

Concrete matrix examples

To illustrate the kernel of a linear transformation represented by a matrix, consider specific small matrices and compute their kernels by solving the homogeneous equation $Ax = 0$. For the $2 \times 2$ identity matrix
$$I_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix},$$
the equation $I_2 \mathbf{x} = \mathbf{0}$ yields $\mathbf{x} = \mathbf{0}$ as the only solution, so the kernel is $\{\mathbf{0}\}$, with nullity 0.

Next, examine the $2 \times 3$ matrix
$$A = \begin{pmatrix} 1 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}.$$
Solving $A \mathbf{x} = \mathbf{0}$ gives the system $x_1 + x_3 = 0$ and $x_2 = 0$, so $x_1 = -x_3$, $x_2 = 0$, with $x_3$ free. The kernel is spanned by the basis vector $\begin{pmatrix} -1 \\ 0 \\ 1 \end{pmatrix}$, and the nullity is 1. To verify, compute
$$A \begin{pmatrix} -1 \\ 0 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \cdot (-1) + 0 \cdot 0 + 1 \cdot 1 \\ 0 \cdot (-1) + 1 \cdot 0 + 0 \cdot 1 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}.$$

For a singular $3 \times 3$ matrix, consider
$$B = \begin{pmatrix} 1 & 2 & 3 \\ 2 & 4 & 6 \\ 3 & 6 & 9 \end{pmatrix},$$
which has rank 1. Solving $B \mathbf{x} = \mathbf{0}$ (via row reduction to identify the free variables $x_2$ and $x_3$) yields solutions of the form $x_1 = -2x_2 - 3x_3$, with $x_2$ and $x_3$ free. A basis for the kernel is $\left\{ \begin{pmatrix} -2 \\ 1 \\ 0 \end{pmatrix}, \begin{pmatrix} -3 \\ 0 \\ 1 \end{pmatrix} \right\}$, and the nullity is 2. Verification for the first basis vector:
$$B \begin{pmatrix} -2 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 1 \cdot (-2) + 2 \cdot 1 + 3 \cdot 0 \\ 2 \cdot (-2) + 4 \cdot 1 + 6 \cdot 0 \\ 3 \cdot (-2) + 6 \cdot 1 + 9 \cdot 0 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}.$$
Similarly, for the second basis vector:
$$B \begin{pmatrix} -3 \\ 0 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \cdot (-3) + 2 \cdot 0 + 3 \cdot 1 \\ 2 \cdot (-3) + 4 \cdot 0 + 6 \cdot 1 \\ 3 \cdot (-3) + 6 \cdot 0 + 9 \cdot 1 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}.$$
Geometric interpretations

Geometrically, the kernel of a linear transformation $T: V \to W$ between finite-dimensional vector spaces represents the subspace of $V$ consisting of all vectors that are mapped to the zero vector in $W$, effectively capturing the "directions" in $V$ that are collapsed or annihilated by $T$. This subspace always passes through the origin, as linearity ensures $T(0) = 0$. In low-dimensional Euclidean spaces, these kernels manifest as familiar geometric objects such as lines or planes, providing intuition for how transformations distort or preserve spatial structure.

A classic example occurs with the projection map $T: \mathbb{R}^2 \to \mathbb{R}^1$ defined by $T(x, y) = x$, which projects points onto the x-axis. Here, the kernel is the set $\{(0, y) \mid y \in \mathbb{R}\}$, the y-axis, forming a line through the origin in the plane. Vectors along this line are invisible under the projection, as they contribute nothing to the output coordinate. Similarly, for a projection $T: \mathbb{R}^3 \to \mathbb{R}^2$ onto the xy-plane given by $T(x, y, z) = (x, y)$, the kernel is the z-axis $\{(0, 0, z) \mid z \in \mathbb{R}\}$, again a line through the origin, illustrating how the transformation ignores the height dimension.

In higher ranks, kernels can fill larger subspaces; for instance, consider a rank-1 linear map $T: \mathbb{R}^3 \to \mathbb{R}^2$ such as $T(x, y, z) = (x + y + z, 0)$. The kernel consists of all vectors satisfying $x + y + z = 0$, which is a plane through the origin in $\mathbb{R}^3$, perpendicular to the vector $(1, 1, 1)$. This plane represents the two-dimensional set of directions mapped to zero, with the codimension of the kernel matching the rank of $T$. Conversely, when $T$ is injective, its kernel is the trivial subspace $\{0\}$, the single origin point, meaning no non-zero vector is collapsed and all directions are preserved distinctly.

More abstractly, the kernel aligns with the concept of fibers of a map: it is precisely the fiber over the zero element of the codomain, $T^{-1}(\{0\})$, the preimage of the origin under $T$. This perspective underscores the kernel's role as the solution set of the homogeneous equation $T(\mathbf{v}) = \mathbf{0}$, generalizing the geometric picture of collapse to arbitrary dimensions while retaining intuitive visualizations in low-dimensional cases.

Generalizations

Kernels over modules

In the setting of modules over a commutative ring $R$, the kernel of a homomorphism $\phi: M \to N$ between $R$-modules $M$ and $N$ is defined as $\ker(\phi) = \{ m \in M \mid \phi(m) = 0_N \}$, where $0_N$ denotes the zero element of $N$. This set forms an $R$-submodule of $M$, as it is closed under addition and scalar multiplication by elements of $R$.

A key property is that the kernel always constitutes a submodule, enabling the formation of quotient modules. The first isomorphism theorem for modules states that if $\phi: M \to N$ is an $R$-module homomorphism, then $M / \ker(\phi) \cong \operatorname{im}(\phi)$, where $\operatorname{im}(\phi)$ is the image submodule of $N$. This isomorphism identifies the structure of the quotient by the kernel with the actual image of the map.

Unlike the case of vector spaces over fields, where every subspace (including kernels) admits a basis, kernels of module homomorphisms may lack a basis and fail to be free modules. For instance, over the ring $R = \mathbb{Z}$, consider the $\mathbb{Z}$-module $M = \mathbb{Z}/6\mathbb{Z}$ and the endomorphism $\phi: M \to M$ given by multiplication by 3, so $\phi(m) = 3m \bmod 6$. The kernel is $\{ m \in \mathbb{Z}/6\mathbb{Z} \mid 3m \equiv 0 \pmod 6 \} = \{0, 2, 4\}$, which is isomorphic to $\mathbb{Z}/3\mathbb{Z}$, a torsion module that is not free and thus has no basis.
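A tiny sketch of this module example (plain Python; the brute-force enumeration is only for illustration) confirms that the kernel of multiplication by 3 on $\mathbb{Z}/6\mathbb{Z}$ is $\{0, 2, 4\}$:

```python
# Minimal sketch (standard library only): kernel of the map m -> 3*m on Z/6Z.
n, k = 6, 3
kernel = [m for m in range(n) if (k * m) % n == 0]
image = sorted({(k * m) % n for m in range(n)})
print(kernel)   # [0, 2, 4]: a submodule isomorphic to Z/3Z, torsion, not free
print(image)    # [0, 3]: by the first isomorphism theorem, (Z/6Z)/ker is isomorphic to the image
```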
In homological algebra, kernels are fundamental to the study of chain complexes and exact sequences of modules. For a chain complex $\cdots \to M_{n+1} \xrightarrow{d_{n+1}} M_n \xrightarrow{d_n} M_{n-1} \to \cdots$, the $n$th homology group is defined as $H_n = \ker(d_n) / \operatorname{im}(d_{n+1})$, capturing the failure of exactness at $M_n$ and enabling computations of derived functors and invariants in algebraic topology and geometry.

Kernels in functional analysis

In functional analysis, the kernel of a bounded linear operator $T: X \to Y$ between Banach spaces $X$ and $Y$ is the set $\ker(T) = \{ x \in X \mid Tx = 0 \}$, which forms a subspace of $X$. Since bounded linear operators are continuous, their kernels are always closed subspaces. This closedness distinguishes kernels of bounded operators from those of unbounded operators, where the kernel may fail to be closed. A key property is that finite-dimensional subspaces of Banach spaces are always closed, so finite-dimensional kernels are closed regardless of the operator's boundedness. However, infinite codimension does not by itself guarantee closedness for arbitrary subspaces; counterexamples exist where dense, proper subspaces have infinite codimension but are not closed. For bounded operators, the kernel's closedness holds even in infinite dimensions, facilitating applications like the open mapping theorem, which relates the kernel to the operator's range.

Examples illustrate these concepts. Consider the differentiation operator $D = \frac{d}{dx}$, defined on the dense subspace of continuously differentiable functions $C^1[0,1]$ in the Banach space $C[0,1]$ of continuous functions on $[0,1]$ with the sup norm, mapping to $C[0,1]$; this is an unbounded linear operator whose kernel consists of the constant functions, a one-dimensional closed subspace. In contrast, integral operators are typically bounded. For instance, the Fredholm integral operator $(Tf)(x) = \int_0^1 K(x,y) f(y) \, dy$ on $L^2[0,1]$, where $K$ is a square-integrable kernel, has a kernel comprising the functions $f$ orthogonal to the range of the adjoint operator, and this kernel is closed as a consequence of boundedness.

Kernels play a central role in spectral theory for bounded linear operators on Banach spaces. The resolvent set consists of scalars $\lambda$ for which $T - \lambda I$ is invertible, implying $\ker(T - \lambda I) = \{0\}$; the spectrum includes the point spectrum, where $\ker(T - \lambda I) \neq \{0\}$, corresponding to eigenvalues whose eigenspaces are the associated kernels. This extends the finite-dimensional notion of eigenvalues to infinite dimensions, with closed kernels ensuring well-defined spectral projections in many cases.