Hubbry Logo
Rank (linear algebra)Rank (linear algebra)Main
Open search
Rank (linear algebra)
Community hub
Rank (linear algebra)
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Rank (linear algebra)
Rank (linear algebra)
from Wikipedia

In linear algebra, the rank of a matrix A is the dimension of the vector space generated (or spanned) by its columns.[1][2][3] This corresponds to the maximal number of linearly independent columns of A. This, in turn, is identical to the dimension of the vector space spanned by its rows.[4] Rank is thus a measure of the "nondegenerateness" of the system of linear equations and linear transformation encoded by A. There are multiple equivalent definitions of rank. A matrix's rank is one of its most fundamental characteristics.

The rank is commonly denoted by rank(A) or rk(A);[2] sometimes the parentheses are not written, as in rank A.[i]

Main definitions

[edit]

In this section, we give some definitions of the rank of a matrix. Many definitions are possible; see Alternative definitions for several of these.

The column rank of A is the dimension of the column space of A, while the row rank of A is the dimension of the row space of A.

A fundamental result in linear algebra is that the column rank and the row rank are always equal. (Three proofs of this result are given in § Proofs that column rank = row rank, below.) This number (i.e., the number of linearly independent rows or columns) is simply called the rank of A.

A matrix is said to have full rank if its rank equals the largest possible for a matrix of the same dimensions, which is the lesser of the number of rows and columns. A matrix is said to be rank-deficient if it does not have full rank. The rank deficiency of a matrix is the difference between the lesser of the number of rows and columns, and the rank.

The rank of a linear map or operator is defined as the dimension of its image:[5][6][7][8]where is the dimension of a vector space, and is the image of a map.

Examples

[edit]

The matrix has rank 2: the first two columns are linearly independent, so the rank is at least 2, but since the third is a linear combination of the first two (the first column plus the second), the three columns are linearly dependent so the rank must be less than 3.

The matrix has rank 1: there are nonzero columns, so the rank is positive, but any pair of columns is linearly dependent. Similarly, the transpose of A has rank 1. Indeed, since the column vectors of A are the row vectors of the transpose of A, the statement that the column rank of a matrix equals its row rank is equivalent to the statement that the rank of a matrix is equal to the rank of its transpose, i.e., rank(A) = rank(AT).

Computing the rank of a matrix

[edit]

Rank from row echelon forms

[edit]

A common approach to finding the rank of a matrix is to reduce it to a simpler form, generally row echelon form, by elementary row operations. Row operations do not change the row space (hence do not change the row rank), and, being invertible, map the column space to an isomorphic space (hence do not change the column rank). Once in row echelon form, the rank is clearly the same for both row rank and column rank, and equals the number of pivots (or basic columns) and also the number of non-zero rows.

For example, the matrix A given by can be put in reduced row-echelon form by using the following elementary row operations: The final matrix (in reduced row echelon form) has two non-zero rows and thus the rank of matrix A is 2.

Computation

[edit]

When applied to floating point computations on computers, basic Gaussian elimination (LU decomposition) can be unreliable, and a rank-revealing decomposition should be used instead. An effective alternative is the singular value decomposition (SVD), but there are other less computationally expensive choices, such as QR decomposition with pivoting (so-called rank-revealing QR factorization), which are still more numerically robust than Gaussian elimination. Numerical determination of rank requires a criterion for deciding when a value, such as a singular value from the SVD, should be treated as zero, a practical choice which depends on both the matrix and the application.

Proofs that column rank = row rank

[edit]

Proof using row reduction

[edit]

The fact that the column and row ranks of any matrix are equal is fundamental in linear algebra. Many proofs have been given. One of the most elementary ones has been sketched in § Rank from row echelon forms. Here is a variant of this proof:

It is straightforward to show that neither the row rank nor the column rank are changed by an elementary row operation. As Gaussian elimination proceeds by elementary row operations, the reduced row echelon form of a matrix has the same row rank and the same column rank as the original matrix. Further elementary column operations allow putting the matrix in the form of an identity matrix possibly bordered by rows and columns of zeros. Again, this changes neither the row rank nor the column rank. It is immediate that both the row and column ranks of this resulting matrix is the number of its nonzero entries.

We present two other proofs of this result. The first uses only basic properties of linear combinations of vectors, and is valid over any field. The proof is based upon Wardlaw (2005).[9] The second uses orthogonality and is valid for matrices over the real numbers; it is based upon Mackiw (1995).[4] Both proofs can be found in the book by Banerjee and Roy (2014).[10]

Proof using linear combinations

[edit]

Let A be an m × n matrix. Let the column rank of A be r, and let c1, ..., cr be any basis for the column space of A. Place these as the columns of an m × r matrix C. Every column of A can be expressed as a linear combination of the r columns in C. This means that there is an r × n matrix R such that A = CR. R is the matrix whose ith column is formed from the coefficients giving the ith column of A as a linear combination of the r columns of C. In other words, R is the matrix which contains the multiples for the bases of the column space of A (which is C), which are then used to form A as a whole. Now, each row of A is given by a linear combination of the r rows of R. Therefore, the rows of R form a spanning set of the row space of A and, by the Steinitz exchange lemma, the row rank of A cannot exceed r. This proves that the row rank of A is less than or equal to the column rank of A. This result can be applied to any matrix, so apply the result to the transpose of A. Since the row rank of the transpose of A is the column rank of A and the column rank of the transpose of A is the row rank of A, this establishes the reverse inequality and we obtain the equality of the row rank and the column rank of A. (Also see Rank factorization.)

Proof using orthogonality

[edit]

Let A be an m × n matrix with entries in the real numbers whose row rank is r. Therefore, the dimension of the row space of A is r. Let x1, x2, ..., xr be a basis of the row space of A. We claim that the vectors Ax1, Ax2, ..., Axr are linearly independent. To see why, consider a linear homogeneous relation involving these vectors with scalar coefficients c1, c2, ..., cr: where v = c1x1 + c2x2 + ⋯ + crxr. We make two observations: (a) v is a linear combination of vectors in the row space of A, which implies that v belongs to the row space of A, and (b) since Av = 0, the vector v is orthogonal to every row vector of A and, hence, is orthogonal to every vector in the row space of A. The facts (a) and (b) together imply that v is orthogonal to itself, which proves that v = 0 or, by the definition of v, But recall that the xi were chosen as a basis of the row space of A and so are linearly independent. This implies that c1 = c2 = ⋯ = cr = 0. It follows that Ax1, Ax2, ..., Axr are linearly independent.

Every Axi is in the column space of A. So, Ax1, Ax2, ..., Axr is a set of r linearly independent vectors in the column space of A and, hence, the dimension of the column space of A (i.e., the column rank of A) must be at least as big as r. This proves that row rank of A is no larger than the column rank of A. Now apply this result to the transpose of A to get the reverse inequality and conclude as in the previous proof.

Alternative definitions

[edit]

In all the definitions in this section, the matrix A is taken to be an m × n matrix over an arbitrary field F.

Dimension of image

[edit]

Given the matrix , there is an associated linear mapping defined by The rank of is the dimension of the image of . This definition has the advantage that it can be applied to any linear map without need for a specific matrix.

Rank in terms of nullity

[edit]

Given the same linear mapping f as above, the rank is n minus the dimension of the kernel of f. The rank–nullity theorem states that this definition is equivalent to the preceding one.

Column rank – dimension of column space

[edit]

The rank of A is the maximal number of linearly independent columns of A; this is the dimension of the column space of A (the column space being the subspace of Fm generated by the columns of A, which is in fact just the image of the linear map f associated to A).

Row rank – dimension of row space

[edit]

The rank of A is the maximal number of linearly independent rows of A; this is the dimension of the row space of A.

Decomposition rank

[edit]

The rank of A is the smallest positive integer k such that A can be factored as , where C is an m × k matrix and R is a k × n matrix. In fact, for all integers k, the following are equivalent:

  1. the column rank of A is less than or equal to k,
  2. there exist k columns of size m such that every column of A is a linear combination of ,
  3. there exist an matrix C and a matrix R such that (when k is the rank, this is a rank factorization of A),
  4. there exist k rows of size n such that every row of A is a linear combination of ,
  5. the row rank of A is less than or equal to k.

Indeed, the following equivalences are obvious: . For example, to prove (3) from (2), take C to be the matrix whose columns are from (2). To prove (2) from (3), take to be the columns of C.

It follows from the equivalence that the row rank is equal to the column rank.

As in the case of the "dimension of image" characterization, this can be generalized to a definition of the rank of any linear map: the rank of a linear map f : VW is the minimal dimension k of an intermediate space X such that f can be written as the composition of a map VX and a map XW. Unfortunately, this definition does not suggest an efficient manner to compute the rank (for which it is better to use one of the alternative definitions). See rank factorization for details.

Rank in terms of singular values

[edit]

The rank of A equals the number of non-zero singular values, which is the same as the number of non-zero diagonal elements in Σ in the singular value decomposition .

Determinantal rank – size of largest non-vanishing minor

[edit]

The rank of A is the largest order of any non-zero minor in A. (The order of a minor is the side-length of the square sub-matrix of which it is the determinant.) Like the decomposition rank characterization, this does not give an efficient way of computing the rank, but it is useful theoretically: a single non-zero minor witnesses a lower bound (namely its order) for the rank of the matrix, which can be useful (for example) to prove that certain operations do not lower the rank of a matrix.

A non-vanishing p-minor (p × p submatrix with non-zero determinant) shows that the rows and columns of that submatrix are linearly independent, and thus those rows and columns of the full matrix are linearly independent (in the full matrix), so the row and column rank are at least as large as the determinantal rank; however, the converse is less straightforward. The equivalence of determinantal rank and column rank is a strengthening of the statement that if the span of n vectors has dimension p, then p of those vectors span the space (equivalently, that one can choose a spanning set that is a subset of the vectors): the equivalence implies that a subset of the rows and a subset of the columns simultaneously define an invertible submatrix (equivalently, if the span of n vectors has dimension p, then p of these vectors span the space and there is a set of p coordinates on which they are linearly independent).

Tensor rank – minimum number of simple tensors

[edit]

The rank of A is the smallest number k such that A can be written as a sum of k rank 1 matrices, where a matrix is defined to have rank 1 if and only if it can be written as a nonzero product of a column vector c and a row vector r. This notion of rank is called tensor rank; it can be generalized in the separable models interpretation of the singular value decomposition.

Properties

[edit]

We assume that A is an m × n matrix, and we define the linear map f by f(x) = Ax as above.

  • The rank of an m × n matrix is a nonnegative integer and cannot be greater than either m or n. That is, A matrix that has rank min(m, n) is said to have full rank; otherwise, the matrix is rank deficient.
  • Only a zero matrix has rank zero.
  • f is injective (or "one-to-one") if and only if A has rank n (in this case, we say that A has full column rank).
  • f is surjective (or "onto") if and only if A has rank m (in this case, we say that A has full row rank).
  • If A is a square matrix (i.e., m = n), then A is invertible if and only if A has rank n (that is, A has full rank).
  • If B is any n × k matrix, then
  • If B is an n × k matrix of rank n, then
  • If C is an l × m matrix of rank m, then
  • The rank of A is equal to r if and only if there exists an invertible m × m matrix X and an invertible n × n matrix Y such that where Ir denotes the r × r identity matrix and the three zero matrices have the sizes r × (nr), (mr) × r and (mr) × (nr).
  • Sylvester’s rank inequality: if A is an m × n matrix and B is n × k, then[ii] This is a special case of the next inequality.
  • The inequality due to Frobenius: if AB, ABC and BC are defined, then[iii]
  • Subadditivity: when A and B are of the same dimension. As a consequence, a rank-k matrix can be written as the sum of k rank-1 matrices, but not fewer.
  • The rank of a matrix plus the nullity of the matrix equals the number of columns of the matrix. (This is the rank–nullity theorem.)
  • If A is a matrix over the real numbers then the rank of A and the rank of its corresponding Gram matrix are equal. Thus, for real matrices This can be shown by proving equality of their null spaces. The null space of the Gram matrix is given by vectors x for which If this condition is fulfilled, we also have [11]
  • If A is a matrix over the complex numbers and denotes the complex conjugate of A and A the conjugate transpose of A (i.e., the adjoint of A), then

Applications

[edit]

One useful application of calculating the rank of a matrix is the computation of the number of solutions of a system of linear equations. According to the Rouché–Capelli theorem, the system is inconsistent if the rank of the augmented matrix is greater than the rank of the coefficient matrix. If on the other hand, the ranks of these two matrices are equal, then the system must have at least one solution. The solution is unique if and only if the rank equals the number of variables. Otherwise the general solution has k free parameters where k is the difference between the number of variables and the rank. In this case (and assuming the system of equations is in the real or complex numbers) the system of equations has infinitely many solutions.

In control theory, the rank of a matrix can be used to determine whether a linear system is controllable, or observable.

In the field of communication complexity, the rank of the communication matrix of a function gives bounds on the amount of communication needed for two parties to compute the function.

Generalization

[edit]

There are different generalizations of the concept of rank to matrices over arbitrary rings, where column rank, row rank, dimension of column space, and dimension of row space of a matrix may be different from the others or may not exist.

Thinking of matrices as tensors, the tensor rank generalizes to arbitrary tensors; for tensors of order greater than 2 (matrices are order 2 tensors), rank is very hard to compute, unlike for matrices.

There is a notion of rank for smooth maps between smooth manifolds. It is equal to the linear rank of the derivative.

Matrices as tensors

[edit]

Matrix rank should not be confused with tensor order, which is called tensor rank. Tensor order is the number of indices required to write a tensor, and thus matrices all have tensor order 2. More precisely, matrices are tensors of type (1,1), having one row index and one column index, also called covariant order 1 and contravariant order 1; see Tensor (intrinsic definition) for details.

The tensor rank of a matrix can also mean the minimum number of simple tensors necessary to express the matrix as a linear combination, and that this definition does agree with matrix rank as here discussed.

See also

[edit]

Notes

[edit]

References

[edit]

Sources

[edit]

Further reading

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
In linear algebra, the rank of a matrix ARm×nA \in \mathbb{R}^{m \times n}, denoted rank(A)\operatorname{rank}(A), is defined as the of its column , which is the span of its column vectors. Equivalently, the rank is the of the row , the span of its row vectors, and these two dimensions are always equal—a fundamental property known as the equality of row rank and column rank. This value also represents the maximum number of linearly independent rows or columns in the matrix, and it satisfies rank(A)min(m,n)\operatorname{rank}(A) \leq \min(m, n). The rank plays a central role in the rank-nullity theorem, which states that for any matrix AA with nn columns, rank(A)+nullity(A)=n\operatorname{rank}(A) + \operatorname{nullity}(A) = n, where the nullity is the of the null space (the set of vectors mapped to the zero vector by the linear transformation represented by AA). In practice, the rank can be computed using , where it equals the number of nonzero rows (or pivot positions) in the of the matrix. Additionally, rank(A)=rank(AT)\operatorname{rank}(A) = \operatorname{rank}(A^T), and for a product of matrices, rank(AB)min(rank(A),rank(B))\operatorname{rank}(AB) \leq \min(\operatorname{rank}(A), \operatorname{rank}(B)). A matrix has full rank if rank(A)=min(m,n)\operatorname{rank}(A) = \min(m, n); for square matrices (m=nm = n), this implies invertibility and a nonzero determinant. The rank determines the solvability of linear systems Ax=bAx = b: the system is consistent if and only if bb lies in the column space of AA, with the solution space dimension equal to the nullity. Beyond solving equations, the rank is essential in applications such as data analysis (e.g., ), control theory, and optimization, where low-rank approximations reveal underlying structures in high-dimensional data.

Basic Concepts

Definition of Matrix Rank

The rank of a matrix provides an intuitive measure of its linear independence and dimensional complexity, capturing the extent to which its rows or columns span a subspace without redundancy. For a square matrix, full rank corresponds to the matrix being invertible, meaning it has no nontrivial kernel and represents a bijective linear transformation, whereas rank deficiency indicates noninvertibility, often arising in underdetermined systems or when rows/columns are linearly dependent. This concept distinguishes matrices that fully utilize their potential dimensionality from those constrained by dependencies, serving as a foundational invariant in linear algebra. Formally, for an m×nm \times n matrix AA over a field FF, the rank of AA, denoted rank(A)\operatorname{rank}(A) or ρ(A)\rho(A), is defined as the of the row space of AA (the subspace spanned by its rows) or equivalently the of the column space of AA (the subspace spanned by its columns). This definition holds over any field FF, including finite fields such as the integers modulo a prime. The row and column spaces are key subspaces associated with AA, whose dimensions both equal the rank. The term "rank" was introduced by in 1878, defining it as the order of the largest non-vanishing minor of a matrix.

Row Rank and Column Rank

The row rank of an m×nm \times n matrix AA over a field FF is defined as the of the row space of AA, which is the subspace of FnF^n spanned by the row vectors of AA. Similarly, the column rank of AA is the of the column space of AA, which is the subspace of FmF^m spanned by the column vectors of AA. In linear algebra, the span of a set of vectors {v1,,vk}\{ \mathbf{v}_1, \dots, \mathbf{v}_k \} in a is the set of all s of those vectors, forming a subspace. A of vectors is linearly independent if no vector in the can be expressed as a of the others, and a basis for a subspace is a linearly independent set that spans the subspace, with the being the number of vectors in such a basis. For the row space, denoted Row(A)\operatorname{Row}(A), this is formally Row(A)=span{r1,,rm}\operatorname{Row}(A) = \operatorname{span}\{ \mathbf{r}_1, \dots, \mathbf{r}_m \}, where ri\mathbf{r}_i are the row vectors of AA viewed as elements of FnF^n. Analogously, the column space Col(A)\operatorname{Col}(A) is span{c1,,cn}\operatorname{span}\{ \mathbf{c}_1, \dots, \mathbf{c}_n \}, with cj\mathbf{c}_j the column vectors in FmF^m. A fundamental theorem in linear algebra states that for any matrix AA, the row rank equals the column rank, both of which define the rank of AA. This equivalence holds over any field and underscores the intrinsic of the of the linear transformation represented by AA. The matrix AA has full rank if its row rank (or equivalently, column rank) equals min(m,n)\min(m, n), meaning the row space or column space achieves the maximum possible given the matrix dimensions.

Illustrative Examples

Low-Dimensional Matrices

To illustrate the concept of matrix rank, consider simple matrices, where the rank equals the number of linearly independent rows or columns, as established by the equality of row and column ranks. The identity matrix I2=(1001)I_2 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} has rank 2, as its two rows (or columns) are linearly independent, forming a basis for R2\mathbb{R}^2. This full rank indicates that the matrix spans the entire without any dependencies. In contrast, the zero matrix O2=(0000)O_2 = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix} has rank 0, since all rows (and columns) are linearly dependent, spanning only the trivial subspace {0}\{ \mathbf{0} \}. A singular matrix such as A=(1111)A = \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix} has rank 1, because the second row is a scalar multiple of the first (specifically, row 2 = 1 × row 1), demonstrating linear dependence, while the first row alone is nonzero and independent. Geometrically in two dimensions, the rank of a 2×2 matrix corresponds to the dimension of the subspace it spans: rank 0 maps to a single point (the origin), rank 1 to a line through the origin, and rank 2 to the full plane. For the singular matrix AA above, this manifests as a projection onto the line y=xy = x in the plane.

Full Rank and Rank Deficiency

For an m×nm \times n matrix AA, the rank satisfies rank(A)min(m,n)\operatorname{rank}(A) \leq \min(m, n), with equality holding when the matrix achieves full rank. A matrix has full row rank if mnm \leq n and rank(A)=m\operatorname{rank}(A) = m, meaning its rows are linearly independent; full column rank if nmn \leq m and rank(A)=n\operatorname{rank}(A) = n, meaning its columns are linearly independent; otherwise, it is rank deficient. Consider a 3×23 \times 2 matrix such as A=(232437),A = \begin{pmatrix} 2 & 3 \\ 2 & 4 \\ 3 & 7 \end{pmatrix},
Add your contribution
Related Hubs
User Avatar
No comments yet.