Hubbry Logo
Block matrixBlock matrixMain
Open search
Block matrix
Community hub
Block matrix
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Block matrix
Block matrix
from Wikipedia

In mathematics, a block matrix or a partitioned matrix is a matrix that is interpreted as having been broken into sections called blocks or submatrices.[1][2]

Intuitively, a matrix interpreted as a block matrix can be visualized as the original matrix with a collection of horizontal and vertical lines, which break it up, or partition it, into a collection of smaller matrices.[3][2] For example, the 3x4 matrix presented below is divided by horizontal and vertical lines into four blocks: the top-left 2x3 block, the top-right 2x1 block, the bottom-left 1x3 block, and the bottom-right 1x1 block.

Any matrix may be interpreted as a block matrix in one or more ways, with each interpretation defined by how its rows and columns are partitioned.

This notion can be made more precise for an by matrix by partitioning into a collection , and then partitioning into a collection . The original matrix is then considered as the "total" of these groups, in the sense that the entry of the original matrix corresponds in a 1-to-1 way with some offset entry of some , where and .[4]

Block matrix algebra arises in general from biproducts in categories of matrices.[5]

A 168×168 element block matrix with 12×12, 12×24, 24×12, and 24×24 sub-matrices. Non-zero elements are in blue, zero elements are grayed.

Example

[edit]

The matrix

can be visualized as divided into four blocks, as

.

The horizontal and vertical lines have no special mathematical meaning,[6][7] but are a common way to visualize a partition.[6][7] By this partition, is partitioned into four 2×2 blocks, as

The partitioned matrix can then be written as

[8]

Formal definition

[edit]

Let . A partitioning of is a representation of in the form

,

where are contiguous submatrices, , and .[9] The elements of the partition are called blocks.[9]

By this definition, the blocks in any one column must all have the same number of columns.[9] Similarly, the blocks in any one row must have the same number of rows.[9]

Partitioning methods

[edit]

A matrix can be partitioned in many ways.[9] For example, a matrix is said to be partitioned by columns if it is written as

,

where is the th column of .[9] A matrix can also be partitioned by rows:

,

where is the th row of .[9]

Common partitions

[edit]

Often,[9] we encounter the 2x2 partition

,[9]

particularly in the form where is a scalar:

.[9]

Block matrix operations

[edit]

Transpose

[edit]

Let

where . (This matrix will be reused in § Addition and § Multiplication.) Then its transpose is

,[9][10]

and the same equation holds with the transpose replaced by the conjugate transpose.[9]

Block transpose

[edit]

A special form of matrix transpose can also be defined for block matrices, where individual blocks are reordered but not transposed. Let be a block matrix with blocks , the block transpose of is the block matrix with blocks .[11] As with the conventional trace operator, the block transpose is a linear mapping such that .[10] However, in general the property does not hold unless the blocks of and commute.

Addition

[edit]

Let

,

where , and let be the matrix defined in § Transpose. (This matrix will be reused in § Multiplication.) Then if , , , and , then

.[9]

Multiplication

[edit]

It is possible to use a block partitioned matrix product that involves only algebra on submatrices of the factors. The partitioning of the factors is not arbitrary, however, and requires "conformable partitions"[12] between two matrices and such that all submatrix products that will be used are defined.[13]

Two matrices and are said to be partitioned conformally for the product , when and are partitioned into submatrices and if the multiplication is carried out treating the submatrices as if they are scalars, but keeping the order, and when all products and sums of submatrices involved are defined.

— Arak M. Mathai and Hans J. Haubold, Linear Algebra: A Course for Physicists and Engineers[14]

Let be the matrix defined in § Transpose, and let be the matrix defined in § Addition. Then the matrix product

can be performed blockwise, yielding as an matrix. The matrices in the resulting matrix are calculated by multiplying:

[6]

Or, using the Einstein notation that implicitly sums over repeated indices:

Depicting as a matrix, we have

.[9]

Inversion

[edit]

If a matrix is partitioned into four blocks, it can be inverted blockwise as follows:

where A and D are square blocks of arbitrary size, and B and C are conformable with them for partitioning. Furthermore, A and the Schur complement of A in P: P/A = DCA−1B must be invertible.[15]

Equivalently, by permuting the blocks:

[16]

Here, D and the Schur complement of D in P: P/D = ABD−1C must be invertible.

If A and D are both invertible, then:

By the Weinstein–Aronszajn identity, one of the two matrices in the block-diagonal matrix is invertible exactly when the other is.

Computing submatrix inverses from the full inverse

[edit]

By the symmetry between a matrix and its inverse in the block inversion formula, if a matrix P and its inverse P−1 are partitioned conformally:

then the inverse of any principal submatrix can be computed from the corresponding blocks of P−1:

This relationship follows from recognizing that E−1 = ABD−1C (the Schur complement), and applying the same block inversion formula with the roles of P and P−1 reversed.[17] [18]

Determinant

[edit]

The formula for the determinant of a -matrix above continues to hold, under appropriate further assumptions, for a matrix composed of four submatrices with and square. The easiest such formula, which can be proven using either the Leibniz formula or a factorization involving the Schur complement, is

[16]

Using this formula, we can derive that characteristic polynomials of and are same and equal to the product of characteristic polynomials of and . Furthermore, If or is diagonalizable, then and are diagonalizable too. The converse is false; simply check .

If is invertible, one has

[16]

and if is invertible, one has

[19][16]

If the blocks are square matrices of the same size further formulas hold. For example, if and commute (i.e., ), then

[20]

Similar statements hold when , , or . Namely, if , then

Note the change in order of and (we have instead of ). Similarly, if , then should be replaced with (i.e. we get ) and if , then we should have . Note for the last two results, you have to use commutativity of the underlying ring, but not for the first two.

This formula has been generalized to matrices composed of more than blocks, again under appropriate commutativity conditions among the individual blocks.[21]

For and , the following formula holds (even if and do not commute)

[16]

Special types of block matrices

[edit]

Direct sums and block diagonal matrices

[edit]

Direct sum

[edit]

For any arbitrary matrices A (of size m × n) and B (of size p × q), we have the direct sum of A and B, denoted by A  B and defined as

[10]

For instance,

This operation generalizes naturally to arbitrary dimensioned arrays (provided that A and B have the same number of dimensions).

Note that any element in the direct sum of two vector spaces of matrices could be represented as a direct sum of two matrices.

Block diagonal matrices

[edit]

A block diagonal matrix is a block matrix that is a square matrix such that the main-diagonal blocks are square matrices and all off-diagonal blocks are zero matrices.[16] That is, a block diagonal matrix A has the form

where Ak is a square matrix for all k = 1, ..., n. In other words, matrix A is the direct sum of A1, ..., An.[16] It can also be indicated as A1 ⊕ A2 ⊕ ... ⊕ An[10] or diag(A1, A2, ..., An)[10] (the latter being the same formalism used for a diagonal matrix). Any square matrix can trivially be considered a block diagonal matrix with only one block.

For the determinant and trace, the following properties hold:

[22][23] and
[16][23]

A block diagonal matrix is invertible if and only if each of its main-diagonal blocks are invertible, and in this case its inverse is another block diagonal matrix given by

[24]

The eigenvalues[25] and eigenvectors of are simply those of the s combined.[23]

Block tridiagonal matrices

[edit]

A block tridiagonal matrix is another special block matrix, which is just like the block diagonal matrix a square matrix, having square matrices (blocks) in the lower diagonal, main diagonal and upper diagonal, with all other blocks being zero matrices. It is essentially a tridiagonal matrix but has submatrices in places of scalars. A block tridiagonal matrix has the form

where , and are square sub-matrices of the lower, main and upper diagonal respectively.[26][27]

Block tridiagonal matrices are often encountered in numerical solutions of engineering problems (e.g., computational fluid dynamics). Optimized numerical methods for LU factorization are available[28] and hence efficient solution algorithms for equation systems with a block tridiagonal matrix as coefficient matrix. The Thomas algorithm, used for efficient solution of equation systems involving a tridiagonal matrix can also be applied using matrix operations to block tridiagonal matrices (see also Block LU decomposition).

Block triangular matrices

[edit]

An matrix is upper block triangular (or block upper triangular[29]) if there are positive integers such that and where the matrix is for all .[25][29] Similarly, is lower block triangular if where is for all .[25]

Block Toeplitz matrices

[edit]

A block Toeplitz matrix is another special block matrix, which contains blocks that are repeated down the diagonals of the matrix, as a Toeplitz matrix has elements repeated down the diagonal.

A matrix is block Toeplitz if for all , that is,

,

where .[25]

Block Hankel matrices

[edit]

A matrix is block Hankel if for all , that is,

,

where .[25]

See also

[edit]
  • Kronecker product (matrix direct product resulting in a block matrix)
  • Jordan normal form (canonical form of a linear operator on a finite-dimensional complex vector space)
  • Strassen algorithm (algorithm for matrix multiplication that is faster than the conventional matrix multiplication algorithm)

Notes

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
A block matrix, also known as a partitioned matrix, is a matrix whose entries are themselves matrices arranged in a rectangular array of submatrices or blocks, allowing for a hierarchical structure that facilitates analysis and computation in linear algebra. This partitioning divides the overall matrix into contiguous rectangular blocks, such as the common 2×2 form (ABCD)\begin{pmatrix} A & B \\ C & D \end{pmatrix}, where AA, BB, CC, and DD are submatrices of compatible dimensions, enabling the matrix to be viewed at multiple levels of granularity. Block matrices support standard operations like and , performed block-wise under compatible partitioning; for instance, the product of two 2×2 block matrices (A1B1C1D1)(A2B2C2D2)=(A1A2+B1C2A1B2+B1D2C1A2+D1C2C1B2+D1D2)\begin{pmatrix} A_1 & B_1 \\ C_1 & D_1 \end{pmatrix} \begin{pmatrix} A_2 & B_2 \\ C_2 & D_2 \end{pmatrix} = \begin{pmatrix} A_1 A_2 + B_1 C_2 & A_1 B_2 + B_1 D_2 \\ C_1 A_2 + D_1 C_2 & C_1 B_2 + D_1 D_2 \end{pmatrix}, provided the inner dimensions align. Notable properties include the invertibility of certain block forms, such as block triangular matrices, where the inverse can be computed using the inverses of diagonal blocks and solving for off-diagonal terms, and the of block diagonal matrices, which is the product of the determinants of the diagonal blocks. Special types, like block diagonal matrices (with non-zero blocks only on the ) and block triangular matrices (zero blocks above or below the diagonal), simplify eigenvalues, traces, and other invariants, as the trace of a product remains invariant under regardless of block structure. In , block matrices are essential for efficient algorithms, including block LU and Cholesky factorizations, matrix inversions via Schur complements, and strategies that exploit sparsity or structure in large-scale problems like those in and scientific simulations. They also underpin advanced techniques, such as the anti block diagonal method for computing matrix square roots and identities like det(I+AB)=det(I+BA)\det(I + AB) = \det(I + BA) for rectangular blocks, enhancing solvability in systems with partitioned data. Overall, block matrices provide a powerful framework for decomposing complex linear systems, revealing underlying patterns, and optimizing computational performance in both theoretical and .

Definition and Partitioning

Formal Definition

A block matrix is a matrix that is partitioned into smaller submatrices, known as blocks, forming a rectangular or square where the overall structure maintains the dimensions of the original matrix. This partitioning allows the matrix to be viewed as composed of these submatrices arranged in a grid-like fashion, facilitating analysis and computation in linear algebra. Formally, consider an m×nm \times n matrix AA. It can be partitioned into blocks AijA_{ij} for i=1,,pi = 1, \dots, p and j=1,,qj = 1, \dots, q, where each block AijA_{ij} is an mi×njm_i \times n_j submatrix, with i=1pmi=m\sum_{i=1}^p m_i = m and j=1qnj=n\sum_{j=1}^q n_j = n. The block matrix is then denoted as A=[Aij]A = [A_{ij}], emphasizing the block structure over individual entries. The compatibility condition requires that the blocks align without overlaps or gaps, ensuring the row partitions sum to the total number of rows and the column partitions sum to the total number of columns, thus preserving the integrity of the matrix indices. This dissection must be consistent across the entire matrix for the partitioning to be valid. The of block matrices originated in 19th-century developments in matrix , with early uses in blockwise multiplication appearing in the work of Edmond Laguerre in 1867, building on foundational matrix operations by , and was employed to simplify computations in systems of linear equations.

Matrix Partitioning Methods

Matrix partitioning methods involve dividing the rows and columns of a matrix into groups to form a grid of submatrices, known as blocks, which facilitates structured analysis and computation. Horizontal partitioning refers to the division of rows into consecutive groups, resulting in a stacked of block rows that collectively span the entire matrix. For an m×nm \times n matrix, this creates horizontal strips where each block row consists of contiguous rows from the original matrix. Vertical partitioning, conversely, divides the columns into consecutive groups, yielding a side-by-side of block columns, with each block column comprising contiguous columns. These methods are foundational, as they transform the matrix into a block form that preserves its overall dimensions while highlighting internal , such as patterns of zeros or identities that suggest natural divisions. Conformal partitioning extends these concepts to ensure compatibility between matrices for operations like . In this approach, the horizontal partitioning of the first matrix aligns precisely with the vertical partitioning of the second matrix, meaning the block boundaries for rows in one match the column boundaries in the other, allowing block-wise computations where each product block is the sum of outer products of corresponding blocks. This alignment is essential for the validity of block matrix algebra, as mismatched dimensions would prevent or of blocks. As established in the formal definition of block matrices, such compatible partitions enable the representation of the overall matrix product as a block matrix of the same partition structure. Non-consecutive partitioning, where blocks are formed from non-adjacent rows or columns, occurs rarely in linear algebra contexts due to the challenges it poses for maintaining the structural properties required for efficient block operations. In such cases, the blocks do not form contiguous submatrices, which can invalidate straightforward applications of block multiplication or inversion formulas unless additional permutations or adjustments are applied to restore contiguity. These partitions are typically avoided in favor of consecutive ones, except in specialized numerical algorithms where or sparsity patterns necessitate non-contiguous groupings, but even then, operations must account for the resulting irregularities in indexing and computation. An algorithmic approach to partitioning provides a systematic way to divide an m×nm \times n matrix AA into a k×lk \times l block structure. Begin by defining strictly increasing row cut indices {r0=0,r1,,rk=m}\{r_0 = 0, r_1, \dots, r_k = m\}, where each rir_i (for 1i<k1 \leq i < k) marks the end of the ii-th row group, and similarly define column cut indices {c0=0,c1,,cl=n}\{c_0 = 0, c_1, \dots, c_l = n\}. The resulting (i,j)(i,j)-th block AijA_{ij} is then the submatrix of AA comprising rows from ri1+1r_{i-1} + 1 to rir_i and columns from cj1+1c_{j-1} + 1 to cjc_j, ensuring the blocks tile the original matrix without overlap or gaps. This method allows flexibility in choosing cut points based on matrix sparsity, computational needs, or problem symmetry, and it directly supports conformal setups by matching cut indices across matrices.

Common Partition Types

One of the most frequently encountered partition schemes for block matrices is the 2×2 partition, where a matrix AR(m1+m2)×(n1+n2)A \in \mathbb{R}^{(m_1 + m_2) \times (n_1 + n_2)} is divided into four submatrices as A=(A11A12A21A22),A = \begin{pmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{pmatrix}, with A11Rm1×n1A_{11} \in \mathbb{R}^{m_1 \times n_1}, A12Rm1×n2A_{12} \in \mathbb{R}^{m_1 \times n_2}, A21Rm2×n1A_{21} \in \mathbb{R}^{m_2 \times n_1}, and A22Rm2×n2A_{22} \in \mathbb{R}^{m_2 \times n_2}. This structure simplifies the analysis of matrix properties and operations by isolating interactions between row and column subsets. Another common configuration involves partitioning a matrix into a single row of blocks, known as a block row vector, or a single column of blocks, known as a block column vector; for instance, a matrix may be expressed with horizontal blocks [B1 B2  Bk][B_1 \ B_2 \ \cdots \ B_k] along rows or vertical blocks (C1C2Ck)\begin{pmatrix} C_1 \\ C_2 \\ \vdots \\ C_k \end{pmatrix}
Add your contribution
Related Hubs
User Avatar
No comments yet.