Hubbry Logo
Definite matrixDefinite matrixMain
Open search
Definite matrix
Community hub
Definite matrix
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Contribute something
Definite matrix
Definite matrix
from Wikipedia

In mathematics, a symmetric matrix with real entries is positive-definite if the real number is positive for every nonzero real column vector where is the row vector transpose of [1] More generally, a Hermitian matrix (that is, a complex matrix equal to its conjugate transpose) is positive-definite if the real number is positive for every nonzero complex column vector where denotes the conjugate transpose of

Positive semi-definite matrices are defined similarly, except that the scalars and are required to be positive or zero (that is, nonnegative). Negative-definite and negative semi-definite matrices are defined analogously. A matrix that is not positive semi-definite and not negative semi-definite is sometimes called indefinite.

Some authors use more general definitions of definiteness, permitting the matrices to be non-symmetric or non-Hermitian. The properties of these generalized definite matrices are explored in § Extension for non-Hermitian square matrices, below, but are not the main focus of this article.

Definitions

[edit]

In the following definitions, is the transpose of is the conjugate transpose of and denotes the n dimensional zero-vector.

Definitions for real matrices

[edit]

An symmetric real matrix is said to be positive-definite if for all non-zero in Formally,

An symmetric real matrix is said to be positive-semidefinite or non-negative-definite if for all in Formally,

An symmetric real matrix is said to be negative-definite if for all non-zero in Formally,

An symmetric real matrix is said to be negative-semidefinite or non-positive-definite if for all in Formally,

An symmetric real matrix which is neither positive semidefinite nor negative semidefinite is called indefinite.

Definitions for complex matrices

[edit]

The following definitions all involve the term Notice that this is always a real number for any Hermitian square matrix

An Hermitian complex matrix is said to be positive-definite if for all non-zero in Formally,

An Hermitian complex matrix is said to be positive semi-definite or non-negative-definite if for all in Formally,

An Hermitian complex matrix is said to be negative-definite if for all non-zero in Formally,

An Hermitian complex matrix is said to be negative semi-definite or non-positive-definite if for all in Formally,

An Hermitian complex matrix which is neither positive semidefinite nor negative semidefinite is called indefinite.

Consistency between real and complex definitions

[edit]

Since every real matrix is also a complex matrix, the definitions of "definiteness" for the two classes must agree.

For complex matrices, the most common definition says that is positive-definite if and only if is real and positive for every non-zero complex column vectors This condition implies that is Hermitian (i.e. its transpose is equal to its conjugate), since being real, it equals its conjugate transpose for every which implies

By this definition, a positive-definite real matrix is Hermitian, hence symmetric; and is positive for all non-zero real column vectors However the last condition alone is not sufficient for to be positive-definite. For example, if

then for any real vector with entries and we have which is always positive if is not zero. However, if is the complex vector with entries 1 and , one gets

which is not real. Therefore, is not positive-definite.

On the other hand, for a symmetric real matrix the condition " for all nonzero real vectors " does imply that is positive-definite in the complex sense.

Notation

[edit]

If a Hermitian matrix is positive semi-definite, one sometimes writes and if is positive-definite one writes To denote that is negative semi-definite one writes and to denote that is negative-definite one writes

The notion comes from functional analysis where positive semidefinite matrices define positive operators. If two matrices and satisfy we can define a non-strict partial order that is reflexive, antisymmetric, and transitive; It is not a total order, however, as in general, may be indefinite.

A common alternative notation is and for positive semi-definite and positive-definite, negative semi-definite and negative-definite matrices, respectively. This may be confusing, as sometimes nonnegative matrices (respectively, nonpositive matrices) are also denoted in this way.

Ramifications

[edit]

It follows from the above definitions that a Hermitian matrix is positive-definite if and only if it is the matrix of a positive-definite quadratic form or Hermitian form. In other words, a Hermitian matrix is positive-definite if and only if it defines an inner product.

Positive-definite and positive-semidefinite matrices can be characterized in many ways, which may explain the importance of the concept in various parts of mathematics. A Hermitian matrix M is positive-definite if and only if it satisfies any of the following equivalent conditions.

  • is congruent with a diagonal matrix with positive real entries.
  • is Hermitian, and all its eigenvalues are real and positive.
  • is Hermitian, and all its leading principal minors are positive.
  • There exists an invertible matrix with conjugate transpose such that

A matrix is positive semi-definite if it satisfies similar equivalent conditions where "positive" is replaced by "nonnegative", "invertible matrix" is replaced by "matrix", and the word "leading" is removed.

Positive-definite and positive-semidefinite real matrices are at the basis of convex optimization, since, given a function of several real variables that is twice differentiable, then if its Hessian matrix (matrix of its second partial derivatives) is positive-definite at a point then the function is convex near p, and, conversely, if the function is convex near then the Hessian matrix is positive-semidefinite at

The set of positive definite matrices is an open convex cone, while the set of positive semi-definite matrices is a closed convex cone.[2]

Examples

[edit]
  • The identity matrix is positive-definite (and as such also positive semi-definite). It is a real symmetric matrix, and, for any non-zero column vector z with real entries a and b, one has

    Seen as a complex matrix, for any non-zero column vector z with complex entries a and b one has

    Either way, the result is positive since is not the zero vector (that is, at least one of and is not zero).
  • The real symmetric matrix is positive-definite since for any non-zero column vector z with entries a, b and c, we have This result is a sum of squares, and therefore non-negative; and is zero only if that is, when is the zero vector.
  • For any real invertible matrix the product is a positive definite matrix (if the means of the columns of A are 0, then this is also called the covariance matrix). A simple proof is that for any non-zero vector the condition since the invertibility of matrix means that
  • The example above shows that a matrix in which some elements are negative may still be positive definite. Conversely, a matrix whose entries are all positive is not necessarily positive definite, as for example for which

Eigenvalues

[edit]

Let be an Hermitian matrix (this includes real symmetric matrices). All eigenvalues of are real, and their sign characterize its definiteness:

  • is positive definite if and only if all of its eigenvalues are positive.
  • is positive semi-definite if and only if all of its eigenvalues are non-negative.
  • is negative definite if and only if all of its eigenvalues are negative.
  • is negative semi-definite if and only if all of its eigenvalues are non-positive.
  • is indefinite if and only if it has both positive and negative eigenvalues.

Let be an eigendecomposition of where is a unitary complex matrix whose columns comprise an orthonormal basis of eigenvectors of and is a real diagonal matrix whose main diagonal contains the corresponding eigenvalues. The matrix may be regarded as a diagonal matrix that has been re-expressed in coordinates of the (eigenvectors) basis Put differently, applying to some vector giving is the same as changing the basis to the eigenvector coordinate system using giving applying the stretching transformation to the result, giving and then changing the basis back using giving

With this in mind, the one-to-one change of variable shows that is real and positive for any complex vector if and only if is real and positive for any in other words, if is positive definite. For a diagonal matrix, this is true only if each element of the main diagonal – that is, every eigenvalue of – is positive. Since the spectral theorem guarantees all eigenvalues of a Hermitian matrix to be real, the positivity of eigenvalues can be checked using Descartes' rule of alternating signs when the characteristic polynomial of a real, symmetric matrix is available.

Decomposition

[edit]

Let be an Hermitian matrix. is positive semidefinite if and only if it can be decomposed as a product of a matrix with its conjugate transpose.

When is real, can be real as well and the decomposition can be written as

is positive definite if and only if such a decomposition exists with invertible. More generally, is positive semidefinite with rank if and only if a decomposition exists with a matrix of full row rank (i.e. of rank ). Moreover, for any decomposition [3]

Proof

If then so is positive semidefinite. If moreover is invertible then the inequality is strict for so is positive definite. If is of rank then

In the other direction, suppose is positive semidefinite. Since is Hermitian, it has an eigendecomposition where is unitary and is a diagonal matrix whose entries are the eigenvalues of Since is positive semidefinite, the eigenvalues are non-negative real numbers, so one can define as the diagonal matrix whose entries are non-negative square roots of eigenvalues. Then for If moreover is positive definite, then the eigenvalues are (strictly) positive, so is invertible, and hence is invertible as well. If has rank then it has exactly positive eigenvalues and the others are zero, hence in all but rows are all zeroed. Cutting the zero rows gives a matrix such that

The columns of can be seen as vectors in the complex or real vector space respectively. Then the entries of are inner products (that is dot products, in the real case) of these vectors In other words, a Hermitian matrix is positive semidefinite if and only if it is the Gram matrix of some vectors It is positive definite if and only if it is the Gram matrix of some linearly independent vectors. In general, the rank of the Gram matrix of vectors equals the dimension of the space spanned by these vectors.[4]

Uniqueness up to unitary transformations

[edit]

The decomposition is not unique: if for some matrix and if is any unitary matrix (meaning ), then for

However, this is the only way in which two decompositions can differ: The decomposition is unique up to unitary transformations. More formally, if is a matrix and is a matrix such that then there is a matrix with orthonormal columns (meaning ) such that [5] When this means is unitary.

This statement has an intuitive geometric interpretation in the real case: let the columns of and be the vectors and in A real unitary matrix is an orthogonal matrix, which describes a rigid transformation (an isometry of Euclidean space ) preserving the 0 point (i.e. rotations and reflections, without translations). Therefore, the dot products and are equal if and only if some rigid transformation of transforms the vectors to (and 0 to 0).

Square root

[edit]

A Hermitian matrix is positive semidefinite if and only if there is a positive semidefinite matrix (in particular is Hermitian, so ) satisfying This matrix is unique,[6] is called the non-negative square root of and is denoted with When is positive definite, so is hence it is also called the positive square root of

The non-negative square root should not be confused with other decompositions Some authors use the name square root and for any such decomposition, or specifically for the Cholesky decomposition, or any decomposition of the form others only use it for the non-negative square root.

If then

Cholesky decomposition

[edit]

A Hermitian positive semidefinite matrix can be written as where is lower triangular with non-negative diagonal (equivalently where is upper triangular); this is the Cholesky decomposition. If is positive definite, then the diagonal of is positive and the Cholesky decomposition is unique. Conversely if is lower triangular with nonnegative diagonal then is positive semidefinite. The Cholesky decomposition is especially useful for efficient numerical calculations. A closely related decomposition is the LDL decomposition, where is diagonal and is lower unitriangular.

Williamson theorem

[edit]

Any positive definite Hermitian real matrix can be diagonalized via symplectic (real) matrices. More precisely, Williamson's theorem ensures the existence of symplectic and diagonal real positive such that .

Other characterizations

[edit]

Let be an real symmetric matrix, and let be the "unit ball" defined by Then we have the following

  • is a solid slab sandwiched between
  • if and only if is an ellipsoid, or an ellipsoidal cylinder.
  • if and only if is bounded, that is, it is an ellipsoid.
  • If then if and only if if and only if
  • If then for all if and only if So, since the polar dual of an ellipsoid is also an ellipsoid with the same principal axes, with inverse lengths, we have That is, if is positive-definite, then for all if and only if

Let be an Hermitian matrix. The following properties are equivalent to being positive definite:

The associated sesquilinear form is an inner product
The sesquilinear form defined by is the function from to such that for all and in where is the conjugate transpose of For any complex matrix this form is linear in and semilinear in Therefore, the form is an inner product on if and only if is real and positive for all nonzero that is if and only if is positive definite. (In fact, every inner product on arises in this fashion from a Hermitian positive definite matrix.)
Its leading principal minors are all positive
The kth leading principal minor of a matrix is the determinant of its upper-left sub-matrix. It turns out that a matrix is positive definite if and only if all these determinants are positive. This condition is known as Sylvester's criterion, and provides an efficient test of positive definiteness of a symmetric real matrix. Namely, the matrix is reduced to an upper triangular matrix by using elementary row operations, as in the first part of the Gaussian elimination method, taking care to preserve the sign of its determinant during pivoting process. Since the kth leading principal minor of a triangular matrix is the product of its diagonal elements up to row Sylvester's criterion is equivalent to checking whether its diagonal elements are all positive. This condition can be checked each time a new row of the triangular matrix is obtained.

A positive semidefinite matrix is positive definite if and only if it is invertible.[7] A matrix is negative (semi)definite if and only if is positive (semi)definite.

Quadratic forms

[edit]

The (purely) quadratic form associated with a real matrix is the function such that for all can be assumed symmetric by replacing it with since any asymmetric part will be zeroed-out in the double-sided product.

A symmetric matrix is positive definite if and only if its quadratic form is a strictly convex function.

More generally, any quadratic function from to can be written as where is a symmetric matrix, is a real n vector, and a real constant. In the case, this is a parabola, and just like in the case, we have

Theorem: This quadratic function is strictly convex, and hence has a unique finite global minimum, if and only if is positive definite.

Proof: If is positive definite, then the function is strictly convex. Its gradient is zero at the unique point of which must be the global minimum since the function is strictly convex. If is not positive definite, then there exists some vector such that so the function is a line or a downward parabola, thus not strictly convex and not having a global minimum.

For this reason, positive definite matrices play an important role in optimization problems.

Simultaneous diagonalization

[edit]

One symmetric matrix and another matrix that is both symmetric and positive definite can be simultaneously diagonalized. This is so although simultaneous diagonalization is not necessarily performed with a similarity transformation. This result does not extend to the case of three or more matrices. In this section we write for the real case. Extension to the complex case is immediate.

Let be a symmetric and a symmetric and positive definite matrix. Write the generalized eigenvalue equation as where we impose that be normalized, i.e. Now we use Cholesky decomposition to write the inverse of as Multiplying by and letting we get which can be rewritten as where Manipulation now yields where is a matrix having as columns the generalized eigenvectors and is a diagonal matrix of the generalized eigenvalues. Now premultiplication with gives the final result: and but note that this is no longer an orthogonal diagonalization with respect to the inner product where In fact, we diagonalized with respect to the inner product induced by [8]

Note that this result does not contradict what is said on simultaneous diagonalization in the article Diagonalizable matrix, which refers to simultaneous diagonalization by a similarity transformation. Our result here is more akin to a simultaneous diagonalization of two quadratic forms, and is useful for optimization of one form under conditions on the other.

Properties

[edit]

Induced partial ordering

[edit]

For arbitrary square matrices we write if i.e., is positive semi-definite. This defines a partial ordering on the set of all square matrices. One can similarly define a strict partial ordering The ordering is called the Loewner order.

Inverse of positive definite matrix

[edit]

Every positive definite matrix is invertible and its inverse is also positive definite.[9] If then [10] Moreover, by the min-max theorem, the kth largest eigenvalue of is greater than or equal to the kth largest eigenvalue of

Scaling

[edit]

If is positive definite and is a real number, then is positive definite.[11]

Addition

[edit]
  • If and are positive-definite, then the sum is also positive-definite.[11]
  • If and are positive-semidefinite, then the sum is also positive-semidefinite.
  • If is positive-definite and is positive-semidefinite, then the sum is also positive-definite.

Multiplication

[edit]
  • If and are positive definite, then the products and are also positive definite. If then is also positive definite.
  • If is positive semidefinite, then is positive semidefinite for any (possibly rectangular) matrix If is positive definite and has full column rank, then is positive definite.[12]

Trace

[edit]

The diagonal entries of a positive-semidefinite matrix are real and non-negative. As a consequence the trace, Furthermore,[13] since every principal sub-matrix (in particular, 2-by-2) is positive semidefinite, and thus, when

An Hermitian matrix is positive definite if it satisfies the following trace inequalities:[14]

Another important result is that for any and positive-semidefinite matrices, This follows by writing The matrix is positive-semidefinite and thus has non-negative eigenvalues, whose sum, the trace, is therefore also non-negative.

Hadamard product

[edit]

If although is not necessary positive semidefinite, the Hadamard product is, (this result is often called the Schur product theorem).[15]

Regarding the Hadamard product of two positive semidefinite matrices there are two notable inequalities:

  • Oppenheim's inequality: [16]
  • [17]

Kronecker product

[edit]

If although is not necessary positive semidefinite, the Kronecker product

Frobenius product

[edit]

If although is not necessary positive semidefinite, the Frobenius inner product (Lancaster–Tismenetsky, The Theory of Matrices, p. 218).

Convexity

[edit]

The set of positive semidefinite symmetric matrices is convex. That is, if and are positive semidefinite, then for any between 0 and 1, is also positive semidefinite. For any vector :

This property guarantees that semidefinite programming problems converge to a globally optimal solution.

Relation with cosine

[edit]

The positive-definiteness of a matrix expresses that the angle between any vector and its image is always

the angle between and

Further properties

[edit]
  1. If is a symmetric Toeplitz matrix, i.e. the entries are given as a function of their absolute index differences: and the strict inequality holds, then is strictly positive definite.
  2. Let and Hermitian. If (resp., ) then (resp., ).[18]
  3. If is real, then there is a such that where is the identity matrix.
  4. If denotes the leading minor, is the kth pivot during LU decomposition.
  5. A matrix is negative definite if its kth order leading principal minor is negative when is odd, and positive when is even.
  6. If is a real positive definite matrix, then there exists a positive real number such that for every vector
  7. A Hermitian matrix is positive semidefinite if and only if all of its principal minors are nonnegative. It is however not enough to consider the leading principal minors only, as is checked on the diagonal matrix with entries 0 and −1 .

Block matrices and submatrices

[edit]

A positive matrix may also be defined by blocks:

where each block is By applying the positivity condition, it immediately follows that and are hermitian, and

We have that for all complex and in particular for Then

A similar argument can be applied to and thus we conclude that both and must be positive definite. The argument can be extended to show that any principal submatrix of is itself positive definite.

Converse results can be proved with stronger conditions on the blocks, for instance, using the Schur complement.

Local extrema

[edit]

A general quadratic form on real variables can always be written as where is the column vector with those variables, and is a symmetric real matrix. Therefore, the matrix being positive definite means that has a unique minimum (zero) when is zero, and is strictly positive for any other

More generally, a twice-differentiable real function on real variables has local minimum at arguments if its gradient is zero and its Hessian (the matrix of all second derivatives) is positive semi-definite at that point. Similar statements can be made for negative definite and semi-definite matrices.

Covariance

[edit]

In statistics, the covariance matrix of a multivariate probability distribution is always positive semi-definite; and it is positive definite unless one variable is an exact linear function of the others. Conversely, every positive semi-definite matrix is the covariance matrix of some multivariate distribution.

Extension for non-Hermitian square matrices

[edit]

The definition of positive definite can be generalized by designating any complex matrix (e.g. real non-symmetric) as positive definite if for all non-zero complex vectors where denotes the real part of a complex number [19] Only the Hermitian part determines whether the matrix is positive definite, and is assessed in the narrower sense above. Similarly, if and are real, we have for all real nonzero vectors if and only if the symmetric part is positive definite in the narrower sense. It is immediately clear that is insensitive to transposition of

A non-symmetric real matrix with only positive eigenvalues may have a symmetric part with negative eigenvalues, in which case it will not be positive (semi)definite. For example, the matrix has positive eigenvalues 1 and 7, yet with the choice .

In summary, the distinguishing feature between the real and complex case is that, a bounded positive operator on a complex Hilbert space is necessarily Hermitian, or self adjoint. The general claim can be argued using the polarization identity. That is no longer true in the real case.

Applications

[edit]

Heat conductivity matrix

[edit]

Fourier's law of heat conduction, giving heat flux in terms of the temperature gradient is written for anisotropic media as in which is the thermal conductivity matrix. The negative is inserted in Fourier's law to reflect the expectation that heat will always flow from hot to cold. In other words, since the temperature gradient always points from cold to hot, the heat flux is expected to have a negative inner product with so that Substituting Fourier's law then gives this expectation as implying that the conductivity matrix should be positive definite. Ordinarily should be symmetric, however it becomes nonsymmetric in the presence of a magnetic field as in a thermal Hall effect.

More generally in thermodynamics, the flow of heat and particles is a fully coupled system as described by the Onsager reciprocal relations, and the coupling matrix is required to be positive semi-definite (possibly non-symmetric) in order that entropy production be nonnegative.

See also

[edit]

References

[edit]

Sources

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
In linear algebra, a definite matrix is a real that is either positive definite or negative definite, meaning its associated xTAx\mathbf{x}^T A \mathbf{x} is strictly positive (or strictly negative) for all non-zero vectors x\mathbf{x}. Equivalently, all eigenvalues of a positive definite matrix are positive, while all eigenvalues of a negative definite matrix are negative. These matrices play a fundamental role in various fields, including optimization, where the of the at a critical point indicates a local minimum, and negative definiteness indicates a local maximum. Positive definite matrices are particularly ubiquitous in statistics and probability, often appearing as covariance matrices of multivariate normal distributions, ensuring that variances are positive and correlations are well-defined. They also admit unique Cholesky decompositions into lower and upper triangular factors, facilitating numerical computations such as solving linear systems efficiently. Negative definite matrices share analogous properties but with sign reversals, such as the leading principal minors alternating in sign, starting with negative, for negative definiteness. Both types are invertible, with inverses that preserve definiteness (the inverse of a positive definite matrix is positive definite, and similarly for negative). Key tests for definiteness include Sylvester's criterion, which states that a symmetric matrix is positive definite if and only if all leading principal minors are positive, and negative definite if they alternate in sign starting with negative. Eigenvalue computation provides another definitive method, leveraging the spectral theorem for symmetric matrices, which guarantees real eigenvalues. In applications like control theory and physics, definiteness ensures stability and positive energy forms, such as in the analysis of quadratic potentials.

Definitions

Real symmetric matrices

A real symmetric matrix ARn×nA \in \mathbb{R}^{n \times n} (i.e., A=ATA = A^T) is defined as positive definite if the associated satisfies xTAx>0x^T A x > 0 for every nonzero real vector xRnx \in \mathbb{R}^n. This condition ensures that the quadratic form is strictly positive, establishing a foundational notion of tied to the matrix's . The concept extends to positive semi-definite matrices, where xTAx0x^T A x \geq 0 for all real vectors xx, allowing equality for some nonzero xx. Similarly, AA is negative definite if xTAx<0x^T A x < 0 for all nonzero xx, and negative semi-definite if xTAx0x^T A x \leq 0 for all xx. These classifications rely on the sign of the quadratic form, with symmetry guaranteeing that AA has real eigenvalues. An equivalent characterization for positive definiteness is that all eigenvalues of AA are positive. For positive semi-definiteness, all eigenvalues are non-negative, while negative definiteness requires all eigenvalues to be negative, and negative semi-definiteness requires all to be non-positive. This spectral equivalence follows from the spectral theorem for symmetric matrices, which diagonalizes AA orthogonally, preserving the quadratic form's sign properties. One early characterization of positive definiteness is Sylvester's criterion, introduced by James Joseph Sylvester in 1852, stating that a real symmetric matrix is positive definite if and only if all its leading principal minors are positive. For positive definite AA, the quadratic form xTAxx^T A x is always positive, providing a basic inequality that underpins applications in optimization and stability analysis.

Hermitian matrices

In the complex case, a Hermitian matrix HCn×nH \in \mathbb{C}^{n \times n}, satisfying H=HH = H^* where ^* denotes the conjugate transpose, is defined as positive definite if the quadratic form xHx>0\mathbf{x}^* H \mathbf{x} > 0 for all nonzero complex vectors xCn\mathbf{x} \in \mathbb{C}^n. Analogous definitions extend to the semi-definite cases: HH is positive semi-definite if xHx0\mathbf{x}^* H \mathbf{x} \geq 0 for all xCn\mathbf{x} \in \mathbb{C}^n, with equality holding for some nonzero x\mathbf{x}; negative definite if xHx<0\mathbf{x}^* H \mathbf{x} < 0 for all nonzero x\mathbf{x}; and negative semi-definite if xHx0\mathbf{x}^* H \mathbf{x} \leq 0 for all x\mathbf{x}. A key spectral property of positive definite Hermitian matrices is that all their eigenvalues are real and strictly positive. This follows directly from the positive definiteness condition applied to eigenvectors, ensuring the eigenvalues λ\lambda satisfy uHu=λuu>0\mathbf{u}^* H \mathbf{u} = \lambda \mathbf{u}^* \mathbf{u} > 0 for normalized eigenvectors u\mathbf{u}, implying λ>0\lambda > 0. Conversely, if all eigenvalues of a are positive, then it is positive definite. Real symmetric matrices form a special case of Hermitian matrices, as the conjugate transpose reduces to the ordinary transpose over the reals, preserving the definiteness definitions and properties in the complex framework. The Rayleigh quotient provides a normalized measure of definiteness for Hermitian matrices, defined as R(x)=xHxxxR(\mathbf{x}) = \frac{\mathbf{x}^* H \mathbf{x}}{\mathbf{x}^* \mathbf{x}} for nonzero xCn\mathbf{x} \in \mathbb{C}^n. For a positive definite Hermitian matrix HH, R(x)>0R(\mathbf{x}) > 0 holds for all nonzero x\mathbf{x}, reflecting the uniform positivity of the quadratic form relative to the vector's norm.

Notation and terminology

In the study of definite matrices, standard notation distinguishes between strict definiteness and semi-definiteness using Loewner partial ordering. For a AA, the symbol A0A \succ 0 denotes that AA is positive definite, meaning the xAx>0x^* A x > 0 for all nonzero vectors xCnx \in \mathbb{C}^n. Similarly, A0A \succeq 0 indicates positive semi-definiteness, where xAx0x^* A x \geq 0 for all xCnx \in \mathbb{C}^n, allowing zero values. The notation A0A \prec 0 is used for negative definiteness, and A0A \preceq 0 for negative semi-definiteness. Terminology in matrix theory differentiates "definite" from "semi-definite" based on the strictness of the 's positivity or negativity. A matrix is definite if the associated is strictly positive (or negative) for all nonzero vectors, whereas semi-definite allows non-strict inequality, including zero for some nonzero vectors. Matrices with eigenvalues of mixed signs—both positive and negative—are termed indefinite, contrasting with the uniform sign requirement for definite cases. In mathematical literature, the phrase "positive definite matrix" is predominantly used over the more general "definite matrix" to explicitly indicate the positive sign of eigenvalues, avoiding ambiguity with negative definite counterparts. This convention ensures clarity in contexts like optimization and spectral analysis, where the sign directly impacts applications. Sylvester's law of inertia provides a canonical classification for real symmetric matrices under congruence, specifying the inertia as the triple (p,q,r)(p, q, r), where pp is the number of positive eigenvalues, qq the number of negative eigenvalues, and r=npqr = n - p - q the multiplicity of the zero eigenvalue, with nn the matrix dimension. This signature remains invariant under nonsingular congruence transformations.

Examples

Positive definite examples

A diagonal matrix with all positive diagonal entries is positive definite, as the associated quadratic form reduces to a weighted sum of squares with positive weights. For instance, consider the 2×2 diagonal matrix D=(1002)D = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}. The quadratic form is xTDx=x12+2x22\mathbf{x}^T D \mathbf{x} = x_1^2 + 2 x_2^2, which is strictly positive for any nonzero x=(x1,x2)TR2\mathbf{x} = (x_1, x_2)^T \in \mathbb{R}^2 since both coefficients are positive. A common non-diagonal example arises in statistics as a , such as A=(2112)A = \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix}, which models the variances and covariance of two correlated random variables. To verify , compute the eigenvalues by solving the characteristic equation det(AλI)=(2λ)21=0\det(A - \lambda I) = (2 - \lambda)^2 - 1 = 0, yielding λ24λ+3=0\lambda^2 - 4\lambda + 3 = 0 or (λ3)(λ1)=0(\lambda - 3)(\lambda - 1) = 0, so the eigenvalues are λ1=3>0\lambda_1 = 3 > 0 and λ2=1>0\lambda_2 = 1 > 0. Alternatively, evaluate the xTAx=2x12+2x1x2+2x22=(x1+x2)2+x12+x22>0\mathbf{x}^T A \mathbf{x} = 2x_1^2 + 2x_1 x_2 + 2x_2^2 = (x_1 + x_2)^2 + x_1^2 + x_2^2 > 0 for x0\mathbf{x} \neq \mathbf{0}, confirming the property directly. matrices of this form are positive definite when the variables are non-degenerate. In contrast, scaling the identity matrix by -1 yields I=(1001)-I = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, which is negative definite since xT(I)x=(x12+x22)<0\mathbf{x}^T (-I) \mathbf{x} = - (x_1^2 + x_2^2) < 0 for all nonzero x\mathbf{x}.

Indefinite and semi-definite cases

A symmetric matrix is classified as indefinite if its quadratic form takes both positive and negative values for different nonzero vectors, which occurs when the matrix has both positive and negative eigenvalues. A standard example is the 2×2 matrix A=(0110)A = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, which is symmetric. The characteristic equation is det(AλI)=λ21=0\det(A - \lambda I) = \lambda^2 - 1 = 0, yielding eigenvalues λ=1\lambda = 1 and λ=1\lambda = -1. For the eigenvector corresponding to λ=1\lambda = 1, take x=(11)\mathbf{x} = \begin{pmatrix} 1 \\ 1 \end{pmatrix}; then xTAx=2>0\mathbf{x}^T A \mathbf{x} = 2 > 0. For the eigenvector corresponding to λ=1\lambda = -1, take y=(11)\mathbf{y} = \begin{pmatrix} 1 \\ -1 \end{pmatrix}; then yTAy=2<0\mathbf{y}^T A \mathbf{y} = -2 < 0, confirming that the quadratic form changes sign. In contrast to positive definite matrices, where all eigenvalues are positive and the quadratic form is always positive for nonzero vectors, semi-definite matrices allow zero eigenvalues. A symmetric matrix is positive semi-definite if all eigenvalues are nonnegative and the quadratic form is nonnegative for all vectors. An example is the rank-1 matrix B=(1111)B = \begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}, which is symmetric. The characteristic equation is det(BλI)=λ(λ2)=0\det(B - \lambda I) = \lambda(\lambda - 2) = 0, yielding eigenvalues λ=2\lambda = 2 and λ=0\lambda = 0. To verify, consider the nonzero vector z=(11)\mathbf{z} = \begin{pmatrix} 1 \\ -1 \end{pmatrix}; then Bz=(00)B \mathbf{z} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}, so zTBz=0\mathbf{z}^T B \mathbf{z} = 0, satisfying the semi-definite condition without being strictly positive. For semi-definite matrices, the multiplicity of the zero eigenvalue equals the nullity, which is nrn - r where nn is the matrix dimension and rr is the rank; in the example above, the zero eigenvalue has multiplicity 1, matching the nullity since \rank(B)=1\rank(B) = 1. Negative semi-definite matrices follow analogously, with all eigenvalues nonpositive and the quadratic form nonpositive.

Spectral properties

Eigenvalues

For real symmetric matrices, all eigenvalues are real numbers. This follows from the spectral theorem for symmetric matrices, which guarantees that such matrices are diagonalizable over the reals with orthogonal eigenvectors. Similarly, for , which generalize symmetric matrices to the complex case, all eigenvalues are also real. This property arises because the Hermitian adjoint preserves the inner product structure, ensuring that eigenvalues satisfy a reality condition derived from the . A real symmetric matrix AA is positive definite if and only if all its eigenvalues λi\lambda_i satisfy λi>0\lambda_i > 0. It is negative definite if and only if λi<0\lambda_i < 0 for all ii. It is positive semi-definite if λi0\lambda_i \geq 0 for all ii, with at least one zero eigenvalue in the semi-definite case excluding the definite one. It is negative semi-definite if λi0\lambda_i \leq 0 for all ii, with at least one zero eigenvalue excluding the definite case. The same characterizations hold for , where definiteness is defined via the Hermitian inner product. These conditions link the spectral properties directly to the quadratic form xAx>0x^* A x > 0 (or <0< 0) for all nonzero xx in the positive (or negative) definite case, and 0\geq 0 (or 0\leq 0) in the semi-definite cases. The Gershgorin circle theorem provides bounds on the possible locations of eigenvalues for any square matrix, including definite ones. For a matrix A=(aij)A = (a_{ij}), every eigenvalue lies within at least one of the disks centered at aiia_{ii} with radius jiaij\sum_{j \neq i} |a_{ij}| in the complex plane. For positive definite matrices, where eigenvalues are positive reals, this theorem can confirm that all disks lie in the positive half-plane if the diagonal entries are positive and sufficiently dominant, thus supporting definiteness without full computation. Similarly, for negative definite matrices, all disks can lie in the negative half-plane if the diagonal entries are negative and sufficiently dominant. The min-max theorem, also known as the Courant-Fischer theorem, characterizes the eigenvalues of a symmetric or Hermitian matrix AA through variational principles. The smallest eigenvalue is given by λmin=minx0xAxxx,\lambda_{\min} = \min_{x \neq 0} \frac{x^* A x}{x^* x}, with the maximum yielding λmax\lambda_{\max}. More generally, the kk-th smallest eigenvalue satisfies λk=mindimS=kmaxxS,x0xAxxx=maxdimT=nk+1minxT,x0xAxxx,\lambda_k = \min_{\dim S = k} \max_{x \in S, x \neq 0} \frac{x^* A x}{x^* x} = \max_{\dim T = n-k+1} \min_{x \in T, x \neq 0} \frac{x^* A x}{x^* x}, where SS and TT are subspaces. This theorem underscores how definiteness corresponds to the Rayleigh quotient being bounded away from zero or non-negative over all directions for positive cases, and bounded above zero or non-positive for negative cases (with λmax<0\lambda_{\max} < 0 for negative definite). The spectral radius ρ(A)=maxiλi\rho(A) = \max_i |\lambda_i| plays a key role in . For positive definite matrices, ρ(A)=λmax>0\rho(A) = \lambda_{\max} > 0. For negative definite matrices, ρ(A)=λmin>0\rho(A) = -\lambda_{\min} > 0. Bounds on ρ(A)\rho(A) (e.g., via norms like ρ(A)A2\rho(A) \leq \|A\|_2) can verify or imply when combined with trace positivity (or negativity) or other conditions. This relation is particularly useful in stability analysis and iterative methods.

Trace and determinants

For a positive definite matrix AA, the trace tr(A)\operatorname{tr}(A) equals the sum of its eigenvalues λi\lambda_i, all of which are positive, so tr(A)=λi>0\operatorname{tr}(A) = \sum \lambda_i > 0. For a negative definite matrix, tr(A)=λi<0\operatorname{tr}(A) = \sum \lambda_i < 0. The determinant det(A)\det(A) is the product of the eigenvalues, yielding det(A)=λi>0\det(A) = \prod \lambda_i > 0 for positive definite matrices. For negative definite matrices, det(A)=λi\det(A) = \prod \lambda_i has sign (1)n(-1)^n, where nn is the matrix dimension, and det(A)>0|\det(A)| > 0. The log-determinant logdet(A)=logλi\log \det(A) = \sum \log \lambda_i is a on the of positive definite matrices and plays a key role in problems, such as and for multivariate Gaussians. For a positive semi-definite matrix AA, the det(A)0\det(A) \geq 0, but det(A)=0\det(A) = 0 if AA is singular (i.e., has at least one zero eigenvalue). For negative semi-definite matrices, det(A)=0\det(A) = 0 if singular, and otherwise follows the sign (1)n(-1)^n with positive magnitude for the definite case. Hadamard's inequality states that for a positive definite matrix A=(aij)A = (a_{ij}), det(A)i=1naii\det(A) \leq \prod_{i=1}^n a_{ii}, with equality if and only if AA is diagonal or a thereof.

Decompositions

Cholesky decomposition

The , also known as , applies exclusively to positive definite matrices and provides a into the product of a and its . For a real symmetric positive definite matrix ARn×nA \in \mathbb{R}^{n \times n}, there exists a unique lower LRn×nL \in \mathbb{R}^{n \times n} with positive diagonal entries such that A=LLT.A = L L^T. This decomposition is guaranteed by the of AA, which ensures all leading principal minors are positive and allows the square roots of the diagonal terms to be real and positive. The uniqueness follows from the requirement that the diagonal entries of LL are strictly positive; without this convention, the would hold up to sign changes in the columns of LL, but the positive diagonal fixes it uniquely. The standard algorithm computes LL column by column in a forward substitution manner, leveraging the symmetry of AA to halve the storage and work compared to general LU factorization. For k=1k = 1 to nn, compute the kk-th column of LL as follows: lkk=akkm=1k1lkm2,l_{kk} = \sqrt{ a_{kk} - \sum_{m=1}^{k-1} l_{km}^2 },
Add your contribution
Related Hubs
Contribute something
User Avatar
No comments yet.