Hubbry Logo
Basis (linear algebra)Basis (linear algebra)Main
Open search
Basis (linear algebra)
Community hub
Basis (linear algebra)
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Basis (linear algebra)
Basis (linear algebra)
from Wikipedia
The same vector can be represented in two different bases (purple and red arrows).

In mathematics, a set B of elements of a vector space V is called a basis (pl.: bases) if every element of V can be written in a unique way as a finite linear combination of elements of B. The coefficients of this linear combination are referred to as components or coordinates of the vector with respect to B. The elements of a basis are called basis vectors.

Equivalently, a set B is a basis if its elements are linearly independent and every element of V is a linear combination of elements of B.[1] In other words, a basis is a linearly independent spanning set.

A vector space can have several bases; however all the bases have the same number of elements, called the dimension of the vector space.

This article deals mainly with finite-dimensional vector spaces. However, many of the principles are also valid for infinite-dimensional vector spaces.

Basis vectors find applications in the study of crystal structures and frames of reference.

Definition

[edit]

A basis B of a vector space V over a field F (such as the real numbers R or the complex numbers C) is a linearly independent subset of V that spans V. This means that a subset B of V is a basis if it satisfies the two following conditions:

  • linear independence: for every finite subset of B, if for some in F, then ;
  • spanning property: for every vector v in V, one can choose in F and in B such that .

The scalars are called the coordinates of the vector v with respect to the basis B, and by the first property they are uniquely determined.

A vector space that has a finite basis is called finite-dimensional. In this case, the finite subset can be taken as B itself to check for linear independence in the above definition.

It is often convenient or even necessary to have an ordering on the basis vectors, for example, when discussing orientation, or when one considers the scalar coefficients of a vector with respect to a basis without referring explicitly to the basis elements. In this case, the ordering is necessary for associating each coefficient to the corresponding basis element. This ordering can be done by numbering the basis elements. In order to emphasize that an order has been chosen, one speaks of an ordered basis, which is therefore not simply an unstructured set, but a sequence, an indexed family, or similar; see § Ordered bases and coordinates below.

Examples

[edit]
This picture illustrates the standard basis in R2. The blue and orange vectors are the elements of the basis; the green vector can be given in terms of the basis vectors, and so is linearly dependent upon them.

The set R2 of the ordered pairs of real numbers is a vector space under the operations of component-wise addition and scalar multiplication where is any real number. A simple basis of this vector space consists of the two vectors e1 = (1, 0) and e2 = (0, 1). These vectors form a basis (called the standard basis) because any vector v = (a, b) of R2 may be uniquely written as Any other pair of linearly independent vectors of R2, such as (1, 1) and (−1, 2), forms also a basis of R2.

More generally, if F is a field, the set of n-tuples of elements of F is a vector space for similarly defined addition and scalar multiplication. Let be the n-tuple with all components equal to 0, except the ith, which is 1. Then is a basis of which is called the standard basis of

A different flavor of example is given by polynomial rings. If F is a field, the collection F[X] of all polynomials in one indeterminate X with coefficients in F is an F-vector space. One basis for this space is the monomial basis B, consisting of all monomials: Any set of polynomials such that there is exactly one polynomial of each degree (such as the Bernstein basis polynomials or Chebyshev polynomials) is also a basis. (Such a set of polynomials is called a polynomial sequence.) But there are also many bases for F[X] that are not of this form.

Properties

[edit]

Many properties of finite bases result from the Steinitz exchange lemma, which states that, for any vector space V, given a finite spanning set S and a linearly independent set L of n elements of V, one may replace n well-chosen elements of S by the elements of L to get a spanning set containing L, having its other elements in S, and having the same number of elements as S.

Most properties resulting from the Steinitz exchange lemma remain true when there is no finite spanning set, but their proofs in the infinite case generally require the axiom of choice or a weaker form of it, such as the ultrafilter lemma.

If V is a vector space over a field F, then:

  • If L is a linearly independent subset of a spanning set SV, then there is a basis B such that
  • V has a basis (this is the preceding property with L being the empty set, and S = V).
  • All bases of V have the same cardinality, which is called the dimension of V. This is the dimension theorem.
  • A generating set S is a basis of V if and only if it is minimal, that is, no proper subset of S is also a generating set of V.
  • A linearly independent set L is a basis if and only if it is maximal, that is, it is not a proper subset of any linearly independent set.

If V is a vector space of dimension n, then:

  • A subset of V with n elements is a basis if and only if it is linearly independent.
  • A subset of V with n elements is a basis if and only if it is a spanning set of V.

Coordinates

[edit]

Let V be a vector space of finite dimension n over a field F, and be a basis of V. By definition of a basis, every v in V may be written, in a unique way, as where the coefficients are scalars (that is, elements of F), which are called the coordinates of v over B. However, if one talks of the set of the coefficients, one loses the correspondence between coefficients and basis elements, and several vectors may have the same set of coefficients. For example, and have the same set of coefficients {2, 3}, and are different. It is therefore often convenient to work with an ordered basis; this is typically done by indexing the basis elements by the first natural numbers. Then, the coordinates of a vector form a sequence similarly indexed, and a vector is completely characterized by the sequence of coordinates. An ordered basis, especially when used in conjunction with an origin, is also called a coordinate frame or simply a frame (for example, a Cartesian frame or an affine frame).

Let, as usual, be the set of the n-tuples of elements of F. This set is an F-vector space, with addition and scalar multiplication defined component-wise. The map is a linear isomorphism from the vector space onto V. In other words, is the coordinate space of V, and the n-tuple is the coordinate vector of v.

The inverse image by of is the n-tuple all of whose components are 0, except the ith that is 1. The form an ordered basis of , which is called its standard basis or canonical basis. The ordered basis B is the image by of the canonical basis of .

It follows from what precedes that every ordered basis is the image by a linear isomorphism of the canonical basis of , and that every linear isomorphism from onto V may be defined as the isomorphism that maps the canonical basis of onto a given ordered basis of V. In other words, it is equivalent to define an ordered basis of V, or a linear isomorphism from onto V.

Change of basis

[edit]

Let V be a vector space of dimension n over a field F. Given two (ordered) bases and of V, it is often useful to express the coordinates of a vector x with respect to in terms of the coordinates with respect to This can be done by the change-of-basis formula, that is described below. The subscripts "old" and "new" have been chosen because it is customary to refer to and as the old basis and the new basis, respectively. It is useful to describe the old coordinates in terms of the new ones, because, in general, one has expressions involving the old coordinates, and if one wants to obtain equivalent expressions in terms of the new coordinates; this is obtained by replacing the old coordinates by their expressions in terms of the new coordinates.

Typically, the new basis vectors are given by their coordinates over the old basis, that is, If and are the coordinates of a vector x over the old and the new basis respectively, the change-of-basis formula is for i = 1, ..., n.

This formula may be concisely written in matrix notation. Let A be the matrix of the , and be the column vectors of the coordinates of v in the old and the new basis respectively, then the formula for changing coordinates is

The formula can be proven by considering the decomposition of the vector x on the two bases: one has and

The change-of-basis formula results then from the uniqueness of the decomposition of a vector over a basis, here ; that is for i = 1, ..., n.

[edit]

Free module

[edit]

If one replaces the field occurring in the definition of a vector space by a ring, one gets the definition of a module. For modules, linear independence and spanning sets are defined exactly as for vector spaces, although "generating set" is more commonly used than that of "spanning set".

Like for vector spaces, a basis of a module is a linearly independent subset that is also a generating set. A major difference with the theory of vector spaces is that not every module has a basis. A module that has a basis is called a free module. Free modules play a fundamental role in module theory, as they may be used for describing the structure of non-free modules through free resolutions.

A module over the integers is exactly the same thing as an abelian group. Thus a free module over the integers is also a free abelian group. Free abelian groups have specific properties that are not shared by modules over other rings. Specifically, every subgroup of a free abelian group is a free abelian group, and, if G is a subgroup of a finitely generated free abelian group H (that is an abelian group that has a finite basis), then there is a basis of H and an integer 0 ≤ kn such that is a basis of G, for some nonzero integers . For details, see Free abelian group § Subgroups.

Analysis

[edit]

In the context of infinite-dimensional vector spaces over the real or complex numbers, the term Hamel basis (named after Georg Hamel[2]) or algebraic basis can be used to refer to a basis as defined in this article. This is to make a distinction with other notions of "basis" that exist when infinite-dimensional vector spaces are endowed with extra structure. The most important alternatives are orthogonal bases on Hilbert spaces, Schauder bases, and Markushevich bases on normed linear spaces. In the case of the real numbers R viewed as a vector space over the field Q of rational numbers, Hamel bases are uncountable, and have specifically the cardinality of the continuum, which is the cardinal number , where (aleph-nought) is the smallest infinite cardinal, the cardinal of the integers.

The common feature of the other notions is that they permit the taking of infinite linear combinations of the basis vectors in order to generate the space. This, of course, requires that infinite sums are meaningfully defined on these spaces, as is the case for topological vector spaces – a large class of vector spaces including e.g. Hilbert spaces, Banach spaces, or Fréchet spaces.

The preference of other types of bases for infinite-dimensional spaces is justified by the fact that the Hamel basis becomes "too big" in Banach spaces: If X is an infinite-dimensional normed vector space that is complete (i.e. X is a Banach space), then any Hamel basis of X is necessarily uncountable. This is a consequence of the Baire category theorem. The completeness as well as infinite dimension are crucial assumptions in the previous claim. Indeed, finite-dimensional spaces have by definition finite bases and there are infinite-dimensional (non-complete) normed spaces that have countable Hamel bases. Consider , the space of the sequences of real numbers that have only finitely many non-zero elements, with the norm . Its standard basis, consisting of the sequences having only one non-zero element, which is equal to 1, is a countable Hamel basis.

Example

[edit]

In the study of Fourier series, one learns that the functions {1} ∪ { sin(nx), cos(nx) : n = 1, 2, 3, ... } are an "orthogonal basis" of the (real or complex) vector space of all (real or complex valued) functions on the interval [0, 2π] that are square-integrable on this interval, i.e., functions f satisfying

The functions {1} ∪ { sin(nx), cos(nx) : n = 1, 2, 3, ... } are linearly independent, and every function f that is square-integrable on [0, 2π] is an "infinite linear combination" of them, in the sense that

for suitable (real or complex) coefficients ak, bk. But many[3] square-integrable functions cannot be represented as finite linear combinations of these basis functions, which therefore do not comprise a Hamel basis. Every Hamel basis of this space is much bigger than this merely countably infinite set of functions. Hamel bases of spaces of this kind are typically not useful, whereas orthonormal bases of these spaces are essential in Fourier analysis.

Geometry

[edit]

The geometric notions of an affine space, projective space, convex set, and cone have related notions of basis.[4] An affine basis for an n-dimensional affine space is points in general linear position. A projective basis is points in general position, in a projective space of dimension n. A convex basis of a polytope is the set of the vertices of its convex hull. A cone basis[5] consists of one point by edge of a polygonal cone. See also a Hilbert basis (linear programming).

Random basis

[edit]

For a probability distribution in Rn with a probability density function, such as the equidistribution in an n-dimensional ball with respect to Lebesgue measure, it can be shown that n randomly and independently chosen vectors will form a basis with probability one, which is due to the fact that n linearly dependent vectors x1, ..., xn in Rn should satisfy the equation det[x1xn] = 0 (zero determinant of the matrix with columns xi), and the set of zeros of a non-trivial polynomial has zero measure. This observation has led to techniques for approximating random bases.[6][7]

Empirical distribution of lengths N of pairwise almost orthogonal chains of vectors that are independently randomly sampled from the n-dimensional cube [−1, 1]n as a function of dimension, n. Boxplots show the second and third quartiles of this data for each n, red bars correspond to the medians, and blue stars indicate means. Red curve shows theoretical bound given by Eq. (1) and green curve shows a refined estimate.[7]

It is difficult to check numerically the linear dependence or exact orthogonality. Therefore, the notion of ε-orthogonality is used. For spaces with inner product, x is ε-orthogonal to y if (that is, cosine of the angle between x and y is less than ε).

In high dimensions, two independent random vectors are with high probability almost orthogonal, and the number of independent random vectors, which all are with given high probability pairwise almost orthogonal, grows exponentially with dimension. More precisely, consider equidistribution in n-dimensional ball. Choose N independent random vectors from a ball (they are independent and identically distributed). Let θ be a small positive number. Then for

N random vectors are all pairwise ε-orthogonal with probability 1 − θ.[7] This N growth exponentially with dimension n and for sufficiently big n. This property of random bases is a manifestation of the so-called measure concentration phenomenon.[8]

The figure (right) illustrates distribution of lengths N of pairwise almost orthogonal chains of vectors that are independently randomly sampled from the n-dimensional cube [−1, 1]n as a function of dimension, n. A point is first randomly selected in the cube. The second point is randomly chosen in the same cube. If the angle between the vectors was within π/2 ± 0.037π/2 then the vector was retained. At the next step a new vector is generated in the same hypercube, and its angles with the previously generated vectors are evaluated. If these angles are within π/2 ± 0.037π/2 then the vector is retained. The process is repeated until the chain of almost orthogonality breaks, and the number of such pairwise almost orthogonal vectors (length of the chain) is recorded. For each n, 20 pairwise almost orthogonal chains were constructed numerically for each dimension. Distribution of the length of these chains is presented.

Proof that every vector space has a basis

[edit]

Let V be any vector space over some field F. Let X be the set of all linearly independent subsets of V.

The set X is nonempty since the empty set is an independent subset of V, and it is partially ordered by inclusion, which is denoted, as usual, by .

Let Y be a subset of X that is totally ordered by , and let LY be the union of all the elements of Y (which are themselves certain subsets of V).

Since (Y, ⊆) is totally ordered, every finite subset of LY is a subset of an element of Y, which is a linearly independent subset of V, and hence LY is linearly independent. Thus LY is an element of X. Therefore, LY is an upper bound for Y in (X, ⊆): it is an element of X, that contains every element of Y.

As X is nonempty, and every totally ordered subset of (X, ⊆) has an upper bound in X, Zorn's lemma asserts that X has a maximal element. In other words, there exists some element Lmax of X satisfying the condition that whenever Lmax ⊆ L for some element L of X, then L = Lmax.

It remains to prove that Lmax is a basis of V. Since Lmax belongs to X, we already know that Lmax is a linearly independent subset of V.

If there were some vector w of V that is not in the span of Lmax, then w would not be an element of Lmax either. Let Lw = Lmax ∪ {w}. This set is an element of X, that is, it is a linearly independent subset of V (because w is not in the span of Lmax, and Lmax is independent). As Lmax ⊆ Lw, and Lmax ≠ Lw (because Lw contains the vector w that is not contained in Lmax), this contradicts the maximality of Lmax. Thus this shows that Lmax spans V.

Hence Lmax is linearly independent and spans V. It is thus a basis of V, and this proves that every vector space has a basis.

This proof relies on Zorn's lemma, which is equivalent to the axiom of choice. Conversely, it has been proved that if every vector space has a basis, then the axiom of choice is true.[9] Thus the two assertions are equivalent.

See also

[edit]

Notes

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
In linear algebra, a basis for a VV over a field FF is defined as a linearly independent set of vectors that spans VV, meaning every vector in VV can be uniquely expressed as a finite of the basis vectors. This dual property ensures that a basis serves as a minimal generating set, providing a for VV where each vector corresponds to a unique of scalars (coordinates) with respect to the basis. The concept of a basis is fundamental to understanding the structure of vector spaces, as it allows for the definition of the dimension of VV, which is the number of vectors in any basis for VV; all bases for a given have the same , a result known as the invariance of dimension. Bases can be ordered or unordered, with ordered bases enabling the construction of coordinate maps that facilitate computations such as matrix representations of linear transformations. In finite-dimensional spaces like Rn\mathbb{R}^n, the consists of the unit vectors e1,,ene_1, \dots, e_n, but alternative bases, such as orthonormal bases from Gram-Schmidt orthogonalization, are crucial for applications in and optimization. Beyond finite dimensions, bases extend to infinite-dimensional spaces in , where Hamel bases (algebraic bases) exist under the but are often non-constructive and less practical than Schauder bases used in Banach spaces. The existence and uniqueness theorems for bases underpin key results in linear algebra, including the rank-nullity theorem, which relates the of subspaces like the column space and null space of a matrix. In practice, identifying a basis involves verifying (e.g., via checks for square matrices) and spanning (e.g., row reduction to echelon form), making it a tool for solving systems of linear equations and decomposing complex problems.

Core Concepts

Definition

In linear algebra, a basis for a VV over a field FF is a set BB of vectors in VV that is linearly independent and spans VV. This means that every vector in VV can be expressed as a finite of elements from BB, with coefficients in FF, and such a representation is unique for each vector in VV. Linear independence of BB ensures that no vector in BB can be written as a linear combination of the other vectors in BB, while the spanning property guarantees that the linear combinations of vectors in BB generate all of VV. The uniqueness of the representation follows directly from the linear independence of BB, as any two distinct linear combinations equaling the same vector would imply a nontrivial linear dependence relation among the basis vectors. The dimension of the vector space VV, denoted dimV\dim V, is defined as the (number of elements) of any basis for VV; all bases for VV have the same . In the finite-dimensional case, where dimV=n<\dim V = n < \infty, a basis is often denoted as {e1,e2,,en}\{e_1, e_2, \dots, e_n\}, with each eie_i a vector in VV.

Properties

A set B={v1,,vn}B = \{v_1, \dots, v_n\} of vectors in a vector space VV over a field FF is a basis if and only if it is linearly independent and spans VV. Equivalently, if dimV=n\dim V = n is known, then BB is a basis if it spans VV and has exactly nn elements. In a finite-dimensional vector space VV, all bases have the same number of elements; this common number is the dimension of VV, denoted dimV\dim V. This invariance of dimension follows from the Steinitz exchange lemma, which establishes that the sizes of bases are uniquely determined. Any linearly independent set in a vector space VV can be extended to a basis of VV. In the finite-dimensional case, this is achieved by iteratively applying the Steinitz exchange lemma to add vectors from a spanning set until a basis is obtained. For the general case, the extension relies on applied to the partially ordered set of linearly independent supersets, ensuring a maximal linearly independent set exists, which must then span VV. The Steinitz exchange lemma provides a key tool for proving dimension invariance. Suppose {v1,,vm}\{v_1, \dots, v_m\} is a linearly independent subset of VV and {u1,,un}\{u_1, \dots, u_n\} is a spanning set for VV, with mnm \leq n. Then there exist indices j1,,jmj_1, \dots, j_m such that {v1,,vm,uj1,,ujnm}\{v_1, \dots, v_m, u_{j_1}, \dots, u_{j_{n-m}}\} spans VV. Intuitively, this lemma allows "exchanging" elements from the spanning set with those from the independent set without losing the spanning property, preserving the size while incorporating the independent vectors; repeated application shows that no linearly independent set can exceed the size of a spanning set, equating basis cardinalities. As an application, if VV and WW are subspaces of a vector space such that VWV \oplus W is their internal direct sum, then dim(VW)=dimV+dimW\dim(V \oplus W) = \dim V + \dim W. This follows by taking bases for VV and WW, which combine to form a basis for the direct sum.

Examples

Finite-Dimensional Cases

In finite-dimensional vector spaces, bases provide explicit sets of vectors that span the space while remaining linearly independent. A prototypical example is the standard basis for the Euclidean space Rn\mathbb{R}^n, consisting of the vectors e1=(1,0,,0)e_1 = (1, 0, \dots, 0), e2=(0,1,,0)e_2 = (0, 1, \dots, 0), up to en=(0,,0,1)e_n = (0, \dots, 0, 1). These vectors span Rn\mathbb{R}^n because any vector (x1,x2,,xn)(x_1, x_2, \dots, x_n) can be expressed as the linear combination x1e1+x2e2++xnenx_1 e_1 + x_2 e_2 + \dots + x_n e_n. Their linear independence follows from the fact that the only solution to c1e1++cnen=0c_1 e_1 + \dots + c_n e_n = 0 is c1==cn=0c_1 = \dots = c_n = 0, as this equation yields the system ci=0c_i = 0 for each ii. Another common finite-dimensional space is the vector space PnP_n of all polynomials with real coefficients of degree at most nn, which has dimension n+1n+1. The set {1,x,x2,,xn}\{1, x, x^2, \dots, x^n\} forms a basis for PnP_n, as any polynomial p(x)=a0+a1x++anxnp(x) = a_0 + a_1 x + \dots + a_n x^n is precisely the linear combination a01+a1x++anxna_0 \cdot 1 + a_1 \cdot x + \dots + a_n \cdot x^n, ensuring spanning. Linear independence holds because if c01+c1x++cnxn=0c_0 \cdot 1 + c_1 \cdot x + \dots + c_n \cdot x^n = 0 for all xx, then all coefficients ci=0c_i = 0, by the uniqueness of polynomial representations. Bases also arise naturally in the context of matrices. For an m×nm \times n matrix AA with full column rank nn (i.e., rank nn), the columns of AA form a basis for the column space Col(A)\operatorname{Col}(A), a subspace of Rm\mathbb{R}^m. Consider the invertible 2×22 \times 2 matrix A=(1234)A = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}; its columns (1,3)(1,3) and (2,4)(2,4) are linearly independent since det(A)=1423=20\det(A) = 1 \cdot 4 - 2 \cdot 3 = -2 \neq 0, and they span R2\mathbb{R}^2 because AA maps R2\mathbb{R}^2 onto R2\mathbb{R}^2. Thus, {(1,3),(2,4)}\{(1,3), (2,4)\} is a basis for R2\mathbb{R}^2. Non-standard bases illustrate the flexibility of basis choice. In R2\mathbb{R}^2, the set B={(1,1),(1,1)}B = \{(1,1), (1,-1)\} serves as a basis: any vector (x,y)(x,y) can be written as a linear combination, specifically x+y2(1,1)+xy2(1,1)=(x,y)\frac{x+y}{2} (1,1) + \frac{x-y}{2} (1,-1) = (x,y), confirming spanning. Linear independence is verified by noting that the matrix with these columns, (1111)\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}, has determinant 1(1)11=201 \cdot (-1) - 1 \cdot 1 = -2 \neq 0. To verify whether a candidate set forms a basis for a finite-dimensional space, two checks are essential: spanning and linear independence. Spanning can be assessed by forming the matrix with the candidate vectors as columns (or rows) and using row reduction to determine if the row-reduced echelon form has a pivot in every row corresponding to the space's dimension, indicating full rank. For linear independence in Rn\mathbb{R}^n, arrange the vectors as columns of an n×nn \times n matrix and compute its determinant; a nonzero value confirms independence, as the only solution to the homogeneous system is the zero vector.

Infinite-Dimensional Cases

In infinite-dimensional vector spaces, the concept of a basis, specifically a Hamel basis, introduces significant differences from the finite-dimensional case, primarily due to the requirement that every vector be expressed as a finite linear combination of basis elements. The existence of such bases for arbitrary vector spaces, including infinite-dimensional ones, relies on the axiom of choice. Consider the vector space C[0,1]C[0,1] of all continuous real-valued functions on the interval [0,1][0,1], equipped with pointwise addition and scalar multiplication. This space admits a Hamel basis only under the axiom of choice, and without it, the existence of an algebraic basis is not guaranteed. The basis, when it exists, is highly pathological, uncountable, and lacks any explicit construction, contrasting sharply with intuitive spanning sets like polynomials or trigonometric functions, which fail to form an algebraic basis because they do not span via finite combinations alone. A more straightforward example is the vector space P(R)\mathbb{P}(\mathbb{R}) of all polynomials with real coefficients, which is infinite-dimensional. The set {1,x,x2,x3,}\{1, x, x^2, x^3, \dots \} forms a countable Hamel basis for this space: it is linearly independent over R\mathbb{R}, as any finite linear dependence relation k=0nakxk=0\sum_{k=0}^n a_k x^k = 0 for all xx implies all coefficients ak=0a_k = 0, and it spans P(R)\mathbb{P}(\mathbb{R}) because every polynomial is a finite linear combination of these monomials. In the sequence space 2\ell^2 of square-summable real sequences, which is a separable infinite-dimensional , a Hamel basis exists via the axiom of choice but is uncountable and non-constructive. This uncountability arises because any countable set in 2\ell^2 spans only a countable union of finite-dimensional subspaces, which cannot cover the uncountable dimension of the space. The algebraic Hamel basis differs fundamentally from the orthonormal (Hilbert) basis in such spaces; the latter, like the standard basis {en}\{e_n\} where ene_n has 1 in the nnth position and 0 elsewhere, allows representation via convergent infinite series cnen\sum c_n e_n, whereas a Hamel basis mandates finite support in all linear combinations. This restriction poses a core challenge: in infinite dimensions, most elements require only finitely many basis coefficients to be nonzero, limiting the utility of Hamel bases for analysis and preventing the capture of infinite expansions natural to the space's topology.

Representation

Coordinates

Given a basis B={b1,,bn}B = \{ \mathbf{b}_1, \dots, \mathbf{b}_n \} for a finite-dimensional vector space VV over a field FF, every vector vV\mathbf{v} \in V can be uniquely expressed as a linear combination v=c1b1++cnbn\mathbf{v} = c_1 \mathbf{b}_1 + \dots + c_n \mathbf{b}_n, where the scalars c1,,cnFc_1, \dots, c_n \in F are called the coordinates of v\mathbf{v} with respect to BB, denoted [v]B=(c1,,cn)[\mathbf{v}]_B = (c_1, \dots, c_n). This representation is unique due to the linear independence of the basis vectors. The coordinate map ϕB:VFn\phi_B: V \to F^n defined by ϕB(v)=[v]B\phi_B(\mathbf{v}) = [\mathbf{v}]_B is a linear isomorphism, preserving vector addition and scalar multiplication while providing a bijective correspondence between VV and the coordinate space FnF^n. To extract the coordinates, one solves the system of linear equations arising from the linear independence of BB, typically by expressing v\mathbf{v} in the basis vectors' components if VV is coordinatized, such as in Rn\mathbb{R}^n. For example, consider V=R2V = \mathbb{R}^2 with basis B={(1,1),(1,0)}B = \{ (1,1), (1,0) \}. To find the coordinates of v=(3,1)\mathbf{v} = (3,1), solve c1(1,1)+c2(1,0)=(3,1)c_1 (1,1) + c_2 (1,0) = (3,1), yielding the system: {c1+c2=3c1=1\begin{cases} c_1 + c_2 = 3 \\ c_1 = 1 \end{cases} Thus, c1=1c_1 = 1, c2=2c_2 = 2, so [v]B=(1,2)[\mathbf{v}]_B = (1, 2). Linear maps between vector spaces can be represented using coordinates with respect to fixed bases. If T:VWT: V \to W is linear with bases BB for VV and CC for WW, the B-coordinate matrix of TT is the matrix AA whose columns are the C-coordinates of T(bi)T(\mathbf{b}_i) for i=1,,ni = 1, \dots, n, such that [T(v)]C=A[v]B[T(\mathbf{v})]_C = A [\mathbf{v}]_B for all vV\mathbf{v} \in V. This matrix encodes the action of TT in the coordinate framework.

Change of Basis

In linear algebra, a change of basis refers to the process of expressing vectors and linear operators with respect to a different ordered basis of a , which alters their coordinate representations while preserving the underlying linear structure. This transformation is facilitated by a change of basis matrix, also known as a transition matrix, that linearly maps coordinates from one basis to another. Such changes are essential for simplifying computations, such as diagonalizing matrices or adapting to geometric interpretations./13%3A_Diagonalization/13.02%3A_Change_of_Basis) Consider two ordered bases B={b1,,bn}B = \{\mathbf{b}_1, \dots, \mathbf{b}_n\} and B={b1,,bn}B' = \{\mathbf{b}'_1, \dots, \mathbf{b}'_n\} for an nn-dimensional vector space VV over a field like R\mathbb{R} or C\mathbb{C}. Let [v]B[ \mathbf{v} ]_B denote the coordinate column vector of vV\mathbf{v} \in V with respect to BB. The change of basis matrix PP from BB to BB' satisfies [v]B=P[v]B[ \mathbf{v} ]_{B'} = P [ \mathbf{v} ]_B. To construct PP, express the bases as matrices BB and BB' whose columns are the basis vectors in coordinates with respect to some fixed basis (often the for Rn\mathbb{R}^n); then P=(B)1BP = (B')^{-1} B. The columns of PP are the coordinates of the BB-basis vectors with respect to BB'./13%3A_Diagonalization/13.02%3A_Change_of_Basis) The formula arises from the equality v=B[v]B=B[v]B\mathbf{v} = B [ \mathbf{v} ]_B = B' [ \mathbf{v} ]_{B'}, where BB and BB' are the basis matrices. Substituting yields B[v]B=B[v]BB [ \mathbf{v} ]_B = B' [ \mathbf{v} ]_{B'}, and solving for the new coordinates gives [v]B=(B)1B[v]B[ \mathbf{v} ]_{B'} = (B')^{-1} B [ \mathbf{v} ]_B, confirming P=(B)1BP = (B')^{-1} B. Since BB and BB' are bases, they are invertible (their columns are linearly independent and span VV), so PP is also invertible; specifically, P1=B(B)1P^{-1} = B (B')^{-1} transforms coordinates from BB' back to BB. This invertibility ensures the mapping is bijective, reflecting the equivalence of bases./13%3A_Diagonalization/13.02%3A_Change_of_Basis) For a concrete example in R2\mathbb{R}^2, take the standard basis B={e1=(1,0)T,e2=(0,1)T}B = \{ \mathbf{e}_1 = (1,0)^T, \mathbf{e}_2 = (0,1)^T \}, so B=I2B = I_2, and a non-standard basis B={b1=(1,1)T,b2=(1,0)T}B' = \{ \mathbf{b}'_1 = (1,1)^T, \mathbf{b}'_2 = (1,0)^T \}, so B=(1110).B' = \begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix}. The determinant of BB' is 01=100 - 1 = -1 \neq 0, confirming it is a basis. Then P=(B)1I2=(B)1P = (B')^{-1} I_2 = (B')^{-1}. The inverse is (B)1=11(0111)=(0111).(B')^{-1} = \frac{1}{-1} \begin{pmatrix} 0 & -1 \\ -1 & 1 \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ 1 & -1 \end{pmatrix}. Now consider v=(3,1)T\mathbf{v} = (3,1)^T, so [v]B=(31)[ \mathbf{v} ]_B = \begin{pmatrix} 3 \\ 1 \end{pmatrix}. The new coordinates are [v]B=P[v]B=(0111)(31)=(12).[ \mathbf{v} ]_{B'} = P [ \mathbf{v} ]_B = \begin{pmatrix} 0 & 1 \\ 1 & -1 \end{pmatrix} \begin{pmatrix} 3 \\ 1 \end{pmatrix} = \begin{pmatrix} 1 \\ 2 \end{pmatrix}. Verification: 1b1+2b2=1(1,1)T+2(1,0)T=(3,1)T=v1 \cdot \mathbf{b}'_1 + 2 \cdot \mathbf{b}'_2 = 1 \cdot (1,1)^T + 2 \cdot (1,0)^T = (3,1)^T = \mathbf{v}. Applying P1=BP^{-1} = B' recovers [v]B=B[v]B=(1110)(12)=(31)[ \mathbf{v} ]_B = B' [ \mathbf{v} ]_{B'} = \begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1 \\ 2 \end{pmatrix} = \begin{pmatrix} 3 \\ 1 \end{pmatrix}./13%3A_Diagonalization/13.02%3A_Change_of_Basis) Change of basis also affects representations of linear operators. If AA is the matrix of a linear operator T:VVT: V \to V with respect to basis BB, then the matrix AA' with respect to BB' is A=P1APA' = P^{-1} A P. This similarity transformation preserves eigenvalues, trace, determinant, and other intrinsic properties of TT, as it merely re-expresses the operator in a new coordinate system. For instance, choosing BB' as eigenvectors of TT yields a diagonal AA', simplifying analysis./13%3A_Diagonalization/13.02%3A_Change_of_Basis)

Variations

Dual Basis

The dual space of a vector space VV over a field FF, denoted VV^*, consists of all linear functionals on VV, which are linear maps from VV to FF. The vector space operations on VV^* are defined pointwise: for linear functionals ϕ,ψV\phi, \psi \in V^* and scalars cFc \in F, (ϕ+ψ)(v)=ϕ(v)+ψ(v)(\phi + \psi)(v) = \phi(v) + \psi(v) and (cϕ)(v)=cϕ(v)(c\phi)(v) = c \cdot \phi(v) for all vVv \in V. Given a basis B={biiI}B = \{b_i \mid i \in I\} for VV, where II is the index set with I=dimV|I| = \dim V, the dual basis B={fiiI}B^* = \{f_i \mid i \in I\} for VV^* is defined such that each fif_i is the unique linear functional satisfying fi(bj)=δijf_i(b_j) = \delta_{ij}, the Kronecker delta (which is 1 if i=ji = j and 0 otherwise). This construction ensures that BB^* is a basis for VV^*, as the functionals fif_i are linearly independent and span VV^*, mirroring the dimension of VV. The dual basis provides a natural correspondence between bases of VV and bases of VV^*. Any linear functional ϕV\phi \in V^* can be uniquely expressed in the dual basis as ϕ=iIcifi\phi = \sum_{i \in I} c_i f_i, where the coefficients ci=ϕ(bi)c_i = \phi(b_i) are determined by evaluating ϕ\phi on the original basis vectors. This representation highlights how the dual basis encodes the action of functionals through their values on the primal basis. The duality between VV and VV^* induces a canonical bilinear pairing v,ϕ=ϕ(v)\langle v, \phi \rangle = \phi(v) for vVv \in V and ϕV\phi \in V^*, which is linear in each argument and separates points in the respective spaces. In the finite-dimensional case over R\mathbb{R}, consider V=RnV = \mathbb{R}^n with the standard basis B={e1,,en}B = \{e_1, \dots, e_n\}, where eie_i has a 1 in the ii-th position and 0 elsewhere. The corresponding dual basis B={ε1,,εn}B^* = \{\varepsilon_1, \dots, \varepsilon_n\} consists of the coordinate functionals εi:RnR\varepsilon_i: \mathbb{R}^n \to \mathbb{R} defined by εi(x)=xi\varepsilon_i(x) = x_i, the ii-th component of x=(x1,,xn)x = (x_1, \dots, x_n). These can be represented as row vectors, with εi\varepsilon_i corresponding to the ii-th standard row vector, satisfying εi(ej)=δij\varepsilon_i(e_j) = \delta_{ij}. For instance, any linear functional ϕ:RnR\phi: \mathbb{R}^n \to \mathbb{R} is then ϕ(x)=i=1nciεi(x)\phi(x) = \sum_{i=1}^n c_i \varepsilon_i(x), where ci=ϕ(ei)c_i = \phi(e_i), recovering the standard representation ϕ(x)=i=1ncixi\phi(x) = \sum_{i=1}^n c_i x_i.

Orthonormal Basis

An orthonormal basis is defined in the context of an inner product space, where it consists of a set of vectors {ei}\{e_i\} that form a basis and satisfy the orthonormality condition ei,ej=δij\langle e_i, e_j \rangle = \delta_{ij} for all i,ji, j, meaning the vectors are pairwise orthogonal and each has unit norm. This geometric structure provides advantages over general bases by preserving lengths and angles through the inner product, facilitating computations in projections and decompositions./09%3A_Inner_product_spaces/9.05%3A_The_Gram-Schmidt_Orthogonalization_procedure) A key method to obtain an orthonormal basis from any linearly independent set {v1,v2,,vn}\{v_1, v_2, \dots, v_n\} in a finite-dimensional inner product space is the Gram-Schmidt process, introduced by Erhard Schmidt in 1907. The algorithm proceeds iteratively as follows: Start with u1=v1u_1 = v_1 and normalize e1=u1/u1e_1 = u_1 / \|u_1\|. For k=2,,nk = 2, \dots, n, define uk=vki=1k1vk,eieiu_k = v_k - \sum_{i=1}^{k-1} \langle v_k, e_i \rangle e_i to orthogonalize against previous vectors, then set ek=uk/uke_k = u_k / \|u_k\| to normalize. This yields {e1,,en}\{e_1, \dots, e_n\} as an orthonormal basis spanning the same subspace. For example, consider the basis {(1,1,1),(1,1,1),(1,1,1)}\{(1, -1, 1), (1, 1, 1), (1, 1, -1)\} in R3\mathbb{R}^3 with the Euclidean inner product. Apply Gram-Schmidt: u1=(1,1,1)u_1 = (1, -1, 1), u1=3\|u_1\| = \sqrt{3}
Add your contribution
Related Hubs
User Avatar
No comments yet.