Linear subspace

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Projectivisation F5P^1.svgProjectivisation F5P^1.svg
Projectivisation F5P^1.svgProjectivisation F5P^1.svg
One-dimensional subspaces in the two-dimensional vector space over the finite field F5. The origin (0, 0), marked with green circles, belongs to any of six 1-subspaces, while each of 24 remaining points belongs to exactly one; a property which holds for 1-subspaces over any field and in all dimensions. All F52 (i.e. a 5 × 5 square) is pictured four times for a better visualization

In mathematics, and more specifically in linear algebra, a linear subspace, also known as a vector subspace[1][2] is a vector space that is a subset of some other (higher-dimension) vector space. A linear subspace is usually called simply a subspace when the context serves to distinguish it from other kinds of subspace.

Definition[edit]

If V is a vector space over a field K and if W is a subset of V, then W is a subspace of V if under the operations of V, W is a vector space over K. Equivalently, W is a subspace of V whenever implies that .[3][4][5][6][7] It is clear from the definition of a vector space that a subspace is nonempty, contains the zero vector, and is closed under vector addition and scalar multiplication.

Examples[edit]

Example I[edit]

Let the field K be the set R of real numbers, and let the vector space V be the real coordinate space R3. Take W to be the set of all vectors in V whose last component is 0. Then W is a subspace of V.

Proof:

  1. Given u and v in W, then they can be expressed as u = (u1, u2, 0) and v = (v1, v2, 0). Then u + v = (u1+v1, u2+v2, 0+0) = (u1+v1, u2+v2, 0). Thus, u + v is an element of W, too.
  2. Given u in W and a scalar c in R, if u = (u1, u2, 0) again, then cu = (cu1, cu2, c0) = (cu1, cu2,0). Thus, cu is an element of W too.

Example II[edit]

Let the field be R again, but now let the vector space V be the Cartesian plane R2. Take W to be the set of points (x, y) of R2 such that x = y. Then W is a subspace of R2.

Example II Illustrated

Proof:

  1. Let p = (p1, p2) and q = (q1, q2) be elements of W, that is, points in the plane such that p1 = p2 and q1 = q2. Then p + q = (p1+q1, p2+q2); since p1 = p2 and q1 = q2, then p1 + q1 = p2 + q2, so p + q is an element of W.
  2. Let p = (p1, p2) be an element of W, that is, a point in the plane such that p1 = p2, and let c be a scalar in R. Then cp = (cp1, cp2); since p1 = p2, then cp1 = cp2, so cp is an element of W.

In general, any subset of the real coordinate space Rn that is defined by a system of homogeneous linear equations will yield a subspace. (The equation in example I was z = 0, and the equation in example II was x = y.) Geometrically, these subspaces are points, lines, planes, and so on, that pass through the point 0.

Example III[edit]

Again take the field to be R, but now let the vector space V be the set RR of all functions from R to R. Let C(R) be the subset consisting of continuous functions. Then C(R) is a subspace of RR.

Proof:

  1. We know from calculus that 0 ∈ C(R) ⊂ RR.
  2. We know from calculus that the sum of continuous functions is continuous.
  3. Again, we know from calculus that the product of a continuous function and a number is continuous.

Example IV[edit]

Keep the same field and vector space as before, but now consider the set Diff(R) of all differentiable functions. The same sort of argument as before shows that this is a subspace too.

Examples that extend these themes are common in functional analysis.

Properties of subspaces[edit]

A way to characterize subspaces is that they are closed under linear combinations. That is, a nonempty set W is a subspace if and only if every linear combination of (finitely many) elements of W also belongs to W. Conditions 2 and 3 for a subspace are simply the most basic kinds of linear combinations.

In a topological vector space X, a subspace W need not be closed in general, but a finite-dimensional subspace is always closed.[8] The same is true for subspaces of finite codimension, i.e. determined by a finite number of continuous linear functionals.

Descriptions[edit]

Descriptions of subspaces include the solution set to a homogeneous system of linear equations, the subset of Euclidean space described by a system of homogeneous linear parametric equations, the span of a collection of vectors, and the null space, column space, and row space of a matrix. Geometrically (especially, over the field of real numbers and its subfields), a subspace is a flat in an n-space that passes through the origin.

A natural description of a 1-subspace is the scalar multiplication of one non-zero vector v to all possible scalar values. 1-subspaces specified by two vectors are equal if and only if one vector can be obtained from another with scalar multiplication:

This idea is generalized for higher dimensions with linear span, but criteria for equality of k-spaces specified by sets of k vectors are not so simple.

A dual description is provided with linear functionals (usually implemented as linear equations). One non-zero linear functional F specifies its kernel subspace F = 0 of codimension 1. Subspaces of codimension 1 specified by two linear functionals are equal if and only if one functional can be obtained from another with scalar multiplication (in the dual space):

It is generalized for higher codimensions with a system of equations. The following two subsections will present this latter description in details, and the remaining four subsections further describe the idea of linear span.

Systems of linear equations[edit]

The solution set to any homogeneous system of linear equations with n variables is a subspace in the coordinate space Kn:

For example (over real or rational numbers), the set of all vectors (x, y, z) satisfying the equations

is a one-dimensional subspace. More generally, that is to say that given a set of n independent functions, the dimension of the subspace in Kk will be the dimension of the null set of A, the composite matrix of the n functions.

Null space of a matrix[edit]

In a finite-dimensional space, a homogeneous system of linear equations can be written as a single matrix equation:

The set of solutions to this equation is known as the null space of the matrix. For example, the subspace described above is the null space of the matrix

Every subspace of Kn can be described as the null space of some matrix (see algorithms, below).

Linear parametric equations[edit]

The subset of Kn described by a system of homogeneous linear parametric equations is a subspace:

For example, the set of all vectors (x, y, z) parameterized by the equations

is a two-dimensional subspace of K3, if K is a number field (such as real or rational numbers).[9]

Span of vectors[edit]

In linear algebra, the system of parametric equations can be written as a single vector equation:

The expression on the right is called a linear combination of the vectors (2, 5, −1) and (3, −4, 2). These two vectors are said to span the resulting subspace.

In general, a linear combination of vectors v1, v2, ... , vk is any vector of the form

The set of all possible linear combinations is called the span:

If the vectors v1, ... , vk have n components, then their span is a subspace of Kn. Geometrically, the span is the flat through the origin in n-dimensional space determined by the points v1, ... , vk.

Example
The xz-plane in R3 can be parameterized by the equations
As a subspace, the xz-plane is spanned by the vectors (1, 0, 0) and (0, 0, 1). Every vector in the xz-plane can be written as a linear combination of these two:
Geometrically, this corresponds to the fact that every point on the xz-plane can be reached from the origin by first moving some distance in the direction of (1, 0, 0) and then moving some distance in the direction of (0, 0, 1).

Column space and row space[edit]

A system of linear parametric equations in a finite-dimensional space can also be written as a single matrix equation:

In this case, the subspace consists of all possible values of the vector x. In linear algebra, this subspace is known as the column space (or image) of the matrix A. It is precisely the subspace of Kn spanned by the column vectors of A.

The row space of a matrix is the subspace spanned by its row vectors. The row space is interesting because it is the orthogonal complement of the null space (see below).

Independence, basis, and dimension[edit]

The vectors u and v are a basis for this two-dimensional subspace of R3.

In general, a subspace of Kn determined by k parameters (or spanned by k vectors) has dimension k. However, there are exceptions to this rule. For example, the subspace of K3 spanned by the three vectors (1, 0, 0), (0, 0, 1), and (2, 0, 3) is just the xz-plane, with each point on the plane described by infinitely many different values of t1, t2, t3.

In general, vectors v1, ... , vk are called linearly independent if

for (t1, t2, ... , tk) ≠ (u1, u2, ... , uk).[10] If v1, ..., vk are linearly independent, then the coordinates t1, ..., tk for a vector in the span are uniquely determined.

A basis for a subspace S is a set of linearly independent vectors whose span is S. The number of elements in a basis is always equal to the geometric dimension of the subspace. Any spanning set for a subspace can be changed into a basis by removing redundant vectors (see algorithms, below).

Example
Let S be the subspace of R4 defined by the equations
Then the vectors (2, 1, 0, 0) and (0, 0, 5, 1) are a basis for S. In particular, every vector that satisfies the above equations can be written uniquely as a linear combination of the two basis vectors:
The subspace S is two-dimensional. Geometrically, it is the plane in R4 passing through the points (0, 0, 0, 0), (2, 1, 0, 0), and (0, 0, 5, 1).

Operations and relations on subspaces[edit]

Inclusion[edit]

The set-theoretical inclusion binary relation specifies a partial order on the set of all subspaces (of any dimension).

A subspace cannot lie in any subspace of lesser dimension. If dim U = k, a finite number, and U ⊂ W, then dim W = k if and only if U = W.

Intersection[edit]

In R3, the intersection of two distinct two-dimensional subspaces is one-dimensional

Given subspaces U and W of a vector space V, then their intersection U ∩ W := {v ∈ V : v is an element of both U and W} is also a subspace of V.[11]

Proof:

  1. Let v and w be elements of U ∩ W. Then v and w belong to both U and W. Because U is a subspace, then v + w belongs to U. Similarly, since W is a subspace, then v + w belongs to W. Thus, v + w belongs to U ∩ W.
  2. Let v belong to U ∩ W, and let c be a scalar. Then v belongs to both U and W. Since U and W are subspaces, cv belongs to both U and W.
  3. Since U and W are vector spaces, then 0 belongs to both sets. Thus, 0 belongs to U ∩ W.

For every vector space V, the set {0} and V itself are subspaces of V.[12]

Sum[edit]

If U and W are subspaces, their sum is the subspace

[13]

For example, the sum of two lines is the plane that contains them both. The dimension of the sum satisfies the inequality

Here the minimum only occurs if one subspace is contained in the other, while the maximum is the most general case. The dimension of the intersection and the sum are related:

[14]

Lattice of subspaces[edit]

The operations intersection and sum make the set of all subspaces a bounded modular lattice, where the {0} subspace, the least element, is an identity element of the sum operation, and the identical subspace V, the greatest element, is an identity element of the intersection operation.

Orthogonal complements[edit]

If V is an inner product space and N is a subset of V, then the orthogonal complement N of N is again a subspace.[15] If V is finite-dimensional and N is a subspace, then the dimensions of N and N satisfy the complementation relationship {{{1}}}.[16] Moreover, no vector is orthogonal to itself, so and V is the direct sum of N and N.[17] Applying orthogonal complements twice returns the original subspace: (N) = N for every subspace N.[18]

This operation, understood as negation (¬), makes the lattice of subspaces a (possibly infinite) orthocomplemented lattice (although not a distributive lattice).[citation needed]

In spaces with other bilinear forms, some but not all of these results still hold. In pseudo-Euclidean spaces and symplectic vector spaces, for example, orthogonal complements exist. However, these spaces may have null vectors that are orthogonal to themselves, and consequently there exist subspaces N such that NN ≠ {0}. As a result, this operation does not turn the lattice of subspaces into a Boolean algebra (nor a Heyting algebra).[citation needed]

Algorithms[edit]

Most algorithms for dealing with subspaces involve row reduction. This is the process of applying elementary row operations to a matrix until it reaches either row echelon form or reduced row echelon form. Row reduction has the following important properties:

  1. The reduced matrix has the same null space as the original.
  2. Row reduction does not change the span of the row vectors, i.e. the reduced matrix has the same row space as the original.
  3. Row reduction does not affect the linear dependence of the column vectors.

Basis for a row space[edit]

Input An m × n matrix A.
Output A basis for the row space of A.
  1. Use elementary row operations to put A into row echelon form.
  2. The nonzero rows of the echelon form are a basis for the row space of A.

See the article on row space for an example.

If we instead put the matrix A into reduced row echelon form, then the resulting basis for the row space is uniquely determined. This provides an algorithm for checking whether two row spaces are equal and, by extension, whether two subspaces of Kn are equal.

Subspace membership[edit]

Input A basis {b1, b2, ..., bk} for a subspace S of Kn, and a vector v with n components.
Output Determines whether v is an element of S
  1. Create a (k + 1) × n matrix A whose rows are the vectors b1, ... , bk and v.
  2. Use elementary row operations to put A into row echelon form.
  3. If the echelon form has a row of zeroes, then the vectors {b1, ..., bk, v} are linearly dependent, and therefore vS.

Basis for a column space[edit]

Input An m × n matrix A
Output A basis for the column space of A
  1. Use elementary row operations to put A into row echelon form.
  2. Determine which columns of the echelon form have pivots. The corresponding columns of the original matrix are a basis for the column space.

See the article on column space for an example.

This produces a basis for the column space that is a subset of the original column vectors. It works because the columns with pivots are a basis for the column space of the echelon form, and row reduction does not change the linear dependence relationships between the columns.

Coordinates for a vector[edit]

Input A basis {b1, b2, ..., bk} for a subspace S of Kn, and a vector vS
Output Numbers t1, t2, ..., tk such that v = t1b1 + ··· + tkbk
  1. Create an augmented matrix A whose columns are b1,...,bk , with the last column being v.
  2. Use elementary row operations to put A into reduced row echelon form.
  3. Express the final column of the reduced echelon form as a linear combination of the first k columns. The coefficients used are the desired numbers t1, t2, ..., tk. (These should be precisely the first k entries in the final column of the reduced echelon form.)

If the final column of the reduced row echelon form contains a pivot, then the input vector v does not lie in S.

Basis for a null space[edit]

Input An m × n matrix A.
Output A basis for the null space of A
  1. Use elementary row operations to put A in reduced row echelon form.
  2. Using the reduced row echelon form, determine which of the variables x1, x2, ..., xn are free. Write equations for the dependent variables in terms of the free variables.
  3. For each free variable xi, choose a vector in the null space for which xi = 1 and the remaining free variables are zero. The resulting collection of vectors is a basis for the null space of A.

See the article on null space for an example.

Basis for the sum and intersection of two subspaces[edit]

Given two subspaces U and W of V, a basis of the sum and the intersection can be calculated using the Zassenhaus algorithm

Equations for a subspace[edit]

Input A basis {b1, b2, ..., bk} for a subspace S of Kn
Output An (n − k) × n matrix whose null space is S.
  1. Create a matrix A whose rows are b1, b2, ..., bk.
  2. Use elementary row operations to put A into reduced row echelon form.
  3. Let c1, c2, ..., cn be the columns of the reduced row echelon form. For each column without a pivot, write an equation expressing the column as a linear combination of the columns with pivots.
  4. This results in a homogeneous system of nk linear equations involving the variables c1,...,cn. The (nk) × n matrix corresponding to this system is the desired matrix with nullspace S.
Example
If the reduced row echelon form of A is
then the column vectors c1, ..., c6 satisfy the equations
It follows that the row vectors of A satisfy the equations
In particular, the row vectors of A are a basis for the null space of the corresponding matrix.

See also[edit]

Notes[edit]

  1. ^ Halmos, P. R. (1942). Finite-Dimensional Vector Spaces. Princeton, NJ: Princeton University Press. p. 14. ISBN 978-1-61427-281-6.
  2. ^ The term linear subspace is sometimes used for referring to flats and affine subspaces. In the case of vector spaces over the reals, linear subspaces, flats, and affine subspaces are also called linear manifolds for emphasizing that there are also manifolds.
  3. ^ Anton (2005, p. 155)
  4. ^ Beauregard & Fraleigh (1973, p. 176)
  5. ^ Herstein (1964, p. 132)
  6. ^ Kreyszig (1972, p. 200)
  7. ^ Nering (1970, p. 20)
  8. ^ See Paul DuChateau. "Basic Facts About Hilbert Space" (PDF). Retrieved September 17, 2012. for Hilbert spaces
  9. ^ Generally, K can be any field of such characteristic that the given integer matrix has the appropriate rank in it. All fields include integers, but some integers may equal to zero in some fields.
  10. ^ This definition is often stated differently: vectors v1, ..., vk are linearly independent if t1v1 + ··· + tkvk0 for (t1, t2, ..., tk) ≠ (0, 0, ..., 0). The two definitions are equivalent.
  11. ^ Nering (1970, p. 21)
  12. ^ Nering (1970, p. 20)
  13. ^ Nering (1970, p. 21)
  14. ^ Nering (1970, p. 22)
  15. ^ Axler (2015), 6.46.
  16. ^ Axler (2015), 6.50.
  17. ^ Axler (2015), 6.47.
  18. ^ Axler (2015), 6.51.

Textbooks[edit]

External links[edit]