Serlo: EN: Kernel of a linear map

From testwiki
Version of 10 June 2022, 18:14 by imported>Sascha Lill 95 (Exercises)

{{#invoke:Mathe für Nicht-Freaks/Seite|oben}} The kernel of a linear map intuitively contains the information that is "deleted" when applying the linear map. Further, the kernel can be used to characterize the injectivity of linear maps. It also plays a central role in solving systems of linear equations.

Introduction

We have learned about special mappings between vector spaces, called linear maps. Those are structure-preserving; that is, they are compatible with addition and scalar multiplication of a vector space. We can therefore think of a linear map from V to W as something that transports the vector space structure from V to W.

Introductory examples

We consider two accounts with account balances x and y, respectively. We can describe this information with a vector (x,y)^T ∈ ℝ². The total account balance is the sum of the two individual balances. We can compute it using the map Vorlage:Einrücken This map is linear and therefore transports the vector space structure from ℝ² to ℝ. In the process, information is lost: one no longer knows how the money is distributed among the accounts. For example, one can no longer distinguish the individual account balances (500,0)^T and (200,300)^T, because both map to the same total balance 500+0=200+300=500. In particular, the map is not injective. However, we do retain the information about how much money is in the accounts in total.
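The loss of information under the total-balance map can be made concrete in a short numeric sketch (the function name `f` is ours, chosen to match the article's notation):

```python
# Sketch of the total-balance map f((x, y)^T) = x + y from the introduction.
def f(v):
    """Sum the entries of a 2-entry account vector (x, y)."""
    x, y = v
    return x + y

# Two different distributions of 500 among the two accounts...
a = (500, 0)
b = (200, 300)

# ...are sent to the same total balance, so f cannot be injective:
# the individual balances can no longer be recovered from f(a).
assert f(a) == f(b) == 500
```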

Rotation of the real plane by 90° counterclockwise

Next, we consider the map Vorlage:Einrücken Visually, this corresponds to a counterclockwise rotation of ℝ² by 90 degrees. By undoing this rotation, one can recover the original vector from any rotated vector in ℝ². Formally speaking, this map is an isomorphism and no information is lost. In particular, the image of linearly independent vectors is again linearly independent (because an isomorphism is injective; see the article on monomorphisms) and the image of a generating set of ℝ² is again a generating set of ℝ² (because an isomorphism is surjective; see the article on epimorphisms).

Finally, we consider a rotation again, but then embed the rotated plane into ℝ³: Vorlage:Einrücken Although this map is no longer bijective, no information is lost here when transporting the vector space structure of ℝ² into ℝ³: As in the previous example, different vectors in ℝ² are mapped to different vectors in ℝ³ because of injectivity. Linear independence of vectors is also preserved. However, a generating set of ℝ² is not mapped to a generating set of ℝ³. For example, the linear map sends the standard basis {(1,0)^T,(0,1)^T} to {(0,1,0)^T,(1,0,0)^T}, which is not a generating set of ℝ³. The property of a set of vectors being a generating set depends on the ambient space. This is not the case for linear independence; it is an "intrinsic" property of sets of vectors.

Derivation Vorlage:Anker

We have seen various examples of linear maps that transport a K-vector space into another K-vector space while preserving the structure. In the process, varying amounts of "intrinsic" information from the original vector space (such as differences of vectors or linear independence) were lost. The last example suggests that injective maps preserve such intrinsic properties. On the other hand, we see: If f: V → W is not injective, then there are vectors v, v′ ∈ V with v ≠ v′ and f(v) = f(v′). So in that case, f "eliminates" the difference v − v′ of v and v′. The difference v − v′ is again an element of V. Since f is linear, we can reformulate: Vorlage:Einrücken Intuitively, f is injective if and only if no difference v − v′ of distinct vectors is eliminated under f (i.e., mapped to zero). Because f is structure-preserving, we have for all v, v′ ∈ V and λ ∈ K that f(v − v′) = 0 implies Vorlage:Einrücken If the difference of v and v′ is eliminated under f, then so is that of λv and λv′. In the same way, for v, v′, w, w′ ∈ V: if f(v − v′) = 0 and f(w − w′) = 0, then also Vorlage:Einrücken So the difference of v + w and v′ + w′ is also eliminated. The differences eliminated by f are themselves vectors in V. They are sent by f to the zero element 0_W of W, and thus the eliminated vectors lie in the preimage f⁻¹({0_W}). Conversely, any vector v ∈ f⁻¹({0_W}) can be written as a difference v = v − 0; that is, the difference between v and the zero vector is eliminated by f. The preimage f⁻¹({0_W}) measures exactly which differences of vectors (how much "information") are lost in the transport from V to W. Our considerations show that f⁻¹({0_W}) is even a subspace of V. We give a name to this subspace: the kernel of f.

Definition

The kernel of a linear map intuitively measures how much "intrinsic" information about vectors from V (differences of vectors or linear independence) is lost when applying the map. Mathematically, the kernel is the preimage of the zero vector. Mathe für Nicht-Freaks: Vorlage:Definition

In the derivation we claimed that the kernel of a linear map from V to W is a subspace of V. We will now prove this in detail.

Mathe für Nicht-Freaks: Vorlage:Satz

Examples

We determine the kernel of the examples from the introduction.

Vector is mapped to the sum of entries

We consider the map Vorlage:Einrücken The kernel of f consists of the vectors (x,y)^T ∈ ℝ² with 0 = f((x,y)^T) = x + y, i.e., y = −x. In other words, Vorlage:Einrücken Thus the kernel of f is a one-dimensional subspace of ℝ². More generally, for n ∈ ℕ we can consider the map Vorlage:Einrücken Again, by definition, a vector (x_1,…,x_n)^T ∈ ℝⁿ lies in the kernel of g if and only if 0 = g((x_1,…,x_n)^T) = x_1 + ⋯ + x_n holds. So we can freely choose x_1,…,x_{n−1} and then set x_n = −x_1 − ⋯ − x_{n−1}. Thus Vorlage:Einrücken Hence, the kernel of g is an (n−1)-dimensional subspace of ℝⁿ. It is also called a hyperplane in ℝⁿ.
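The hyperplane description of ker g can be checked numerically for, say, n = 4. This is a sketch assuming numpy is available; the basis vectors below follow the "choose x_1,…,x_{n−1} freely, set x_n to minus their sum" recipe from the text:

```python
import numpy as np

# g sums the entries of a vector in R^n; its kernel is the hyperplane
# {x in R^n : x_1 + ... + x_n = 0}.
def g(x):
    return np.sum(x)

n = 4
# Kernel vectors for n = 4: each has one free entry 1 and last entry -1.
basis = [np.array([1, 0, 0, -1]),
         np.array([0, 1, 0, -1]),
         np.array([0, 0, 1, -1])]

for v in basis:
    assert g(v) == 0          # each vector is eliminated by g

# The n-1 = 3 vectors are linearly independent, so ker g has dimension n-1.
M = np.stack(basis)
assert np.linalg.matrix_rank(M) == n - 1
```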

Rotation in ℝ²

We consider the rotation Vorlage:Einrücken Suppose (x,y)^T lies in the kernel of f, i.e., it holds that Vorlage:Einrücken From this we obtain x = y = 0. So only the zero vector lies in the kernel of f, and we have ker f = {(0,0)^T}.
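The trivial kernel of the rotation can also be seen from its matrix. A minimal sketch, assuming numpy and encoding the 90° counterclockwise rotation as a matrix ourselves (the article states the map via a template not shown here):

```python
import numpy as np

# Rotation of R^2 by 90 degrees counterclockwise, written as a matrix:
# (x, y) is sent to (-y, x).
R = np.array([[0.0, -1.0],
              [1.0,  0.0]])

# det R != 0, so R v = 0 forces v = 0: the kernel is trivial.
assert np.linalg.det(R) != 0

# Numerically: the only solution of R v = 0 is the zero vector.
v = np.linalg.solve(R, np.zeros(2))
assert np.allclose(v, np.zeros(2))
```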

ℝ² is rotated and embedded into ℝ³ Vorlage:Anker

Next we consider Vorlage:Einrücken As in the previous example, we determine the kernel by choosing any vector (x,y)^T ∈ ker f. Thus it holds that Vorlage:Einrücken Again it follows that x = y = 0, so that also for this map, ker f = {(0,0)^T} holds.

Derivatives of polynomials Vorlage:Anker

Finally, we consider a linear map that did not appear in the introduction: Vorlage:Einrücken which maps a real polynomial to its derivative. That is, a polynomial Vorlage:Einrücken with coefficients a_0, …, a_n is mapped to the polynomial Vorlage:Einrücken Graphically, we associate with p a polynomial p′ that indicates the slope of p at each point. From this information, we can still recover the shape of the polynomial (just as if we were given a stencil). However, we no longer know where it is positioned on the y-axis, because the information about the constant term of the polynomial is lost when taking the derivative. Polynomials that differ only by a shift along the y-axis can no longer be distinguished after differentiation. For example, both p = x² − x + 1 and q = x² − x + 42 have the derivative p′ = q′ = 2x − 1. So the map f sends them to the same polynomial.

The kernel of f thus contains exactly the constant polynomials: Vorlage:Einrücken The inclusion "⊇" is clear, because the derivative of a constant polynomial is always the zero polynomial. For the converse inclusion "⊆", we consider any polynomial p ∈ ker f and show that it is constant. We can always write such a polynomial as p = Σ_{i=0}^n a_i X^i for some n ∈ ℕ and certain coefficients a_0, …, a_n. Because p ∈ ker f, it holds that Vorlage:Einrücken and by comparing coefficients, we obtain a_1 = a_2 = ⋯ = a_n = 0. So p is constant. Vorlage:Todo
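The example p = x² − x + 1, q = x² − x + 42 can be replayed on coefficient vectors, where differentiation is again a linear map. A sketch assuming numpy's polynomial module (not part of the article itself):

```python
import numpy as np
from numpy.polynomial import polynomial as P

# Coefficient vectors [a_0, a_1, a_2] for p = x^2 - x + 1 and q = x^2 - x + 42.
p = np.array([1.0, -1.0, 1.0])
q = np.array([42.0, -1.0, 1.0])

# Both polynomials have the same derivative 2x - 1, so differentiation
# is not injective.
assert np.allclose(P.polyder(p), P.polyder(q))

# Their difference p - q = -41 is a constant polynomial: it lies in the
# kernel, since its derivative is the zero polynomial.
assert np.allclose(P.polyder(p - q), 0)
```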

Kernel and injectivity

In the derivation above, we saw that a linear map preserves all differences of vectors (i.e., no vector is eliminated) if and only if the kernel consists only of the zero vector. We also saw there that linearity implies: A linear map is injective if and only if no difference of vectors is eliminated. So we have the following theorem:

<section begin="InjektivitätSatz" />Mathe für Nicht-Freaks: Vorlage:Satz <section end="InjektivitätSatz" />

The larger the kernel is, the more differences between vectors are "eliminated" and the more the mapping "fails to be injective". The kernel is thus a measure of the "non-injectivity" of a linear map.
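The connection between a nontrivial kernel and "eliminated" differences can be illustrated with a concrete matrix. This is our own example, not taken from the article; it assumes numpy:

```python
import numpy as np

# A linear map R^3 -> R^3 given by a matrix with a nontrivial kernel.
A = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0],
              [0.0, 0.0, 0.0]])

# By rank-nullity, the kernel has dimension dim(domain) - rank(A) = 1,
# so the map is not injective.
kernel_dim = A.shape[1] - np.linalg.matrix_rank(A)
assert kernel_dim == 1

# k = (1, 1, -1)^T lies in the kernel, so x and x + k are distinct
# vectors with the same image: their difference k is "eliminated".
k = np.array([1.0, 1.0, -1.0])
x = np.array([2.0, 3.0, 4.0])
assert np.allclose(A @ k, 0)
assert np.allclose(A @ x, A @ (x + k))
```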

Injective maps and subspaces

In the introductory examples we conjectured that injective linear maps preserve "intrinsic" properties of vector spaces. By this, we mean properties that do not depend on the ambient vector space, such as the linear independence of vectors or vectors being distinct. The property of being a generating set can be lost under injective linear maps, as we have seen in the example of the rotated embedding of ℝ² into ℝ³: The map is injective, but the standard basis of ℝ² is not mapped to a generating set of ℝ³.

What exactly does it mean that a property of a family N = (v_i)_{i∈I} of vectors in V does not depend on the ambient space V? Often, properties of vectors from V (for example, linear independence) depend on the vector space structure of V, that is, on addition and scalar multiplication. To keep this dependence as small as possible, we restrict our attention to the smallest subspace of V containing N, that is, to span(N). Now we call a property of N intrinsic if it depends only on span(N) but not on V.

Mathe für Nicht-Freaks: Vorlage:Beispiel

What do intrinsic properties of a family of vectors have to do with injectivity? Let f: V → W be a linear map. Suppose f preserves intrinsic properties of vectors; that is, if a family N = (v_i)_{i∈I} in V has some intrinsic property, then its image f(N) = (f(v_i))_{i∈I} under f also has this property. Then f also preserves the property of vectors being distinct, since this is an intrinsic property. That means: if v, v′ ∈ V are distinct, i.e., v ≠ v′, then their images under f are also distinct, i.e., f(v) ≠ f(v′). So f is injective.

Conversely, if f is injective, then V is isomorphic to the subspace f(V) of W: If we restrict the target space of f to its image, we obtain an injective and surjective linear map f: V → f(V), that is, an isomorphism. In particular, for any family N in V, the subspace span(N) of V is isomorphic to f(span(N)). Thus the latter has the same properties as span(N), and hence f preserves intrinsic properties of subsets of V.

So we have seen that f:VW is injective if and only if f preserves intrinsic properties of subsets of V.

Kernel and linear independence

In the previous section we saw that the injective linear maps V → W are exactly those linear maps which preserve intrinsic properties of V. The linear independence of a family of vectors is such an intrinsic property, as it either holds for every choice of ambient space or for none.

So, injective linear maps should preserve linear independence of vectors, i.e., the image of linearly independent vectors is again linearly independent. Conversely, a linear map cannot be injective if it does not preserve the linear independence of vectors, since the intrinsic information of "being linearly independent" is lost.

Overall, we get the following theorem, which has already been proved in the article on monomorphisms:

Mathe für Nicht-Freaks: Vorlage:Satz

In particular, for any injective linear map f: V → W, the vector space f(V) is a dim(V)-dimensional subspace of W. Hence, in the finite-dimensional case, there cannot exist an injective linear map from V to W if dim(W) < dim(V). This has also already been shown in the article on monomorphisms.
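The dimension obstruction can be checked for a concrete map ℝ³ → ℝ² given by a 2×3 matrix. A sketch assuming numpy; the matrix entries are arbitrary:

```python
import numpy as np

# Any linear map R^3 -> R^2 is given by a 2x3 matrix A. Its rank is at
# most 2 < 3, so by rank-nullity the kernel has dimension at least 1,
# and the map cannot be injective.
A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

rank = np.linalg.matrix_rank(A)
kernel_dim = A.shape[1] - rank       # rank-nullity
assert kernel_dim >= 1

# Extract a kernel vector from the SVD: the rows of Vt beyond rank(A)
# span the kernel of A.
_, s, Vt = np.linalg.svd(A)
k = Vt[-1]
assert np.allclose(A @ k, 0, atol=1e-10)
```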

Kernel and linear systems Vorlage:Anker

The kernel of a linear map is an important concept in the study of systems of linear equations.

Let K be a field and let m, n ∈ ℕ. We consider a linear system of equations Vorlage:Einrücken with n variables x_1, …, x_n and m rows. Here a_ij, b_i ∈ K, where i ∈ {1,…,m} and j ∈ {1,…,n}. We can also write this system of equations using matrix multiplication: Vorlage:Einrücken where A ∈ K^{m×n}, x ∈ K^n and b ∈ K^m. We denote the set of solutions by Vorlage:Einrücken

Determining a solution to the linear system of equations Ax=b for a given right-hand side b is the same as finding a preimage of b under the linear map Vorlage:Einrücken

Vorlage:Todo The system of equations Ax = b has a solution if and only if the preimage f_A^{−1}(b) is not empty. In this case, we may ask whether there are multiple solutions, that is, whether the solution fails to be unique. In other words, we are interested in how many preimages b has under f_A.

By definition of injectivity, every point b ∈ K^m has at most one element in its preimage if and only if f_A is injective. This means that the linear system Ax = b has at most one solution for each b ∈ K^m, that is, |L(A,b)| ≤ 1. Because f_A is linear, injectivity is equivalent to ker(f_A) = {0}. So we can already state: Mathe für Nicht-Freaks: Vorlage:Satz Mathe für Nicht-Freaks: Vorlage:Hinweis

Even if f_A is not injective, i.e., ker(f_A) ≠ {0} holds, we can still say more about the set of solutions by exploiting the kernel: The difference of two vectors x and x′ which f_A maps to the same vector lies in the kernel of f_A. Therefore, the preimage of any b ∈ K^m under f_A can be written as Vorlage:Einrücken where x̂ is any element of f_A^{−1}(b). This is shown by the following theorem: Mathe für Nicht-Freaks: Vorlage:Satz We have thus extended the statement of the theorem above: the larger the kernel of f_A is, that is, the "less injective" the map x ↦ Ax is, the "less unique" are the solutions of Ax = b, if any exist. The set of solutions of a linear system Ax = b is the kernel of the induced linear map f_A shifted by a particular solution x̂. Furthermore, Vorlage:Einrücken The set of solutions of the homogeneous system Ax = 0 (that is, with right-hand side zero) is exactly the kernel of f_A. Mathe für Nicht-Freaks: Vorlage:Hinweis
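The structure "solution set = particular solution + kernel" can be verified on a small concrete system. This sketch assumes numpy; the system and the particular solution are our own example:

```python
import numpy as np

# An underdetermined system A x = b over R with a nontrivial kernel.
A = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0]])
b = np.array([3.0, 5.0])

# A particular solution x_hat, found by inspection for this example.
x_hat = np.array([3.0, 0.0, 5.0])
assert np.allclose(A @ x_hat, b)

# k spans ker(f_A): A k = 0.
k = np.array([1.0, -1.0, 1.0])
assert np.allclose(A @ k, 0)

# Every vector x_hat + t*k also solves A x = b: the solution set is
# the kernel shifted by the particular solution.
for t in (-2.0, 0.0, 7.5):
    assert np.allclose(A @ (x_hat + t * k), b)
```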

Exercises

<section begin=injektivität_und_dimension /> Mathe für Nicht-Freaks: Vorlage:Aufgabe<section end=injektivität_und_dimension /> <section begin=aufgabe_kern_bestimmen /> Mathe für Nicht-Freaks: Vorlage:Aufgabe

Mathe für Nicht-Freaks: Vorlage:Frage<section end=aufgabe_kern_bestimmen /> <section begin=kern_nilpotenter_endo /> Mathe für Nicht-Freaks: Vorlage:Aufgabe<section end=kern_nilpotenter_endo />

{{#invoke:Mathe für Nicht-Freaks/Seite|unten}}