Serlo: EN: Basis change via matrices

In this article, you will learn about basis change via matrices. Basis change matrices can be used to convert coordinates with respect to a given basis into coordinates with respect to another basis. This is particularly useful for matrices of linear maps, which are always taken with respect to two specific bases.

Derivation

We have seen in the article on bases that every finite-dimensional vector space has a basis. This means if $V$ is an $n$ -dimensional $K$ -vector space, then there is a basis $B = {b_{1}, \dots, b_{n}}$ of $V$ . Every vector $v \in V$ can therefore be written uniquely as a linear combination of the basis vectors $b_{1}, \dots, b_{n}$ , i.e. $v = \sum_{i = 1}^{n} λ_{i} b_{i}$ with unique $λ_{1}, \dots, λ_{n} \in K$ .

We also know that vector spaces usually have more than one basis. Let $C = {c_{1}, \dots, c_{n}}$ be a second basis of $V$ . Then we can also write $v$ uniquely as a linear combination of $c_{i}$ , i.e. $v = \sum_{i = 1}^{n} μ_{i} c_{i}$ with unique coefficients $μ_{1}, \dots, μ_{n} \in K$ .

We therefore have two representations of the vector $v$ . Using the basis $B$ we get the representation $v = \sum_{i = 1}^{n} λ_{i} b_{i}$ and using the basis $C$ we get $v = \sum_{i = 1}^{n} μ_{i} c_{i}$ .

How can we convert the basis representation with respect to $B$ of the vector $v$ into the representation with respect to $C$ ?

This question is particularly interesting in the context of matrices of linear maps, as we will see below in the section Application of basis change via matrices. Mapping matrices allow us to calculate with coordinates instead of vectors of $V$ . However, the coordinates of a vector always depend on the chosen basis in $V$ . We want a simple way to convert the coordinates of any vector in $V$ with respect to a basis $B$ into coordinates with respect to another basis $C$ .

The situation in $K^{n}$

To answer this question, we start with a simpler special case. We consider $K^{n}$ as a vector space and set $B = (e_{1}, \dots, e_{n})$ as the (ordered) standard basis. Let further $C = (c_{1}, \dots, c_{n})$ be any ordered basis of $K^{n}$ . Since matrices of linear maps depend on the order of the basis vectors, we have to use ordered bases $B$ and $C$ .

Let $v = (x_{1}, \dots, x_{n})^{T} = \sum_{i = 1}^{n} x_{i} e_{i}$ be a vector for whom we know the coordinates with respect to the standard basis $B$ . The vector $v \in K^{n}$ can be written in the basis $C$ as $v = λ_{1} c_{1} + \dots + λ_{n} c_{n}$ for unique $λ_{1}, \dots, λ_{n} \in K$ . How can we calculate the coordinates $λ_{1}, \dots, λ_{n} \in K$ of $v$ with respect to $C$ simply from the coordinates $x_{1}, \dots, x_{n}$ of $v$ with respect to the standard basis $B$ ?

To do this, we need to describe the mapping $K^{n} \to K^{n}$ , which maps each vector $v = (x_{1}, . . ., x_{n})^{T} \in K^{n}$ to its coordinate vector $(λ_{1}, \dots, λ_{n})^{T} \in K^{n}$ with respect to $C$ . This is done by the coordinate mapping $k_{C} : K^{n} \to K^{n}$ , which is a linear map that we know from the article on isomorphims.

In order to describe $k_{C}$ , we calculate its matrix $M_{S t d}^{S t d} (k_{C})$ with respect to the standard basis $B = (e_{1}, \dots, e_{n})$ . By using matrix-vector multiplication in $K^{n}$ , we then obtain the coordinate vector $(λ_{1}, \dots, λ_{n})^{T}$ by multiplying $v = (x_{1}, \dots, x_{n})^{T}$ from the left by $M_{S t d}^{S t d} (k_{C})$ .

To calculate the matrix $M_{S t d}^{S t d} (k_{C})$ , we need to determine $k_{C} (e_{1}), \dots, k_{C} (e_{n})$ . These will then be the columns of $M_{S t d}^{S t d} (k_{C})$ . We are therefore looking for the coordinates of $e_{1}, \dots, e_{n}$ with respect to $C$ , so we have to write these as a linear combination of vectors in $C$ . This gives us $n$ equations Vorlage:Einrücken where $a_{i j}$ are the coordinates we are looking for. The coefficients $a_{i j}$ can be determined by solving a linear system of equations. Mathe für Nicht-Freaks: Vorlage:Beispiel

Then $k_{C} (e_{j}) = (a_{1 j}, a_{2 j}, \dots, a_{n j})^{T}$ for $j = 1, \dots, n$ . This gives us the matrix Vorlage:Einrücken We obtain $M_{S t d}^{S t d} (k_{C}) y = k_{C} (y)$ for all $y \in K^{n}$ . The required coefficients $λ_{1}, \dots, λ_{n}$ are therefore obtained by Vorlage:Einrücken

Mathe für Nicht-Freaks: Vorlage:Beispiel

Generalization to arbitrary finite-dimensional vector spaces

In a general finite-dimensional vector space $V$ , unlike in $K^{n}$ , there is no standard basis. In this situation, we have two ordered bases $B = (b_{1}, \dots, b_{n})$ and $C = (c_{1}, \dots, c_{n})$ . Usually, we are then given an arbitrary vector $v \in V$ as a linear combination $v = x_{1} b_{1} + \dots + x_{n} b_{n}$ with respect to the basis $B$ with $x_{1}, \dots, x_{n} \in K$ . The coefficients $x_{1}, \dots, x_{n}$ are also called the coordinates of $v$ with respect to $B$ . Correspondingly, the coordinates with respect to $C$ are the scalars $λ_{1}, \dots, λ_{n} \in K$ with $v = λ_{1} c_{1} + \dots + λ_{n} c_{n}$ .

We are looking for a method to convert the coordinates $x_{1}, \dots, x_{n}$ with respect to $B$ of any vector $v \in V$ into the coordinates $λ_{1}, \dots, λ_{n}$ with respect to $C$ . For this, we need a mapping $K^{n} \to K^{n}$ , which sends $(x_{1}, \dots, x_{n})^{T}$ to $(λ_{1}, \dots, λ_{n})^{T}$ .

We already know the coordinate mappings $k_{B} : V \to K^{n}$ with $k_{B} (v) = (x_{1}, \dots, x_{n})^{T} \in K^{n}$ and $k_{C} : V \to K^{n}$ with $k_{C} (v) = (λ_{1}, \dots, λ_{n})^{T}$ . From $(x_{1}, \dots, x_{n})^{T} \in K^{n}$ we want to obtain the vector $(λ_{1}, \dots, λ_{n})^{T} \in K^{n}$ . The coordinate mappings are isomorphisms. So $k_{B}^{- 1} : K^{n} \to V$ maps the vector $(x_{1}, \dots, x_{n})^{T}$ to $v$ and $k_{C} : V \to K^{n}$ maps $v$ to $(λ_{1}, \dots, λ_{n})^{T}$ . If we first execute $k_{B}^{- 1}$ and then $k_{C}$ , we obtain a mapping that sends $(x_{1}, \dots, x_{n})^{T}$ to $(λ_{1}, \dots, λ_{n})^{T}$ .

Vorlage:Anker Our desired transformation is therefore realized by the linear map $k_{C} \circ k_{B}^{- 1} : K^{n} \to K^{n}$ . As above for the situation in $K^{n}$ , we can then determine the matrix of this linear map in $K^{n}$ with respect to the standard basis. This matrix is given by $M_{S t d}^{S t d} (k_{C} \circ k_{B}^{- 1})$ . If we remember the article on matrices of linear maps, however, this matrix is just $M_{C}^{B} ({id}_{V})$ , because $k_{C} \circ k_{B}^{- 1} = k_{C} \circ {id}_{V} \circ k_{B}^{- 1}$ .

It also makes intuitive sense that the matrix executing the basis change from $B$ to $C$ is given exactly by $M_{C}^{B} ({id}_{V})$ representing the identity from basis $B$ to $C$ . This is because, if we multiply the coordinate vector $k_{B} (v)$ of $v \in V$ with respect to $B$ from the left with $M_{C}^{B} ({id}_{V})$ , then we obtain exactly the coordinate vector of ${id}_{V} (v) = v$ with respect to $C$ , just by definition of the representing matrix. That is, Vorlage:Einrücken for all $v \in V$ . The matrix $M_{C}^{B} ({id}_{V})$ therefore converts coordinates with respect to $B$ into coordinates with respect to $C$ . This is exactly what a basis change matrix does.

Definition

Mathe für Nicht-Freaks: Vorlage:Definition The basis change matrix has many other names. It is also referred to in the literature as a transition matrix, basis transition matrix, transformation matrix or coordinate change matrix. Mathe für Nicht-Freaks: Vorlage:Warnung

Application of basis change via matrices

The problem with matrices of linear maps

We can find a matrix $M_{C}^{B} (f)$ for every linear map $f : V \to W$ between two finite-dimensional vector spaces, with respect to bases $B$ and $C$ . However, this matrix depends on $B$ and $C$ , and their order. If we choose other bases $B^{'}$ or $C^{'}$ , we will very likely get a different matrix. We can see this in the following example: Mathe für Nicht-Freaks: Vorlage:Beispiel

Solution of this problem

Consider a linear map $f : V \to W$ and two ordered bases $B$ and $B^{'}$ of $V$ as well as $C$ and $C^{'}$ of $W$ . We are asking now: How can we convert the matrix $M_{C}^{B} (f)$ into $M_{C^{'}}^{B^{'}} (f)$ ?

Mathe für Nicht-Freaks: Vorlage:Satz

In the following, we will consider why the formula in this theorem is correct and how we arrived at it.

From the definition of the matrix of a linear map we know that for all vectors $x \in K^{n}$ , we have $M_{C}^{B} (f) x = k_{C} \circ f \circ k_{B}^{- 1} (x)$ and $M_{C^{'}}^{B^{'}} (f) x = k_{C^{'}} \circ f \circ k_{B^{'}}^{- 1} (x)$ . We can visualize this equation in a diagram:

In these two diagrams, it doesn't matter which way you go. For example, it does not matter whether we use $f$ to go directly from $V$ to $W$ or take the detour via $K^{n}$ and $K^{m}$ . If the same map is constructed along each path, this is referred to as a commutative diagram.

We can join the two diagrams together:

This diagram is also commutative. That means, if you have a fixed start and end point, it still doesn't matter which path you take in the diagram. If we start at the top left at $K^{n}$ , it doesn't matter which path we use to get to $K^{m}$ at the bottom left. We can get from $K^{n}$ to $K^{m}$ via $x \mapsto M_{C^{'}}^{B^{'}} (f) x$ , or using first $k_{B} \circ k_{B^{'}}^{- 1} : K^{n} \to K^{n}$ , then $x \mapsto M_{C}^{B} (f) x$ and finally $k_{C^{'}} \circ k_{C}^{- 1} : K^{m} \to K^{m}$ .

Consequently, the map $K^{n} \to K^{m}, x \mapsto M_{C^{'}}^{B^{'}} (f) x$ is equal to the combination of the maps $k_{B} \circ k_{B^{'}}^{- 1}$ , $x \mapsto M_{C}^{B} (f) x$ , and $k_{C^{'}} \circ k_{C}^{- 1}$ . We have now seen that the $x \mapsto M_{C}^{B} (f) x$ can be transformed into the map $x \mapsto M_{C^{'}}^{B^{'}} (f) x$ . Originally, however, we wanted to transform the matrix $M_{C}^{B} (f)$ into the matrix $M_{C^{'}}^{B^{'}} (f)$ . How do we get from the map $K^{n} \to K^{m}, x \mapsto M_{C^{'}}^{B^{'}} (f) x$ back to the matrix $M_{C^{'}}^{B^{'}} (f) \in K^{m \times n}$ ?

The matrix $M_{C^{'}}^{B^{'}} (f)$ looks complicated. We therefore consider how we can answer this question for a general matrix $A \in K^{m \times n}$ . We consider the linear map $L_{A} : K^{n} \to K^{m}, x \mapsto A x$ associated with $A$ . The matrix of $L_{A}$ with respect to the standard bases of $K^{n}$ and $K^{m}$ is again $A$ . Let us now plug in the matrix $M_{C^{'}}^{B^{'}} (f)$ for $A$ . The matrix of the linear map $x \mapsto M_{C^{'}}^{B^{'}} (f) x$ with respect to the standard bases is exactly $M_{C^{'}}^{B^{'}} (f)$ .

As we have already seen, the map $x \mapsto M_{C^{'}}^{B^{'}} (f) x$ is equal to the combination of the three maps $k_{B} \circ k_{B^{'}}^{- 1}$ , $x \mapsto M_{C}^{B} (f) x$ , and $k_{C^{'}} \circ k_{C}^{- 1}$ . Therefore, the matrix of the combination of $k_{B} \circ k_{B^{'}}^{- 1}$ , $x \mapsto M_{C}^{B} (f) x$ , and $k_{C^{'}} \circ k_{C}^{- 1}$ corresponds to $M_{C^{'}}^{B^{'}} (f)$ with regard to the standard bases.

However, we can also determine the matrix of the concatenation in another way. In the article on matrix multiplication, we saw that concatenation between linear maps correspond exactly to the multiplication of the respective matrices. Therefore, we write down the matrices of the concatenated linear maps individually and then multiply them.

As we have already seen for $M_{C^{'}}^{B^{'}} (f)$ , the matrix of $x \mapsto M_{C}^{B} (f) x$ with respect to the standard bases of $K^{n}$ and $K^{m}$ is again $M_{C}^{B} (f)$ .
We have already derived the matrix of $k_{C^{'}} \circ k_{C}^{- 1}$ above; it is $M_{C^{'}}^{C} (id)$ . This is exactly the basis change matrix $T_{C^{'}}^{C}$ .
Similarly, the matrix of $k_{B} \circ k_{B^{'}}^{- 1}$ is given by the basis change matrix $T_{B}^{B^{'}} = M_{B}^{B^{'}} (id)$ .

If we multiply these three matrices, we obtain $M_{C^{'}}^{B^{'}} (f)$ : Vorlage:Einrücken So $M_{C^{'}}^{B^{'}} (f)$ can be calculated from $M_{C}^{B} (f)$ by left multiplication with $T_{C^{'}}^{C}$ and right multiplication with $T_{B}^{B^{'}}$ .

Example for a basis change

We now know, how we can convert matrices of a linear map with respect to different bases into each other. Let's look at the example above again. We consider the linear map Vorlage:Einrücken as well as the ordered bases $B = (e_{1}, e_{2})$ , $C = ((1, 1)^{T}, (1, 0)^{T})$ , and $C^{'} = ((1, 2)^{T}, (1, 0)^{T})$ . We have already calculated the matrix $M_{C}^{B} (f)$ : Vorlage:Einrücken We want to determine $M_{C^{'}}^{B} (f)$ by matrix multiplication, i.e., by $M_{C^{'}}^{B} (f) = T_{C^{'}}^{C} M_{C}^{B} (f) T_{B}^{B}$ . We have to determine $T_{B}^{B}$ and $T_{C^{'}}^{C}$ . Now, $T_{B}^{B} = I_{2}$ , since the basis $B$ does not change. Now let us turn to computing the basis change matrix $T_{C^{'}}^{C}$ : We know that $T_{C^{'}}^{C} = M_{C^{'}}^{C} (id)$ . In order to determine this matrix, we need to express the basis vectors of $C$ in the basis $C^{'}$ :

Vorlage:Einrücken Hence, Vorlage:Einrücken Therefore Vorlage:Einrücken You may convince yourself that this result agrees with the result above.

Examples

Basis change for a matrix of a linear map

Consider the bases Vorlage:Einrücken of $ℝ^{2}$ , as well as the bases Vorlage:Einrücken of $ℝ^{3}$ . Let $f : ℝ^{2} \to ℝ^{3}$ be a map with the following matrix with respect to $B$ and $C$ : Vorlage:Einrücken

We want to determine the matrix of $f$ with respect to the bases $B^{'}$ and $C^{'}$ . This can be done by matrix multiplication $M_{C^{'}}^{B^{'}} (f) = T_{C^{'}}^{C} M_{C}^{B} (f) T_{B}^{B^{'}}$ . To do so, we must first calculate the basis change matrices $T_{B}^{B^{'}}$ and $T_{C^{'}}^{C}$ . Mathe für Nicht-Freaks: Vorlage:Beispiel Mathe für Nicht-Freaks: Vorlage:Beispiel

Mathe für Nicht-Freaks: Vorlage:Beispiel

Exercises

Mathe für Nicht-Freaks: Vorlage:Gruppenaufgabe

Serlo: EN: Basis change via matrices

Inhaltsverzeichnis

Derivation

The situation in $K^{n}$

Generalization to arbitrary finite-dimensional vector spaces

Definition

Application of basis change via matrices

The problem with matrices of linear maps

Solution of this problem

Example for a basis change

Examples

Basis change for a matrix of a linear map

Exercises

Navigationsmenü

Serlo: EN: Basis change via matrices

Derivation

The situation in Kn

Generalization to arbitrary finite-dimensional vector spaces

Definition

Application of basis change via matrices

The problem with matrices of linear maps

Solution of this problem

Example for a basis change

Examples

Basis change for a matrix of a linear map

Exercises

Navigationsmenü

Suche

The situation in $K^{n}$