A calligraphic letter labelling a specific basis. B={b1,b2,…,bn} names a choice of n linearly independent vectors that span the space. Used to distinguish one basis from another when multiple bases are in play at once.
$[\mathbf{v}]_\mathcal{B}$v in basis B — coordinate vector
The coordinate representation of v with respect to basis B. If v=c1b1+c2b2+⋯+cnbn, then [v]B=c1⋮cn. The same geometric vector v has different coordinate vectors in different bases — this is the core idea of the whole chapter.
$P_{\mathcal{B}}$P sub B — change-of-basis matrix
The matrix whose columns are the basis vectors of B written in standard coordinates. Converts B-coordinates to standard coordinates: v=PB[v]B. Its inverse goes the other direction: [v]B=PB−1v.
$[T]_\mathcal{B}$T in basis B — matrix representation
The matrix of a linear transformation T expressed relative to basis B. The same transformation looks like a different matrix when you change the basis — and choosing the right basis can make the matrix diagonal (Chapter 7 diagonalisation is a special case of this).
$P^{-1}AP$P inverse A P — similarity transform
A similarity transformation. Matrices A and P−1AP represent the same linear transformation in different bases. They share eigenvalues, determinant, and trace — all invariants that do not depend on the choice of basis.
01 · What Changes of Basis Mean
You already know that a vector v∈R2 has coordinates (3,5) — meaning 3 steps in the x-direction and 5 steps in the y-direction. But those coordinates depend entirely on the choice of axes. Rotate the axes 45°, and the same physical arrow has completely different coordinate numbers. The arrow does not change; the measuring frame does.
This is the heart of change of basis: one geometric object, infinitely many coordinate descriptions. The standard basis {e1,e2} is just one choice. Any two linearly independent vectors form a valid alternative.
Definition — Coordinates in a Basis
Let B={b1,…,bn} be a basis for Rn. For any v∈Rn there exist unique scalars c1,…,cn such that:
v=c1b1+c2b2+⋯+cnbn
The coordinate vector of v in basis B is:
[v]B=c1c2⋮cn
ci — the scalar coefficient on the i-th basis vector. Uniqueness is guaranteed because B is a basis (linearly independent and spanning).
✓ Example — Same Vector, Two Descriptions
Let B={b1=(11),b2=(1−1)} and v=(31).
In standard coordinates: v=(31).
In B-coordinates: solve c1(11)+c2(1−1)=(31).
Row 1: c1+c2=3. Row 2: c1−c2=1. Adding: 2c1=4⇒c1=2. Then c2=1.
Therefore [v]B=(21). The same arrow has coordinates (3,1) in the standard frame and (2,1) in the B-frame.
❌ Common Mistake — Coordinates Depend on Basis
Writing [v]B=(31) by simply copying the standard-coordinate entries. The standard coordinates of v are its coefficients with respect to {e1,e2}, not B. Unless B happens to be the standard basis, [v]B=v. You must solve the linear system PB[v]B=v every time.
02 · The Change-of-Basis Matrix
Solving a linear system each time is slow. The change-of-basis matrix packages the conversion into a single matrix multiplication.
Definition — Change-of-Basis Matrix
Given a basis B={b1,…,bn} for Rn, the change-of-basis matrix is:
PB=(b1b2⋯bn)
the n×n matrix whose columns are the basis vectors in standard coordinates.
To convert B-coordinates to standard:v=PB[v]B
To convert standard to B-coordinates:[v]B=PB−1v
PB is always invertible because the columns are linearly independent (they form a basis).
Step-by-step — Converting $\mathbf{v}=\begin{pmatrix}4\\2\end{pmatrix}$ to $\mathcal{B}$-coordinates where $\mathcal{B}=\left\{\begin{pmatrix}2\\1\end{pmatrix},\begin{pmatrix}0\\1\end{pmatrix}\right\}$
1
Write the change-of-basis matrix: place the basis vectors as columns.
PB=(2101)
Column 1 is b1=(2,1)T, column 2 is b2=(0,1)T.
2
Compute PB−1: for a 2×2 matrix (acbd), the inverse is ad−bc1(d−c−ba).
det(PB)=2⋅1−0⋅1=2. Therefore:
PB−1=21(1−102)
The 21 comes from dividing every entry by det=2.
3
Multiply PB−1v: convert v=(42) to B-coordinates.
Row 1: 1(4)+0(2)=4, then 24=2. Row 2: −1(4)+2(2)=−4+4=0, then 20=0.
4
Verify:PB[v]B=(2101)(20)=(42)=v ✓. The coordinates (2,0) mean v=2b1+0b2 — the vector v is simply twice b1.
03 · Changing Between Two Non-Standard Bases
When both the source and target are non-standard bases, the conversion routes through standard coordinates.
Definition — Basis Change Between $\mathcal{B}$ and $\mathcal{C}$
Let B and C be two bases for Rn with change-of-basis matrices PB and PC. Then:
[v]C=PC−1PB[v]B
The matrix PC−1PB is the change-of-basis matrix from B to C. Reading right to left: first convert B-coordinates to standard (PB), then standard to C-coordinates (PC−1).
Standard Basis Is the Relay Station
Every coordinate conversion routes through standard coordinates. You never need a direct formula between arbitrary bases — compose the two matrices PC−1 and PB and the routing happens automatically.
04 · Matrices Under Change of Basis
When a linear transformation T:Rn→Rn has standard matrix A, its matrix in a different basis B is related by a similarity transformation.
Definition — Similarity Transformation
Matrices A and B are similar if there exists an invertible matrix P such that:
B=P−1AP
B is the matrix of the same linear transformation, but expressed in the basis whose vectors are the columns of P. Similar matrices represent the same transformation and share: eigenvalues, determinant, trace, rank, and characteristic polynomial.
✓ Example — Diagonalisation Is Change of Basis
Diagonalising A=PDP−1 is exactly a change of basis. In the basis of eigenvectors (columns of P), the transformation becomes the diagonal matrix D=P−1AP. The transformation scales each eigenvector by its eigenvalue — a much simpler picture than the off-diagonal mixing in A.
This is why diagonalisation matters: it finds the natural coordinate system for A.
Step-by-step — Finding $[T]_\mathcal{B}$ for $T(\mathbf{x})=A\mathbf{x}$ with $A=\begin{pmatrix}3&1\\0&2\end{pmatrix}$ in basis $\mathcal{B}=\left\{\begin{pmatrix}1\\0\end{pmatrix},\begin{pmatrix}1\\1\end{pmatrix}\right\}$
1
Write PB: columns are the basis vectors in standard order.
In basis B, the transformation matrix is upper triangular — same eigenvalues 3 and 2, but simpler off-diagonal structure.
❌ What Breaks — Matrix Change of Basis Is Not Transpose
A common error is computing PTAP instead of P−1AP. This equals the similarity transformation only when P is orthogonal (PT=P−1). For a general change-of-basis matrix, PT=P−1, so PTAP produces a wrong result that does not represent the transformation in the new basis.
05 · Quant Application — PCA as Change of Basis
Principal Component Analysis is exactly a change of basis for a covariance matrix. Given a sample covariance matrix Σ∈Rp×p (symmetric, positive semi-definite), the spectral theorem guarantees an orthogonal diagonalisation:
Σ=QΛQT
where Q has orthonormal eigenvectors as columns and Λ=diag(λ1,…,λp) with λ1≥λ2≥⋯≥λp≥0.
Changing to the eigenvector basis means [Σ]Q=Q−1ΣQ=QTΣQ=Λ. In the eigenvector coordinate system, all assets' risk contributions are diagonal — uncorrelated. The original covariance matrix had cross-correlation terms; in the PC basis those vanish.
The first column of Q points in the direction of maximum variance (λ1 largest). Projecting the return data onto Q gives the principal component scores — a new set of uncorrelated factors that explain variance in decreasing order.
06 · Practice Exercises
EXERCISE 9.1
Write the change-of-basis matrix PB by placing b1 and b2 as columns. Then apply PB−1 to v.
PB=(1223).
det(PB)=1(3)−2(2)=3−4=−1.
PB−1=−11(3−2−21)=(−322−1).
[v]B=(−322−1)(57)=(−15+1410−7)=(−13).
Verify: (1223)(−13)=(−1+6−2+9)=(57)=v ✓.
Let B={(12),(23)} and v=(57). Find [v]B.
EXERCISE 9.2
Compute the change-of-basis matrix from B to C as PC−1PB. Apply it to the given coordinates.
PB=(2001) and PC=(1012).
det(PC)=2, so PC−1=21(20−11).
[v]C=PC−1PB[v]B=21(20−11)(2001)(31).
PC−1PB=21(40−11).
21(40−11)(31)=21(12−11)=(11/21/2).
Let B={(20),(01)} and C={(10),(12)}. Given [v]B=(31), find [v]C.
EXERCISE 9.3
Form P from the eigenvectors and compute P−1AP. The result should be diagonal.
Eigenvalues of A=(4123): p(λ)=(4−λ)(3−λ)−2=λ2−7λ+10=(λ−5)(λ−2), so λ1=5, λ2=2.
For λ=5: (A−5I)=(−112−2); eigenvector v1=(21).
For λ=2: (A−2I)=(2121); eigenvector v2=(−11).
P=(21−11), det(P)=3, P−1=31(1−112).
P−1AP=(5002) ✓ — diagonal, eigenvalues on diagonal.
For A=(4123), find the change-of-basis matrix P to the eigenvector basis and compute [T]P=P−1AP. Verify the result is diagonal.
EXERCISE 9.4
Similar matrices satisfy B=P−1AP. They share eigenvalues because the characteristic polynomial is invariant: det(B−λI)=det(P−1(A−λI)P)=det(A−λI).
Shared eigenvalues:det(P−1AP−λI)=det(P−1(A−λI)P)=det(P−1)det(A−λI)det(P)=det(A−λI) since det(P−1)det(P)=1. Same characteristic polynomial ⇒ same eigenvalues.
Shared trace:tr(P−1AP)=tr(APP−1)=tr(A) by the cyclic property of trace.
Shared rank:P−1AP and A have the same null space dimension (multiplying by invertible matrices preserves rank).
These quantities are basis-independent invariants of the linear transformation itself.
Prove that similar matrices A and B=P−1AP share the same eigenvalues, determinant, trace, and rank. For each, state which property of det or tr makes the proof work.
EXERCISE 9.5
A matrix A is diagonalisable ⟺ it has n linearly independent eigenvectors. The change-of-basis matrix P then diagonalises it. If eigenvectors span Rn, write them as columns of P.
Since geometric multiplicity (1) < algebraic multiplicity (2), the matrix is not diagonalisable — there is no basis of eigenvectors. No change-of-basis matrix P can produce a diagonal matrix P−1AP.
The best achievable is the Jordan form (2012) — which is already the Jordan normal form of A.
The matrix A=(2012) has eigenvalue λ=2 with algebraic multiplicity 2. Determine whether A is diagonalisable (i.e., whether a change-of-basis matrix to a diagonal representation exists) and explain why or why not.
EXERCISE 9.6
The covariance matrix Σ is symmetric, so its eigenvectors are orthogonal. The change of basis QTΣQ=Λ transforms to principal component coordinates. Portfolio variance in PC coordinates is wPCTΛwPC=∑iλiwi2.
Σ=(4221).
Eigenvalues: p(λ)=(4−λ)(1−λ)−4=λ2−5λ=λ(λ−5), so λ1=5, λ2=0.
For λ=5: (Σ−5I)v=0 gives (−122−4)v=0; eigenvector v1=51(21).
For λ=0: Σv=0 gives v2=51(−12).
Q=51(21−12), Λ=(5000).
λ2=0 means the portfolio has a zero-variance direction — the assets are perfectly collinear. In the PC basis, wPC=QTw; all portfolio variance concentrates in PC1. Holding the λ2=0 direction incurs no variance — it is the risk-free combination of these two assets.
A two-asset portfolio has covariance matrix Σ=(4221). Find the orthogonal change-of-basis matrix Q that diagonalises Σ (i.e., QTΣQ=Λ). Interpret the result: what does λ2=0 mean for portfolio construction?
07 · Chapter Summary
Concept
Formula / Rule
Coordinate vector
[v]B=(c1,…,cn)T where v=∑cibi
Change-of-basis matrix
PB = columns are basis vectors in standard coords
Standard from B
v=PB[v]B
B from standard
[v]B=PB−1v
B to C
[v]C=PC−1PB[v]B
Similarity transform
[T]B=P−1AP; same eigenvalues, det, trace
Diagonalisation
Special case: P = eigenvector matrix; [T]P=Λ
PCA
Covariance Σ=QΛQT; PC basis decorrelates risk
Next: Chapter 10 — Inner Products & Orthogonality generalises the dot product to abstract vector spaces and establishes the precise meaning of "angle" and "perpendicularity" in any dimension.