Chapter 06

Eigenvalues & Eigenvectors

00 — Symbol Glossary


01 — Eigenvalue–Eigenvector Definition

Definition

Let ARn×nA\in\mathbb{R}^{n\times n}. A scalar λC\lambda\in\mathbb{C} is an eigenvalue of AA if there exists a nonzero vector vRn\mathbf{v}\in\mathbb{R}^n (or Cn\mathbb{C}^n) such that

Av=λvA\mathbf{v}=\lambda\mathbf{v}

Such a vector v0\mathbf{v}\neq\mathbf{0} is called an eigenvector corresponding to λ\lambda.

Rearranging: (AλI)v=0(A-\lambda I)\mathbf{v}=\mathbf{0}. This system has a nonzero solution if and only if AλIA-\lambda I is singular, i.e.

det(AλI)=0\det(A-\lambda I)=0

Note

Every scalar multiple of an eigenvector is also an eigenvector for the same eigenvalue. The set of all eigenvectors for λ\lambda (together with 0\mathbf{0}) forms a subspace called the eigenspace EλE_\lambda.

Example

If AA represents a stretch by factor 3 along the xx-axis and factor 1 along the yy-axis, then the xx-axis direction is an eigenvector with λ=3\lambda=3 and the yy-axis direction is an eigenvector with λ=1\lambda=1.

Common mistake

Wrong: v=0\mathbf{v}=\mathbf{0} satisfies A0=λ0A\mathbf{0}=\lambda\mathbf{0} for any λ\lambda, so 0\mathbf{0} is an eigenvector.
Why it happens: The equation is satisfied, so it feels valid.
Correct: Eigenvectors must be nonzero by definition. The zero vector is excluded because it carries no directional information.
Check: Any definition of eigenvector explicitly states v0\mathbf{v}\neq\mathbf{0}.


02 — The Characteristic Equation

Definition

The characteristic polynomial of ARn×nA\in\mathbb{R}^{n\times n} is

p(λ)=det(AλI)p(\lambda)=\det(A-\lambda I)

It is a degree-nn polynomial in λ\lambda. The eigenvalues of AA are the roots of p(λ)=0p(\lambda)=0 (the characteristic equation).

For a 2×22\times2 matrix A=(abcd)A=\begin{pmatrix}a&b\\c&d\end{pmatrix}:

p(λ)=det(aλbcdλ)=(aλ)(dλ)bc=λ2tr(A)λ+det(A)p(\lambda)=\det\begin{pmatrix}a-\lambda&b\\c&d-\lambda\end{pmatrix}=(a-\lambda)(d-\lambda)-bc=\lambda^2-\text{tr}(A)\lambda+\det(A)

Find eigenvalues of $A=\begin{pmatrix}4&1\\2&3\end{pmatrix}$

AλI=(4λ123λ)A-\lambda I=\begin{pmatrix}4-\lambda&1\\2&3-\lambda\end{pmatrix} — subtract λ\lambda from each diagonal entry 44 and 33.

p(λ)=det(4λ123λ)=(4λ)(3λ)12p(\lambda)=\det\begin{pmatrix}4-\lambda&1\\2&3-\lambda\end{pmatrix}=(4-\lambda)(3-\lambda)-1\cdot2 =(127λ+λ2)2=λ27λ+10=(12-7\lambda+\lambda^2)-2=\lambda^2-7\lambda+10 — the 1212 comes from 434\cdot3; the 7λ7\lambda from (4λ+3λ)-(4\lambda+3\lambda); the 2-2 from (12)-(1\cdot2).

λ27λ+10=0    (λ5)(λ2)=0\lambda^2-7\lambda+10=0 \implies (\lambda-5)(\lambda-2)=0 — factor or use the quadratic formula; roots are λ=5\lambda=5 and λ=2\lambda=2.

tr(A)=4+3=7=5+2\text{tr}(A)=4+3=7=5+2\,\checkmark; det(A)=4312=10=52\det(A)=4\cdot3-1\cdot2=10=5\cdot2\,\checkmark.

Note

Trace = sum of eigenvalues and determinant = product of eigenvalues (both with algebraic multiplicity). These are fast sanity checks.


03 — Finding Eigenvectors

Once you have an eigenvalue λk\lambda_k, find Eλk=ker(AλkI)E_{\lambda_k}=\ker(A-\lambda_k I) by row-reducing (AλkI)v=0(A-\lambda_k I)\mathbf{v}=\mathbf{0}.

Find eigenvectors of $A=\begin{pmatrix}4&1\\2&3\end{pmatrix}$ for $\lambda=5$ and $\lambda=2$

A5I=(1122)A-5I=\begin{pmatrix}-1&1\\2&-2\end{pmatrix} — subtract 55 from each diagonal: 45=14-5=-1 and 35=23-5=-2.

R2R2+2R1R_2\leftarrow R_2+2R_1: (1100)\begin{pmatrix}-1&1\\0&0\end{pmatrix}2+2(1)=02+2(-1)=0; 2+2(1)=0-2+2(1)=0. One free variable (v2=tv_2=t).

From v1+v2=0-v_1+v_2=0: v1=v2=tv_1=v_2=t. Eigenvector: t(11)t\begin{pmatrix}1\\1\end{pmatrix}, t0t\neq0.

A2I=(2121)A-2I=\begin{pmatrix}2&1\\2&1\end{pmatrix}42=24-2=2; 32=13-2=1.

R2R2R1R_2\leftarrow R_2-R_1: (2100)\begin{pmatrix}2&1\\0&0\end{pmatrix} — one free variable (v2=sv_2=s).

From 2v1+v2=02v_1+v_2=0: v1=v2/2=s/2v_1=-v_2/2=-s/2. Eigenvector: s(12)s\begin{pmatrix}-1\\2\end{pmatrix} (set s=2s=2 for integer entries), s0s\neq0.

Common mistake

Wrong: "the eigenvector for λ=5\lambda=5 is (11)\begin{pmatrix}1\\1\end{pmatrix}."
Why it happens: We computed t=1t=1 for convenience.
Correct: the eigenspace is E5=span ⁣{(11)}E_5=\text{span}\!\left\{\begin{pmatrix}1\\1\end{pmatrix}\right\} — any nonzero scalar multiple is equally valid.
Check: verify A(2)(11)=5(2)(11)A\cdot(2)\begin{pmatrix}1\\1\end{pmatrix}=5\cdot(2)\begin{pmatrix}1\\1\end{pmatrix} also holds.


04 — Algebraic and Geometric Multiplicity

Definition

Let λk\lambda_k be an eigenvalue of AA.

  • Algebraic multiplicity ma(λk)m_a(\lambda_k): the multiplicity of λk\lambda_k as a root of p(λ)=det(AλI)p(\lambda)=\det(A-\lambda I).
  • Geometric multiplicity mg(λk)m_g(\lambda_k): the dimension of the eigenspace Eλk=ker(AλkI)E_{\lambda_k}=\ker(A-\lambda_k I).

It always holds that 1mg(λk)ma(λk)1\leq m_g(\lambda_k)\leq m_a(\lambda_k).

If mg(λk)<ma(λk)m_g(\lambda_k)<m_a(\lambda_k) for any λk\lambda_k, the matrix is defective — it cannot be diagonalised.

Example

A=(2102)A=\begin{pmatrix}2&1\\0&2\end{pmatrix} has characteristic polynomial (λ2)2(\lambda-2)^2, so ma(2)=2m_a(2)=2. But A2I=(0100)A-2I=\begin{pmatrix}0&1\\0&0\end{pmatrix} has a 1-dimensional null space, so mg(2)=1<2m_g(2)=1<2. The matrix is defective.


05 — Eigenvalues of Special Matrices

Matrix typeEigenvalues
Diagonal D=diag(d1,,dn)D=\text{diag}(d_1,\ldots,d_n)d1,,dnd_1,\ldots,d_n (the diagonal entries)
TriangularDiagonal entries
Symmetric (A=AA=A^\top)All real
Orthogonal (AA=IA^\top A=I)All have λ=1\lvert\lambda\rvert=1
Projection (A2=AA^2=A)Only 00 and 11
Positive definiteAll λ>0\lambda>0
Note

Symmetric matrices always have real eigenvalues and orthogonal eigenvectors — a key reason covariance matrices (which are symmetric positive semi-definite) are so tractable in multivariate statistics.


06 — Quant Application — PCA and Covariance Spectra

Principal Component Analysis (PCA) is purely the eigendecomposition of the covariance matrix Σ\Sigma.

Given pp assets with covariance matrix Σ\Sigma (symmetric, positive semi-definite):

  1. Find eigenvalues λ1λ2λp0\lambda_1\geq\lambda_2\geq\cdots\geq\lambda_p\geq0 and corresponding orthonormal eigenvectors v1,,vp\mathbf{v}_1,\ldots,\mathbf{v}_p.
  2. The kk-th principal component is the portfolio w=vk\mathbf{w}=\mathbf{v}_k; its variance is λk\lambda_k.
  3. The first PC explains λ1/iλi\lambda_1/\sum_i\lambda_i of total variance.

In fixed-income PCA, the first three PCs of yield-curve moves are almost universally interpreted as level (λ180%\lambda_1\approx80\%), slope (λ2\lambda_2), and curvature (λ3\lambda_3).

A near-zero eigenvalue of Σ\Sigma signals that a linear combination of assets is nearly riskless — useful for detecting near-arbitrage or near-multicollinear factors.


Exercises

EXERCISE 6.1

Form AλIA-\lambda I, compute det(AλI)=0\det(A-\lambda I)=0, solve the resulting quadratic. Verify using tr=λ1+λ2\text{tr}=\lambda_1+\lambda_2 and det=λ1λ2\det=\lambda_1\lambda_2.

A=(6215)A=\begin{pmatrix}6&2\\1&5\end{pmatrix}.

p(λ)=det(6λ215λ)=(6λ)(5λ)2=(3011λ+λ2)2=λ211λ+28p(\lambda)=\det\begin{pmatrix}6-\lambda&2\\1&5-\lambda\end{pmatrix}=(6-\lambda)(5-\lambda)-2=(30-11\lambda+\lambda^2)-2=\lambda^2-11\lambda+28.

λ211λ+28=(λ7)(λ4)=0λ1=7,λ2=4\lambda^2-11\lambda+28=(\lambda-7)(\lambda-4)=0 \Rightarrow \lambda_1=7,\,\lambda_2=4.

Check: tr(A)=11=7+4\text{tr}(A)=11=7+4\,\checkmark; det(A)=302=28=74\det(A)=30-2=28=7\cdot4\,\checkmark.

Find the eigenvalues of (6215)\begin{pmatrix}6&2\\1&5\end{pmatrix}.

EXERCISE 6.2

For each eigenvalue λk\lambda_k found in 6.1, solve (AλkI)v=0(A-\lambda_k I)\mathbf{v}=\mathbf{0} by row reduction. Express the eigenspace as a span.

Eigenvalue λ=7\lambda=7: A7I=(1212)A-7I=\begin{pmatrix}-1&2\\1&-2\end{pmatrix}. R2R2+R1R_2\leftarrow R_2+R_1: (1200)\begin{pmatrix}-1&2\\0&0\end{pmatrix}. So v1+2v2=0v1=2v2-v_1+2v_2=0 \Rightarrow v_1=2v_2. E7=span ⁣{(21)}E_7=\text{span}\!\left\{\begin{pmatrix}2\\1\end{pmatrix}\right\}.

Eigenvalue λ=4\lambda=4: A4I=(2211)A-4I=\begin{pmatrix}2&2\\1&1\end{pmatrix}. R2R212R1R_2\leftarrow R_2-\tfrac{1}{2}R_1: (2200)\begin{pmatrix}2&2\\0&0\end{pmatrix}. So 2v1+2v2=0v1=v22v_1+2v_2=0 \Rightarrow v_1=-v_2. E4=span ⁣{(11)}E_4=\text{span}\!\left\{\begin{pmatrix}-1\\1\end{pmatrix}\right\}.

Verify: A(21)=(12+22+5)=(147)=7(21)A\begin{pmatrix}2\\1\end{pmatrix}=\begin{pmatrix}12+2\\2+5\end{pmatrix}=\begin{pmatrix}14\\7\end{pmatrix}=7\begin{pmatrix}2\\1\end{pmatrix}\,\checkmark.

Find the eigenvectors of (6215)\begin{pmatrix}6&2\\1&5\end{pmatrix} for each eigenvalue found in Exercise 6.1.

EXERCISE 6.3

A triangular matrix has eigenvalues equal to its diagonal entries. No computation needed — just read them off.

T=(372015004)T=\begin{pmatrix}3&7&2\\0&-1&5\\0&0&4\end{pmatrix} is upper triangular.

Eigenvalues: λ1=3\lambda_1=3, λ2=1\lambda_2=-1, λ3=4\lambda_3=4.

Check: tr(T)=6=3+(1)+4\text{tr}(T)=6=3+(-1)+4\,\checkmark; det(T)=3(1)4=12=(3)(1)(4)\det(T)=3\cdot(-1)\cdot4=-12=(3)(-1)(4)\,\checkmark.

State the eigenvalues of (372015004)\begin{pmatrix}3&7&2\\0&-1&5\\0&0&4\end{pmatrix} and justify without full computation.

EXERCISE 6.4

Use the trace and determinant relations: λ1+λ2=tr(A)\lambda_1+\lambda_2=\text{tr}(A) and λ1λ2=det(A)\lambda_1\lambda_2=\det(A). Also recall that for positive definite matrices all eigenvalues are positive.

Σ=(4223)\Sigma=\begin{pmatrix}4&2\\2&3\end{pmatrix}.

tr(Σ)=7=λ1+λ2\text{tr}(\Sigma)=7=\lambda_1+\lambda_2; det(Σ)=124=8=λ1λ2\det(\Sigma)=12-4=8=\lambda_1\lambda_2.

Characteristic equation: λ27λ+8=0\lambda^2-7\lambda+8=0. λ=7±49322=7±172\lambda=\frac{7\pm\sqrt{49-32}}{2}=\frac{7\pm\sqrt{17}}{2}.

λ1=7+1725.56\lambda_1=\frac{7+\sqrt{17}}{2}\approx5.56; λ2=71721.44\lambda_2=\frac{7-\sqrt{17}}{2}\approx1.44.

Both positive \Rightarrow Σ\Sigma is positive definite. λ1/(λ1+λ2)79%\lambda_1/(\lambda_1+\lambda_2)\approx79\% of variance is explained by the first PC.

A covariance matrix for two assets is Σ=(4223)\Sigma=\begin{pmatrix}4&2\\2&3\end{pmatrix}. Find the eigenvalues and determine the percentage of total variance explained by the first principal component.

EXERCISE 6.5

A projection satisfies A2=AA^2=A. If Av=λvA\mathbf{v}=\lambda\mathbf{v}, apply AA again: A2v=A(λv)=λ2vA^2\mathbf{v}=A(\lambda\mathbf{v})=\lambda^2\mathbf{v}. But A2v=Av=λvA^2\mathbf{v}=A\mathbf{v}=\lambda\mathbf{v}. What does that tell you about λ\lambda?

If A2=AA^2=A and Av=λvA\mathbf{v}=\lambda\mathbf{v}, then A2v=λ2vA^2\mathbf{v}=\lambda^2\mathbf{v} (apply AA once more).

But A2v=Av=λvA^2\mathbf{v}=A\mathbf{v}=\lambda\mathbf{v}.

So λ2v=λv\lambda^2\mathbf{v}=\lambda\mathbf{v}. Since v0\mathbf{v}\neq\mathbf{0}: λ2=λλ(λ1)=0λ{0,1}\lambda^2=\lambda \Rightarrow \lambda(\lambda-1)=0 \Rightarrow \lambda\in\{0,1\}.

Geometrically: vectors in the image of the projection are fixed (λ=1\lambda=1); vectors in the kernel are mapped to zero (λ=0\lambda=0).

Prove that the only eigenvalues of a projection matrix (A2=AA^2=A) are 00 and 11.

EXERCISE 6.6

The covariance matrix of returns Σ\Sigma has eigenvalues equal to the variances of the principal components. The condition number κ=λmax/λmin\kappa=\lambda_{\max}/\lambda_{\min} measures near-singularity. When λmin0\lambda_{\min}\approx0, a linear combination of assets has near-zero variance.

Given eigenvalues λ1=12\lambda_1=12, λ2=3\lambda_2=3, λ3=0.1\lambda_3=0.1.

Total variance: 12+3+0.1=15.112+3+0.1=15.1.

PC1 explains 12/15.179.5%12/15.1\approx79.5\%; PC2 explains 3/15.119.9%3/15.1\approx19.9\%; PC3 explains 0.1/15.10.66%0.1/15.1\approx0.66\%.

Condition number: κ=λ1/λ3=12/0.1=120\kappa=\lambda_1/\lambda_3=12/0.1=120. A condition number of 120120 means the portfolio of assets corresponding to v3\mathbf{v}_3 has variance 0.10.1 — near-riskless relative to the dominant risk factor.

Quant implication: the factor v3\mathbf{v}_3 (the third PC) is a near-arbitrage combination. A long-short portfolio along v3\mathbf{v}_3 has very low residual risk and could be a mean-reversion candidate.

A three-asset covariance matrix has eigenvalues 12,3,0.112,\,3,\,0.1. Compute the percentage of variance explained by each principal component and interpret the smallest eigenvalue in the context of statistical arbitrage.


Chapter Summary

ConceptFormula / Rule
Eigenvalue equationAv=λvA\mathbf{v}=\lambda\mathbf{v}, v0\mathbf{v}\neq\mathbf{0}
Characteristic polynomialp(λ)=det(AλI)p(\lambda)=\det(A-\lambda I)
EigenvaluesRoots of p(λ)=0p(\lambda)=0
EigenspaceEλ=ker(AλI)E_\lambda=\ker(A-\lambda I)
Trace = sum of eigenvaluestr(A)=iλi\text{tr}(A)=\sum_i\lambda_i
Determinant = product of eigenvaluesdet(A)=iλi\det(A)=\prod_i\lambda_i
Algebraic multiplicityMultiplicity as root of p(λ)p(\lambda)
Geometric multiplicitydimker(AλI)\dim\ker(A-\lambda I)
Defective matrixmg<mam_g<m_a for some λ\lambda
PCA connectionEigenvectors of Σ\Sigma = principal components; eigenvalues = component variances

Next chapter: Chapter 07 — Diagonalization, where we factor A=PDP1A=PDP^{-1} using eigenvectors as columns of PP and eigenvalues on the diagonal of DD.