Let A∈Rn×n. A scalar λ∈C is an eigenvalue of A if there exists a nonzero vector v∈Rn (or Cn) such that
Av=λv
Such a vector v=0 is called an eigenvector corresponding to λ.
Rearranging: (A−λI)v=0. This system has a nonzero solution if and only if A−λI is singular, i.e.
det(A−λI)=0
Note
Every scalar multiple of an eigenvector is also an eigenvector for the same eigenvalue. The set of all eigenvectors for λ (together with 0) forms a subspace called the eigenspaceEλ.
Example
If A represents a stretch by factor 3 along the x-axis and factor 1 along the y-axis, then the x-axis direction is an eigenvector with λ=3 and the y-axis direction is an eigenvector with λ=1.
Common mistake
Wrong:v=0 satisfies A0=λ0 for any λ, so 0 is an eigenvector. Why it happens: The equation is satisfied, so it feels valid. Correct: Eigenvectors must be nonzero by definition. The zero vector is excluded because it carries no directional information. Check: Any definition of eigenvector explicitly states v=0.
02 — The Characteristic Equation
Definition
The characteristic polynomial of A∈Rn×n is
p(λ)=det(A−λI)
It is a degree-n polynomial in λ. The eigenvalues of A are the roots of p(λ)=0 (the characteristic equation).
Find eigenvalues of $A=\begin{pmatrix}4&1\\2&3\end{pmatrix}$
A−λI=(4−λ213−λ)
— subtract λ from each diagonal entry 4 and 3.
p(λ)=det(4−λ213−λ)=(4−λ)(3−λ)−1⋅2=(12−7λ+λ2)−2=λ2−7λ+10
— the 12 comes from 4⋅3; the 7λ from −(4λ+3λ); the −2 from −(1⋅2).
λ2−7λ+10=0⟹(λ−5)(λ−2)=0
— factor or use the quadratic formula; roots are λ=5 and λ=2.
tr(A)=4+3=7=5+2✓; det(A)=4⋅3−1⋅2=10=5⋅2✓.
Note
Trace = sum of eigenvalues and determinant = product of eigenvalues (both with algebraic multiplicity). These are fast sanity checks.
03 — Finding Eigenvectors
Once you have an eigenvalue λk, find Eλk=ker(A−λkI) by row-reducing (A−λkI)v=0.
Find eigenvectors of $A=\begin{pmatrix}4&1\\2&3\end{pmatrix}$ for $\lambda=5$ and $\lambda=2$
A−5I=(−121−2)
— subtract 5 from each diagonal: 4−5=−1 and 3−5=−2.
R2←R2+2R1:
(−1010)
— 2+2(−1)=0; −2+2(1)=0. One free variable (v2=t).
From −v1+v2=0: v1=v2=t. Eigenvector: t(11), t=0.
A−2I=(2211)
— 4−2=2; 3−2=1.
R2←R2−R1:
(2010)
— one free variable (v2=s).
From 2v1+v2=0: v1=−v2/2=−s/2. Eigenvector: s(−12) (set s=2 for integer entries), s=0.
Common mistake
Wrong: "the eigenvector for λ=5 is (11)." Why it happens: We computed t=1 for convenience. Correct: the eigenspace is E5=span{(11)} — any nonzero scalar multiple is equally valid. Check: verify A⋅(2)(11)=5⋅(2)(11) also holds.
04 — Algebraic and Geometric Multiplicity
Definition
Let λk be an eigenvalue of A.
Algebraic multiplicityma(λk): the multiplicity of λk as a root of p(λ)=det(A−λI).
Geometric multiplicitymg(λk): the dimension of the eigenspace Eλk=ker(A−λkI).
It always holds that 1≤mg(λk)≤ma(λk).
If mg(λk)<ma(λk) for any λk, the matrix is defective — it cannot be diagonalised.
Example
A=(2012) has characteristic polynomial (λ−2)2, so ma(2)=2. But A−2I=(0010) has a 1-dimensional null space, so mg(2)=1<2. The matrix is defective.
05 — Eigenvalues of Special Matrices
Matrix type
Eigenvalues
Diagonal D=diag(d1,…,dn)
d1,…,dn (the diagonal entries)
Triangular
Diagonal entries
Symmetric (A=A⊤)
All real
Orthogonal (A⊤A=I)
All have ∣λ∣=1
Projection (A2=A)
Only 0 and 1
Positive definite
All λ>0
Note
Symmetric matrices always have real eigenvalues and orthogonal eigenvectors — a key reason covariance matrices (which are symmetric positive semi-definite) are so tractable in multivariate statistics.
06 — Quant Application — PCA and Covariance Spectra
Principal Component Analysis (PCA) is purely the eigendecomposition of the covariance matrix Σ.
Given p assets with covariance matrix Σ (symmetric, positive semi-definite):
Find eigenvalues λ1≥λ2≥⋯≥λp≥0 and corresponding orthonormal eigenvectors v1,…,vp.
The k-th principal component is the portfolio w=vk; its variance is λk.
The first PC explains λ1/∑iλi of total variance.
In fixed-income PCA, the first three PCs of yield-curve moves are almost universally interpreted as level (λ1≈80%), slope (λ2), and curvature (λ3).
A near-zero eigenvalue of Σ signals that a linear combination of assets is nearly riskless — useful for detecting near-arbitrage or near-multicollinear factors.
Exercises
EXERCISE 6.1
Form A−λI, compute det(A−λI)=0, solve the resulting quadratic. Verify using tr=λ1+λ2 and det=λ1λ2.
State the eigenvalues of 3007−10254 and justify without full computation.
EXERCISE 6.4
Use the trace and determinant relations: λ1+λ2=tr(A) and λ1λ2=det(A). Also recall that for positive definite matrices all eigenvalues are positive.
Both positive ⇒Σ is positive definite. λ1/(λ1+λ2)≈79% of variance is explained by the first PC.
A covariance matrix for two assets is Σ=(4223). Find the eigenvalues and determine the percentage of total variance explained by the first principal component.
EXERCISE 6.5
A projection satisfies A2=A. If Av=λv, apply A again: A2v=A(λv)=λ2v. But A2v=Av=λv. What does that tell you about λ?
If A2=A and Av=λv, then A2v=λ2v (apply A once more).
But A2v=Av=λv.
So λ2v=λv. Since v=0: λ2=λ⇒λ(λ−1)=0⇒λ∈{0,1}.
Geometrically: vectors in the image of the projection are fixed (λ=1); vectors in the kernel are mapped to zero (λ=0).
Prove that the only eigenvalues of a projection matrix (A2=A) are 0 and 1.
EXERCISE 6.6
The covariance matrix of returns Σ has eigenvalues equal to the variances of the principal components. The condition number κ=λmax/λmin measures near-singularity. When λmin≈0, a linear combination of assets has near-zero variance.
Condition number: κ=λ1/λ3=12/0.1=120. A condition number of 120 means the portfolio of assets corresponding to v3 has variance 0.1 — near-riskless relative to the dominant risk factor.
Quant implication: the factor v3 (the third PC) is a near-arbitrage combination. A long-short portfolio along v3 has very low residual risk and could be a mean-reversion candidate.
A three-asset covariance matrix has eigenvalues 12,3,0.1. Compute the percentage of variance explained by each principal component and interpret the smallest eigenvalue in the context of statistical arbitrage.
Chapter Summary
Concept
Formula / Rule
Eigenvalue equation
Av=λv, v=0
Characteristic polynomial
p(λ)=det(A−λI)
Eigenvalues
Roots of p(λ)=0
Eigenspace
Eλ=ker(A−λI)
Trace = sum of eigenvalues
tr(A)=∑iλi
Determinant = product of eigenvalues
det(A)=∏iλi
Algebraic multiplicity
Multiplicity as root of p(λ)
Geometric multiplicity
dimker(A−λI)
Defective matrix
mg<ma for some λ
PCA connection
Eigenvectors of Σ = principal components; eigenvalues = component variances
Next chapter: Chapter 07 — Diagonalization, where we factor A=PDP−1 using eigenvectors as columns of P and eigenvalues on the diagonal of D.