Matrix Decompositions — SVD
00 · Symbol Glossary
The singular value decomposition of any matrix . holds left singular vectors (orthonormal in ), is diagonal with nonnegative singular values , and holds right singular vectors (orthonormal in ). Unlike LU or QR, SVD exists for every matrix — square, rectangular, singular, or rank-deficient.
The -th singular value of — the square root of the -th eigenvalue of (or ). Read aloud as "sigma-i." is the largest; always. In data analysis, measures variance along the -th principal direction.
The -th column of . Satisfies and . Left singular vectors are orthonormal eigenvectors of .
The -th column of . Satisfies and . Right singular vectors are orthonormal eigenvectors of .
The pseudoinverse , where inverts nonzero singular values and leaves zeros at zero. For full-rank square , . For overdetermined least squares, gives the minimum-norm least squares solution.
01 · The Universal Decomposition
LU requires square invertible matrices (with pivoting). QR requires linearly independent columns. Eigenvalue decomposition requires a square diagonalisable matrix. The SVD exists for every real matrix — no conditions.
For any with rank , there exist:
— orthogonal ().
— orthogonal ().
— diagonal with and .
such that:
— the singular values of .
— columns of , the left singular vectors.
— columns of , the right singular vectors.
A matrix holds daily excess returns for 50 stocks over one trading year. SVD gives :
— the first right singular vector — is the portfolio of stock weights explaining the most cross-sectional variance ().
— orthogonal to — captures the next-largest independent pattern.
Keeping the top singular values approximates with a rank- matrix, compressing 50-dimensional daily data into statistical factors. This is PCA in matrix form — Chapter 15's spectral decomposition of is the SVD of the centred data matrix.
— rank 1, singular. LU fails (zero pivot). QR fails (dependent columns). Eigenvalue decomposition gives , but the eigenvector for does not span the null space cleanly in numerical code.
SVD: , . spans the column space; spans the null space of .
Consequence: SVD is the robust tool for rank-deficient and rectangular matrices. It is not a special case — it is the general decomposition from which LU, QR, and eigendecomposition are specialisations.
02 · Singular Values from
The right singular vectors and singular values come from the eigenproblem of .
The singular values of satisfy:
— the -th eigenvalue of , ordered .
The right singular vector is the unit eigenvector of for .
The left singular vector is recovered by:
Form : .
The entry comes from ; the entry from .
Find eigenvalues of : characteristic polynomial .
, . Singular values: , .
Right singular vector (eigenvector for ): .
from the first row: . Unit vector: .
Right singular vector (eigenvector for ): .
. Unit vector: .
Left singular vectors from :
. .
. .
Assemble the SVD:
Verify: is the dominant stretching direction; is the secondary direction. and are orthogonal ( ✓).
03 · Geometric Interpretation
SVD reveals that every linear map is: rotate (), scale along axes (), rotate ().
The unit circle in maps under to an ellipse in . The semi-axes of that ellipse have lengths , oriented along .
— input direction that stretches by exactly without rotation (output is parallel to ).
The operator norm — the maximum stretching factor. The condition number when has full rank.
For , eigenvalues are but .
Why it breaks: eigenvalues come from — must act parallel to . Singular values come from — they measure stretching in the best-matching input-output directions, which need not be the same vector.
Consequence: and coincide in absolute value only for symmetric matrices (up to sign). For general , use SVD for norms, conditioning, and low-rank structure — not the eigenvalue decomposition of itself.
04 · Low-Rank Approximation (Eckart–Young)
For , the rank- matrix closest to in the Frobenius norm is:
— keep the top singular triplets, discard the rest.
Eckart–Young theorem: — the discarded singular values measure the approximation error exactly.
For with , : the rank-1 approximation is .
Fraction of "energy" retained: .
In a 500-stock return panel, if the top 10 singular values capture 85% of total variance (), a rank-10 factor model replaces 500 dimensions with 10 — the backbone of statistical risk models in portfolio management.
05 · Connection to PCA and the Pseudoinverse
If is the centred data matrix (each column demeaned), then:
Columns of are the principal component directions. is the variance explained by PC . The sample covariance — an eigendecomposition with eigenvalues .
PCA from Chapter 15 and SVD of the data matrix are the same computation in different packaging.
For , define:
The least squares solution of minimum Euclidean norm is . When is invertible, .
A desk has exposure matrix mapping instrument weights to risk factors. Some factors are redundant (rank ). The minimum-norm hedge uses SVD to distribute hedging across instruments without inflating position sizes — the pseudoinverse selects the smallest among all exact or least-squares solutions.
06 · Practice Exercises
For diagonal , singular values are . , (up to sign flips in columns to make ).
. .
, . , .
.
.
✓.
Find the SVD of . List , , and the columns of and .
Compute , find its eigenvalues, take square roots for . Eigenvectors of give ; then .
. .
Characteristic: . .
, .
For : eigenvector — normalise to unit length.
. (Full numeric values follow from the quadratic formula.)
Rank — both singular values are positive.
Compute the singular values of by solving the eigenproblem for . State whether is full rank.
Rank-1 approximation: . Frobenius error .
From the chapter: , , , .
.
.
Variance captured: .
For , write the rank-1 approximation . Compute and the fraction of squared singular values retained.
, . spans column space; spans null space. inverts only .
. — eigenvalues and .
, . .
(the pseudoinverse of a rank-1 matrix).
Null space: . ✓.
Column space of = span of . Null space of = span of . SVD separates both cleanly.
For (rank 1): find , , identify and , and describe the column space and null space.
PCA of centred equals SVD of . Variance of PC is . Fraction explained by top PCs: .
(3 observations, 2 assets, already centred).
. Eigenvalues: and . , .
One PC explains of variance — the assets move in perfect opposition ().
— long asset 1, short asset 2.
Total variance . PC1 fraction .
Centred data matrix (3 days, 2 assets). Find , and the fraction of total variance explained by the first principal component.
Use from the chapter example. with . Compute .
, .
Exact solution exists: since .
Via SVD: . With , :
.
.
.
✓.
The pseudoinverse recovers the exact solution. For inconsistent , it returns the minimum-norm least squares solution.
For and , compute using the SVD from this chapter. Verify .
07 · Summary
| Term | Definition |
|---|---|
| SVD | ; exists for every matrix |
| Singular value | ; |
| Left singular vector | Column of ; |
| Right singular vector | Unit eigenvector of ; column of |
| Operator norm | |
| Rank- approximation | ; error |
| PCA connection | SVD of centred gives PC directions and variances |
| Pseudoinverse | ; least squares with minimum norm |
Next: Markov Chains & Steady States — stochastic matrices, stationary distributions as eigenvectors, and credit-rating transitions in quantitative risk.