Chapter 18

Matrix Decompositions — SVD

00 · Symbol Glossary

$A = U\Sigma V^T$A equals U Sigma V transpose — SVD

The singular value decomposition of any m×nm \times n matrix AA. UU holds left singular vectors (orthonormal in Rm\mathbb{R}^m), Σ\Sigma is diagonal with nonnegative singular values σ1σ20\sigma_1 \geq \sigma_2 \geq \cdots \geq 0, and VV holds right singular vectors (orthonormal in Rn\mathbb{R}^n). Unlike LU or QR, SVD exists for every matrix — square, rectangular, singular, or rank-deficient.

$\sigma_i$sigma i — singular value

The ii-th singular value of AA — the square root of the ii-th eigenvalue of ATAA^TA (or AATAA^T). Read aloud as "sigma-i." σ1\sigma_1 is the largest; σi0\sigma_i \geq 0 always. In data analysis, σi2\sigma_i^2 measures variance along the ii-th principal direction.

$\mathbf{u}_i$u sub i — left singular vector

The ii-th column of UU. Satisfies Avi=σiuiA\mathbf{v}_i = \sigma_i \mathbf{u}_i and AATui=σi2uiAA^T\mathbf{u}_i = \sigma_i^2 \mathbf{u}_i. Left singular vectors are orthonormal eigenvectors of AATAA^T.

$\mathbf{v}_i$v sub i — right singular vector

The ii-th column of VV. Satisfies ATui=σiviA^T\mathbf{u}_i = \sigma_i \mathbf{v}_i and ATAvi=σi2viA^TA\mathbf{v}_i = \sigma_i^2 \mathbf{v}_i. Right singular vectors are orthonormal eigenvectors of ATAA^TA.

$A^+$A plus — Moore-Penrose pseudoinverse

The pseudoinverse A+=VΣ+UTA^+ = V\Sigma^+ U^T, where Σ+\Sigma^+ inverts nonzero singular values and leaves zeros at zero. For full-rank square AA, A+=A1A^+ = A^{-1}. For overdetermined least squares, x^=A+b\hat{\mathbf{x}} = A^+\mathbf{b} gives the minimum-norm least squares solution.


01 · The Universal Decomposition

LU requires square invertible matrices (with pivoting). QR requires linearly independent columns. Eigenvalue decomposition requires a square diagonalisable matrix. The SVD exists for every m×nm \times n real matrix — no conditions.

Definition — Singular Value Decomposition

For any ARm×nA \in \mathbb{R}^{m \times n} with rank rr, there exist:

URm×mU \in \mathbb{R}^{m \times m} — orthogonal (UTU=UUT=ImU^TU = UU^T = I_m).

VRn×nV \in \mathbb{R}^{n \times n} — orthogonal (VTV=VVT=InV^TV = VV^T = I_n).

ΣRm×n\Sigma \in \mathbb{R}^{m \times n} — diagonal with σ1σ2σr>0\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_r > 0 and σr+1==0\sigma_{r+1} = \cdots = 0.

such that:

A=UΣVTA = U\Sigma V^T

σi\sigma_i — the singular values of AA.

ui\mathbf{u}_i — columns of UU, the left singular vectors.

vi\mathbf{v}_i — columns of VV, the right singular vectors.

✓ Example — SVD of a Return Panel

A 252×50252 \times 50 matrix AA holds daily excess returns for 50 stocks over one trading year. SVD gives A=UΣVTA = U\Sigma V^T:

v1\mathbf{v}_1 — the first right singular vector — is the portfolio of stock weights explaining the most cross-sectional variance (σ12\sigma_1^2).

v2\mathbf{v}_2 — orthogonal to v1\mathbf{v}_1 — captures the next-largest independent pattern.

Keeping the top k50k \ll 50 singular values approximates AA with a rank-kk matrix, compressing 50-dimensional daily data into kk statistical factors. This is PCA in matrix form — Chapter 15's spectral decomposition of Σ\Sigma is the SVD of the centred data matrix.

❌ Failure — Treating SVD as Optional for Singular Matrices

A=(1224)A = \begin{pmatrix}1&2\\2&4\end{pmatrix} — rank 1, singular. LU fails (zero pivot). QR fails (dependent columns). Eigenvalue decomposition gives λ1=5\lambda_1 = 5, λ2=0\lambda_2 = 0 but the eigenvector for λ2\lambda_2 does not span the null space cleanly in numerical code.

SVD: σ1=5\sigma_1 = 5, σ2=0\sigma_2 = 0. v1=15(12)\mathbf{v}_1 = \frac{1}{\sqrt{5}}\begin{pmatrix}1\\2\end{pmatrix} spans the column space; v2=15(21)\mathbf{v}_2 = \frac{1}{\sqrt{5}}\begin{pmatrix}-2\\1\end{pmatrix} spans the null space of AA.

Consequence: SVD is the robust tool for rank-deficient and rectangular matrices. It is not a special case — it is the general decomposition from which LU, QR, and eigendecomposition are specialisations.


02 · Singular Values from ATAA^TA

The right singular vectors and singular values come from the eigenproblem of ATAA^TA.

Definition — Singular Values as Eigenvalues

The singular values of AA satisfy:

σi=λi(ATA)\sigma_i = \sqrt{\lambda_i(A^TA)}

λi(ATA)\lambda_i(A^TA) — the ii-th eigenvalue of ATAA^TA, ordered λ1λ20\lambda_1 \geq \lambda_2 \geq \cdots \geq 0.

The right singular vector vi\mathbf{v}_i is the unit eigenvector of ATAA^TA for λi\lambda_i.

The left singular vector is recovered by:

ui=1σiAvi(σi>0)\mathbf{u}_i = \frac{1}{\sigma_i} A\mathbf{v}_i \quad (\sigma_i > 0)
Step-by-step — SVD of $A = \begin{pmatrix}1&2\\2&1\end{pmatrix}$
1

Form ATAA^TA: ATA=(1221)(1221)=(1+42+22+24+1)=(5445)A^TA = \begin{pmatrix}1&2\\2&1\end{pmatrix}\begin{pmatrix}1&2\\2&1\end{pmatrix} = \begin{pmatrix}1+4&2+2\\2+2&4+1\end{pmatrix} = \begin{pmatrix}5&4\\4&5\end{pmatrix}.

The (1,1)(1,1) entry 55 comes from 12+221^2+2^2; the (1,2)(1,2) entry 44 from 12+211\cdot2+2\cdot1.

2

Find eigenvalues of ATAA^TA: characteristic polynomial det(ATAλI)=(5λ)216=λ210λ+9=(λ9)(λ1)\det(A^TA - \lambda I) = (5-\lambda)^2 - 16 = \lambda^2 - 10\lambda + 9 = (\lambda-9)(\lambda-1).

λ1=9\lambda_1 = 9, λ2=1\lambda_2 = 1. Singular values: σ1=9=3\sigma_1 = \sqrt{9} = 3, σ2=1=1\sigma_2 = \sqrt{1} = 1.

3

Right singular vector v1\mathbf{v}_1 (eigenvector for λ1=9\lambda_1=9): (ATA9I)v=(4444)v=0(A^TA - 9I)\mathbf{v} = \begin{pmatrix}-4&4\\4&-4\end{pmatrix}\mathbf{v} = \mathbf{0}.

v1=v2v_1 = v_2 from the first row: 4v1+4v2=0v1=v2-4v_1+4v_2=0 \Rightarrow v_1=v_2. Unit vector: v1=12(11)\mathbf{v}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}.

4

Right singular vector v2\mathbf{v}_2 (eigenvector for λ2=1\lambda_2=1): (ATAI)v=(4444)v=0(A^TA - I)\mathbf{v} = \begin{pmatrix}4&4\\4&4\end{pmatrix}\mathbf{v} = \mathbf{0}.

v1=v2v_1 = -v_2. Unit vector: v2=12(11)\mathbf{v}_2 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix}.

5

Left singular vectors from ui=Avi/σi\mathbf{u}_i = A\mathbf{v}_i/\sigma_i:

Av1=12(1+22+1)=32(11)A\mathbf{v}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1+2\\2+1\end{pmatrix} = \frac{3}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}. u1=1332(11)=12(11)\mathbf{u}_1 = \frac{1}{3}\cdot\frac{3}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}.

Av2=12(1221)=12(11)A\mathbf{v}_2 = \frac{1}{\sqrt{2}}\begin{pmatrix}1-2\\2-1\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}-1\\1\end{pmatrix}. u2=1112(11)=12(11)\mathbf{u}_2 = \frac{1}{1}\cdot\frac{1}{\sqrt{2}}\begin{pmatrix}-1\\1\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}-1\\1\end{pmatrix}.

6

Assemble the SVD:

A=12(1111)U(3001)Σ12(1111)TVTA = \underbrace{\frac{1}{\sqrt{2}}\begin{pmatrix}1&-1\\1&1\end{pmatrix}}_{U} \underbrace{\begin{pmatrix}3&0\\0&1\end{pmatrix}}_{\Sigma} \underbrace{\frac{1}{\sqrt{2}}\begin{pmatrix}1&1\\1&-1\end{pmatrix}^T}_{V^T}

Verify: σ1=3\sigma_1=3 is the dominant stretching direction; σ2=1\sigma_2=1 is the secondary direction. UU and VV are orthogonal (UTU=I2U^TU = I_2 ✓).


03 · Geometric Interpretation

SVD reveals that every linear map is: rotate (VTV^T), scale along axes (Σ\Sigma), rotate (UU).

Definition — Action of SVD on the Unit Sphere

The unit circle in Rn\mathbb{R}^n maps under AA to an ellipse in Rm\mathbb{R}^m. The semi-axes of that ellipse have lengths σ1,σ2,\sigma_1, \sigma_2, \ldots, oriented along u1,u2,\mathbf{u}_1, \mathbf{u}_2, \ldots.

Avi=σi\|A\mathbf{v}_i\| = \sigma_i

vi\mathbf{v}_i — input direction that AA stretches by exactly σi\sigma_i without rotation (output is parallel to ui\mathbf{u}_i).

The operator norm A2=σ1\|A\|_2 = \sigma_1 — the maximum stretching factor. The condition number κ(A)=σ1/σr\kappa(A) = \sigma_1/\sigma_r when AA has full rank.

❌ Failure — Confusing Singular Values with Eigenvalues

For A=(0100)A = \begin{pmatrix}0&1\\0&0\end{pmatrix}, eigenvalues are 0,00, 0 but σ1=1\sigma_1 = 1.

Why it breaks: eigenvalues come from Av=λvA\mathbf{v} = \lambda\mathbf{v}AA must act parallel to v\mathbf{v}. Singular values come from ATAA^TA — they measure stretching in the best-matching input-output directions, which need not be the same vector.

Consequence: σi\sigma_i and λi\lambda_i coincide in absolute value only for symmetric matrices (up to sign). For general AA, use SVD for norms, conditioning, and low-rank structure — not the eigenvalue decomposition of AA itself.


04 · Low-Rank Approximation (Eckart–Young)

Definition — Best Rank-$k$ Approximation

For A=UΣVT=i=1rσiuiviTA = U\Sigma V^T = \sum_{i=1}^r \sigma_i \mathbf{u}_i \mathbf{v}_i^T, the rank-kk matrix closest to AA in the Frobenius norm is:

Ak=i=1kσiuiviT=UkΣkVkTA_k = \sum_{i=1}^k \sigma_i \mathbf{u}_i \mathbf{v}_i^T = U_k \Sigma_k V_k^T

AkA_k — keep the top kk singular triplets, discard the rest.

Eckart–Young theorem: AAkF=σk+12++σr2\|A - A_k\|_F = \sqrt{\sigma_{k+1}^2 + \cdots + \sigma_r^2} — the discarded singular values measure the approximation error exactly.

✓ Example — Compressing a Covariance Structure

For A=(1221)A = \begin{pmatrix}1&2\\2&1\end{pmatrix} with σ1=3\sigma_1=3, σ2=1\sigma_2=1: the rank-1 approximation is A1=3u1v1TA_1 = 3\mathbf{u}_1\mathbf{v}_1^T.

Fraction of "energy" retained: σ12/(σ12+σ22)=9/10=90%\sigma_1^2/(\sigma_1^2+\sigma_2^2) = 9/10 = 90\%.

In a 500-stock return panel, if the top 10 singular values capture 85% of total variance (i=110σi2/i=1500σi2\sum_{i=1}^{10}\sigma_i^2 / \sum_{i=1}^{500}\sigma_i^2), a rank-10 factor model replaces 500 dimensions with 10 — the backbone of statistical risk models in portfolio management.


05 · Connection to PCA and the Pseudoinverse

Definition — SVD and PCA

If XX is the T×pT \times p centred data matrix (each column demeaned), then:

X=UΣVTX = U\Sigma V^T

Columns of VV are the principal component directions. σi2/T\sigma_i^2/T is the variance explained by PC ii. The sample covariance Σ^=1TXTX=VΣ2TVT\hat{\Sigma} = \frac{1}{T}X^TX = V\frac{\Sigma^2}{T}V^T — an eigendecomposition with eigenvalues σi2/T\sigma_i^2/T.

PCA from Chapter 15 and SVD of the data matrix are the same computation in different packaging.

Definition — Moore–Penrose Pseudoinverse

For A=UΣVTA = U\Sigma V^T, define:

A+=VΣ+UT,Σii+={1/σiσi>00σi=0A^+ = V\Sigma^+ U^T, \quad \Sigma^+_{ii} = \begin{cases}1/\sigma_i & \sigma_i > 0 \\ 0 & \sigma_i = 0\end{cases}

The least squares solution of minimum Euclidean norm is x^=A+b\hat{\mathbf{x}} = A^+\mathbf{b}. When AA is invertible, A+=A1A^+ = A^{-1}.

✓ Example — Hedging with a Rank-Deficient Exposure Matrix

A desk has exposure matrix ARm×nA \in \mathbb{R}^{m \times n} mapping nn instrument weights to mm risk factors. Some factors are redundant (rank r<nr < n). The minimum-norm hedge w=A+e\mathbf{w} = A^+\mathbf{e} uses SVD to distribute hedging across instruments without inflating position sizes — the pseudoinverse selects the smallest w2\|\mathbf{w}\|_2 among all exact or least-squares solutions.


06 · Practice Exercises

EXERCISE 18.1

For diagonal AA, singular values are aii|a_{ii}|. U=IU = I, V=IV = I (up to sign flips in columns to make σi0\sigma_i \geq 0).

A=(3002)A = \begin{pmatrix}3&0\\0&-2\end{pmatrix}. ATA=(9004)A^TA = \begin{pmatrix}9&0\\0&4\end{pmatrix}.

σ1=3\sigma_1 = 3, σ2=2\sigma_2 = 2. v1=(10)\mathbf{v}_1 = \begin{pmatrix}1\\0\end{pmatrix}, v2=(01)\mathbf{v}_2 = \begin{pmatrix}0\\1\end{pmatrix}.

u1=Av1/σ1=(30)/3=(10)\mathbf{u}_1 = A\mathbf{v}_1/\sigma_1 = \begin{pmatrix}3\\0\end{pmatrix}/3 = \begin{pmatrix}1\\0\end{pmatrix}.

u2=Av2/σ2=(02)/2=(01)\mathbf{u}_2 = A\mathbf{v}_2/\sigma_2 = \begin{pmatrix}0\\-2\end{pmatrix}/2 = \begin{pmatrix}0\\-1\end{pmatrix}.

A=(1001)(3002)(1001)TA = \begin{pmatrix}1&0\\0&-1\end{pmatrix}\begin{pmatrix}3&0\\0&2\end{pmatrix}\begin{pmatrix}1&0\\0&1\end{pmatrix}^T ✓.

Find the SVD of A=(3002)A = \begin{pmatrix}3&0\\0&-2\end{pmatrix}. List σ1\sigma_1, σ2\sigma_2, and the columns of UU and VV.

EXERCISE 18.2

Compute ATAA^TA, find its eigenvalues, take square roots for σi\sigma_i. Eigenvectors of ATAA^TA give vi\mathbf{v}_i; then ui=Avi/σi\mathbf{u}_i = A\mathbf{v}_i/\sigma_i.

A=(1011)A = \begin{pmatrix}1&0\\1&1\end{pmatrix}. ATA=(2111)A^TA = \begin{pmatrix}2&1\\1&1\end{pmatrix}.

Characteristic: (2λ)(1λ)1=λ23λ+1=0(2-\lambda)(1-\lambda)-1 = \lambda^2-3\lambda+1 = 0. λ=3±52\lambda = \frac{3\pm\sqrt{5}}{2}.

σ1=3+521.618\sigma_1 = \sqrt{\frac{3+\sqrt{5}}{2}} \approx 1.618, σ2=3520.618\sigma_2 = \sqrt{\frac{3-\sqrt{5}}{2}} \approx 0.618.

For λ1\lambda_1: eigenvector v1(1(λ12))\mathbf{v}_1 \propto \begin{pmatrix}1\\(\lambda_1-2)\end{pmatrix} — normalise to unit length.

u1=Av1/σ1\mathbf{u}_1 = A\mathbf{v}_1/\sigma_1. (Full numeric values follow from the quadratic formula.)

Rank =2= 2 — both singular values are positive.

Compute the singular values of A=(1011)A = \begin{pmatrix}1&0\\1&1\end{pmatrix} by solving the eigenproblem for ATAA^TA. State whether AA is full rank.

EXERCISE 18.3

Rank-1 approximation: A1=σ1u1v1TA_1 = \sigma_1 \mathbf{u}_1\mathbf{v}_1^T. Frobenius error =σ2= \sigma_2.

From the chapter: A=(1221)A = \begin{pmatrix}1&2\\2&1\end{pmatrix}, σ1=3\sigma_1=3, σ2=1\sigma_2=1, u1=v1=12(11)\mathbf{u}_1 = \mathbf{v}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}.

A1=312(11)(11)=32(1111)A_1 = 3 \cdot \frac{1}{2}\begin{pmatrix}1\\1\end{pmatrix}\begin{pmatrix}1&1\end{pmatrix} = \frac{3}{2}\begin{pmatrix}1&1\\1&1\end{pmatrix}.

AA1F=σ2=1\|A - A_1\|_F = \sigma_2 = 1.

Variance captured: σ12/(σ12+σ22)=9/10=90%\sigma_1^2/(\sigma_1^2+\sigma_2^2) = 9/10 = 90\%.

For A=(1221)A = \begin{pmatrix}1&2\\2&1\end{pmatrix}, write the rank-1 approximation A1A_1. Compute AA1F\|A-A_1\|_F and the fraction of squared singular values retained.

EXERCISE 18.4

σ1=5\sigma_1=5, σ2=0\sigma_2=0. v1\mathbf{v}_1 spans column space; v2\mathbf{v}_2 spans null space. A+A^+ inverts only σ1\sigma_1.

A=(1224)A = \begin{pmatrix}1&2\\2&4\end{pmatrix}. ATA=(5101020)A^TA = \begin{pmatrix}5&10\\10&20\end{pmatrix} — eigenvalues 2525 and 00.

σ1=5\sigma_1 = 5, σ2=0\sigma_2 = 0. v1=15(12)\mathbf{v}_1 = \frac{1}{\sqrt{5}}\begin{pmatrix}1\\2\end{pmatrix}.

A+=125(1224)A^+ = \frac{1}{25}\begin{pmatrix}1&2\\2&4\end{pmatrix} (the pseudoinverse of a rank-1 matrix).

Null space: v2=15(21)\mathbf{v}_2 = \frac{1}{\sqrt{5}}\begin{pmatrix}-2\\1\end{pmatrix}. Av2=0A\mathbf{v}_2 = \mathbf{0} ✓.

Column space of AA = span of (12)\begin{pmatrix}1\\2\end{pmatrix}. Null space of AA = span of (21)\begin{pmatrix}-2\\1\end{pmatrix}. SVD separates both cleanly.

For A=(1224)A = \begin{pmatrix}1&2\\2&4\end{pmatrix} (rank 1): find σ1\sigma_1, σ2\sigma_2, identify v1\mathbf{v}_1 and v2\mathbf{v}_2, and describe the column space and null space.

EXERCISE 18.5

PCA of centred XX equals SVD of XX. Variance of PC ii is σi2/T\sigma_i^2/T. Fraction explained by top kk PCs: i=1kσi2/i=1pσi2\sum_{i=1}^k \sigma_i^2 / \sum_{i=1}^p \sigma_i^2.

X=(110011)X = \begin{pmatrix}1&-1\\0&0\\-1&1\end{pmatrix} (3 observations, 2 assets, already centred).

XTX=(2222)X^TX = \begin{pmatrix}2&-2\\-2&2\end{pmatrix}. Eigenvalues: 44 and 00. σ1=2\sigma_1 = 2, σ2=0\sigma_2 = 0.

One PC explains 100%100\% of variance — the assets move in perfect opposition (r=1r = -1).

v1=12(11)\mathbf{v}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix} — long asset 1, short asset 2.

Total variance =σ12+σ22=4= \sigma_1^2 + \sigma_2^2 = 4. PC1 fraction =4/4=100%= 4/4 = 100\%.

Centred data matrix X=(110011)X = \begin{pmatrix}1&-1\\0&0\\-1&1\end{pmatrix} (3 days, 2 assets). Find σ1\sigma_1, σ2\sigma_2 and the fraction of total variance explained by the first principal component.

EXERCISE 18.6

Use A=UΣVTA = U\Sigma V^T from the chapter example. A+=VΣ+UTA^+ = V\Sigma^+ U^T with Σ+=diag(1/3,1/1)\Sigma^+ = \text{diag}(1/3, 1/1). Compute x^=A+b\hat{\mathbf{x}} = A^+\mathbf{b}.

A=(1221)A = \begin{pmatrix}1&2\\2&1\end{pmatrix}, b=(33)\mathbf{b}=\begin{pmatrix}3\\3\end{pmatrix}.

Exact solution exists: x=(11)\mathbf{x} = \begin{pmatrix}1\\1\end{pmatrix} since A(11)=(33)A\begin{pmatrix}1\\1\end{pmatrix} = \begin{pmatrix}3\\3\end{pmatrix}.

Via SVD: A+=VΣ1UTA^+ = V\Sigma^{-1}U^T. With σ1=3\sigma_1=3, σ2=1\sigma_2=1:

A+b=VΣ1UTbA^+\mathbf{b} = V\Sigma^{-1}U^T\mathbf{b}.

UTb=12(1111)(33)=12(60)U^T\mathbf{b} = \frac{1}{\sqrt{2}}\begin{pmatrix}1&1\\-1&1\end{pmatrix}\begin{pmatrix}3\\3\end{pmatrix} = \frac{1}{\sqrt{2}}\begin{pmatrix}6\\0\end{pmatrix}.

Σ1UTb=(6/(32)0)=(20)\Sigma^{-1}U^T\mathbf{b} = \begin{pmatrix}6/(3\sqrt{2})\\0\end{pmatrix} = \begin{pmatrix}\sqrt{2}\\0\end{pmatrix}.

V(20)=22(11)=(11)V\begin{pmatrix}\sqrt{2}\\0\end{pmatrix} = \frac{\sqrt{2}}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix} = \begin{pmatrix}1\\1\end{pmatrix} ✓.

The pseudoinverse recovers the exact solution. For inconsistent b\mathbf{b}, it returns the minimum-norm least squares solution.

For A=(1221)A = \begin{pmatrix}1&2\\2&1\end{pmatrix} and b=(33)\mathbf{b}=\begin{pmatrix}3\\3\end{pmatrix}, compute x^=A+b\hat{\mathbf{x}} = A^+\mathbf{b} using the SVD from this chapter. Verify Ax^=bA\hat{\mathbf{x}}=\mathbf{b}.


07 · Summary

TermDefinition
SVDA=UΣVTA = U\Sigma V^T; exists for every m×nm \times n matrix
Singular value σi\sigma_iσi=λi(ATA)\sigma_i = \sqrt{\lambda_i(A^TA)}; σ10\sigma_1 \geq \cdots \geq 0
Left singular vector ui\mathbf{u}_iColumn of UU; ui=Avi/σi\mathbf{u}_i = A\mathbf{v}_i/\sigma_i
Right singular vector vi\mathbf{v}_iUnit eigenvector of ATAA^TA; column of VV
Operator normA2=σ1\|A\|_2 = \sigma_1
Rank-kk approximationAk=i=1kσiuiviTA_k = \sum_{i=1}^k \sigma_i \mathbf{u}_i\mathbf{v}_i^T; error =i>kσi2= \sqrt{\sum_{i>k}\sigma_i^2}
PCA connectionSVD of centred XX gives PC directions and variances σi2/T\sigma_i^2/T
Pseudoinverse A+A^+VΣ+UTV\Sigma^+ U^T; least squares with minimum norm

Next: Markov Chains & Steady States — stochastic matrices, stationary distributions as eigenvectors, and credit-rating transitions in quantitative risk.