Chapter 10

Inner Products & Orthogonality

00 · Symbol Glossary

$\langle \mathbf{u}, \mathbf{v} \rangle$angle brackets u v — inner product

The inner product of vectors u\mathbf{u} and v\mathbf{v}. Angle-bracket notation is standard for abstract inner products, distinguishing them from the specific dot product uv\mathbf{u}\cdot\mathbf{v}. The dot product is one particular inner product, but not the only one.

$\mathbf{u} \perp \mathbf{v}$u perp v — orthogonal

u\mathbf{u} and v\mathbf{v} are orthogonal — their inner product is zero: u,v=0\langle \mathbf{u}, \mathbf{v} \rangle = 0. Orthogonality is the precise definition of "perpendicular" in any inner product space, including high-dimensional spaces where geometric angles are not directly visible.

$W^\perp$W perp — orthogonal complement

The set of all vectors orthogonal to every vector in subspace WW: W={vv,w=0 for all wW}W^\perp = \{\mathbf{v} \mid \langle \mathbf{v}, \mathbf{w} \rangle = 0 \text{ for all } \mathbf{w} \in W\}. Together WW and WW^\perp partition Rn\mathbb{R}^n: every vector decomposes uniquely as v=w+w\mathbf{v} = \mathbf{w} + \mathbf{w}^\perp with wW\mathbf{w} \in W, wW\mathbf{w}^\perp \in W^\perp.

$\|\mathbf{v}\|_{A}$v norm A — A-weighted norm

The norm induced by a positive definite matrix AA: vA=vTAv\|\mathbf{v}\|_A = \sqrt{\mathbf{v}^T A \mathbf{v}}. In finance, A=Σ1A = \Sigma^{-1} (precision matrix) gives the Mahalanobis norm, which accounts for correlations and scales units.


01 · From Dot Product to Inner Product

The dot product uv=uivi\mathbf{u}\cdot\mathbf{v} = \sum u_i v_i works in Rn\mathbb{R}^n but fails to generalise to function spaces or weighted geometries. The inner product abstracts the properties that make the dot product useful — linearity, symmetry, positivity — into axioms satisfied by many different operations.

Definition — Inner Product

An inner product on a real vector space VV is a function ,:V×VR\langle \cdot, \cdot \rangle: V \times V \to \mathbb{R} satisfying:

IP1. Linearity: cu+w,v=cu,v+w,v\langle c\mathbf{u} + \mathbf{w}, \mathbf{v} \rangle = c\langle \mathbf{u}, \mathbf{v} \rangle + \langle \mathbf{w}, \mathbf{v} \rangle for all cRc \in \mathbb{R}.

IP2. Symmetry: u,v=v,u\langle \mathbf{u}, \mathbf{v} \rangle = \langle \mathbf{v}, \mathbf{u} \rangle for all u,v\mathbf{u}, \mathbf{v}.

IP3. Positive definiteness: v,v0\langle \mathbf{v}, \mathbf{v} \rangle \geq 0 for all v\mathbf{v}, with equality     v=0\iff \mathbf{v} = \mathbf{0}.

A vector space equipped with an inner product is an inner product space.

✓ Example — The Standard Dot Product

The dot product u,v=uTv=i=1nuivi\langle \mathbf{u}, \mathbf{v} \rangle = \mathbf{u}^T\mathbf{v} = \sum_{i=1}^n u_iv_i on Rn\mathbb{R}^n satisfies all three axioms:

IP1: cu+w,v=(cu+w)Tv=cuTv+wTv\langle c\mathbf{u}+\mathbf{w}, \mathbf{v} \rangle = (c\mathbf{u}+\mathbf{w})^T\mathbf{v} = c\mathbf{u}^T\mathbf{v} + \mathbf{w}^T\mathbf{v} — linear in first argument.

IP2: uTv=vTu\mathbf{u}^T\mathbf{v} = \mathbf{v}^T\mathbf{u} — symmetric (both equal uivi\sum u_iv_i).

IP3: vTv=vi20\mathbf{v}^T\mathbf{v} = \sum v_i^2 \geq 0, and equals zero only when all vi=0v_i=0 — positive definite.

✓ Example — Weighted Inner Product

For a positive definite matrix WRn×nW \in \mathbb{R}^{n\times n}, the weighted dot product:

u,vW=uTWv\langle \mathbf{u}, \mathbf{v} \rangle_W = \mathbf{u}^T W \mathbf{v}

is also a valid inner product. Taking W=Σ1W = \Sigma^{-1} (inverse of a covariance matrix) gives the Mahalanobis inner product — used in portfolio optimisation and anomaly detection to measure distances that account for correlations between assets.

❌ What Breaks — Not Every Bilinear Form Is an Inner Product

Consider f(u,v)=u1v1u2v2f(\mathbf{u},\mathbf{v}) = u_1v_1 - u_2v_2 on R2\mathbb{R}^2. This is bilinear and symmetric, but fails IP3: take v=(01)\mathbf{v}=\begin{pmatrix}0\\1\end{pmatrix}, then f(v,v)=01=1<0f(\mathbf{v},\mathbf{v}) = 0-1=-1 < 0. Negative value for a nonzero vector violates positive definiteness. This is the Minkowski metric from special relativity — a pseudo-inner-product, not an inner product.


02 · The Induced Norm and Cauchy-Schwarz

Every inner product induces a norm — a way to measure the "length" of a vector.

Definition — Induced Norm

Given an inner product ,\langle \cdot, \cdot \rangle, the induced norm is:

v=v,v\|\mathbf{v}\| = \sqrt{\langle \mathbf{v}, \mathbf{v} \rangle}

For the standard dot product, this recovers v=vi2\|\mathbf{v}\| = \sqrt{\sum v_i^2} — the Euclidean length from Chapter 1.

Definition — Cauchy-Schwarz Inequality

For any inner product space:

u,vuv|\langle \mathbf{u}, \mathbf{v} \rangle| \leq \|\mathbf{u}\|\,\|\mathbf{v}\|

Equality holds     u\iff \mathbf{u} and v\mathbf{v} are parallel (u=cv\mathbf{u} = c\mathbf{v} for some scalar cc). Cauchy-Schwarz guarantees that u,vuv[1,1]\frac{\langle \mathbf{u},\mathbf{v}\rangle}{\|\mathbf{u}\|\|\mathbf{v}\|} \in [-1, 1], so the cosine angle formula cosθ=u,vuv\cos\theta = \frac{\langle \mathbf{u},\mathbf{v}\rangle}{\|\mathbf{u}\|\|\mathbf{v}\|} is always valid.

Step-by-step — Verifying Cauchy-Schwarz for $\mathbf{u}=\begin{pmatrix}2\\-1\end{pmatrix}$, $\mathbf{v}=\begin{pmatrix}3\\4\end{pmatrix}$
1

Compute the inner product: u,v=2(3)+(1)(4)=64=2\langle \mathbf{u},\mathbf{v}\rangle = 2(3)+(-1)(4) = 6-4 = 2.

2

Compute the norms: u=4+1=5\|\mathbf{u}\| = \sqrt{4+1} = \sqrt{5}. v=9+16=25=5\|\mathbf{v}\| = \sqrt{9+16} = \sqrt{25} = 5.

uv=5511.18\|\mathbf{u}\|\|\mathbf{v}\| = 5\sqrt{5} \approx 11.18.

3

Check the inequality: u,v=25511.18|\langle\mathbf{u},\mathbf{v}\rangle| = 2 \leq 5\sqrt{5} \approx 11.18 ✓. The inequality holds with room to spare — the vectors are not parallel. Cosine of angle: cosθ=2550.179\cos\theta = \frac{2}{5\sqrt{5}} \approx 0.179, so θ79.7°\theta \approx 79.7°.


03 · Orthogonality

Orthogonality — zero inner product — is the most important structural property in linear algebra. Orthogonal vectors contain no "shared direction," which makes computations separable.

Definition — Orthogonal Vectors

Vectors u,v\mathbf{u}, \mathbf{v} in an inner product space are orthogonal if:

u,v=0\langle \mathbf{u}, \mathbf{v} \rangle = 0

Written uv\mathbf{u} \perp \mathbf{v}.

Pythagorean theorem: if uv\mathbf{u} \perp \mathbf{v}, then u+v2=u2+v2\|\mathbf{u}+\mathbf{v}\|^2 = \|\mathbf{u}\|^2 + \|\mathbf{v}\|^2.

Proof: u+v2=u+v,u+v=u2+2u,v+v2=u2+v2\|\mathbf{u}+\mathbf{v}\|^2 = \langle\mathbf{u}+\mathbf{v},\mathbf{u}+\mathbf{v}\rangle = \|\mathbf{u}\|^2 + 2\langle\mathbf{u},\mathbf{v}\rangle + \|\mathbf{v}\|^2 = \|\mathbf{u}\|^2 + \|\mathbf{v}\|^2 since u,v=0\langle\mathbf{u},\mathbf{v}\rangle = 0.

Definition — Orthogonal Set and Orthonormal Set

A set {v1,,vk}\{\mathbf{v}_1, \ldots, \mathbf{v}_k\} is:

Orthogonal if vi,vj=0\langle \mathbf{v}_i, \mathbf{v}_j \rangle = 0 for all iji \neq j.

Orthonormal if additionally vi=1\|\mathbf{v}_i\| = 1 for all ii:

vi,vj=δij={1i=j0ij\langle \mathbf{v}_i, \mathbf{v}_j \rangle = \delta_{ij} = \begin{cases}1 & i=j \\ 0 & i\neq j\end{cases}

δij\delta_{ij} is the Kronecker delta — 1 when indices match, 0 otherwise.

✓ Example — Standard Basis Is Orthonormal

In R3\mathbb{R}^3, the standard basis e1=(100)\mathbf{e}_1=\begin{pmatrix}1\\0\\0\end{pmatrix}, e2=(010)\mathbf{e}_2=\begin{pmatrix}0\\1\\0\end{pmatrix}, e3=(001)\mathbf{e}_3=\begin{pmatrix}0\\0\\1\end{pmatrix} satisfies eiej=δij\mathbf{e}_i\cdot\mathbf{e}_j = \delta_{ij}. Any two distinct standard basis vectors are orthogonal (dot product = 0); each has norm 1. It is the prototype orthonormal basis.

Orthogonal Sets Are Automatically Independent

Every orthogonal set of nonzero vectors is linearly independent. Suppose c1v1++ckvk=0c_1\mathbf{v}_1 + \cdots + c_k\mathbf{v}_k = \mathbf{0}. Take the inner product with vj\mathbf{v}_j: all cross terms vanish by orthogonality, leaving cjvj2=0c_j\|\mathbf{v}_j\|^2 = 0. Since vj0\|\mathbf{v}_j\| \neq 0, we get cj=0c_j = 0 for each jj. This is why orthonormal bases are so powerful — independence is free.


04 · Orthogonal Complement

Definition — Orthogonal Complement

The orthogonal complement of a subspace WRnW \subseteq \mathbb{R}^n is:

W={vRnv,w=0 for all wW}W^\perp = \{ \mathbf{v} \in \mathbb{R}^n \mid \langle \mathbf{v}, \mathbf{w} \rangle = 0 \text{ for all } \mathbf{w} \in W \}

Key facts:

  • WW^\perp is itself a subspace.
  • dim(W)+dim(W)=n\dim(W) + \dim(W^\perp) = n.
  • Every vRn\mathbf{v} \in \mathbb{R}^n decomposes uniquely as v=w+w\mathbf{v} = \mathbf{w} + \mathbf{w}^\perp with wW\mathbf{w} \in W, wW\mathbf{w}^\perp \in W^\perp.
  • (W)=W(W^\perp)^\perp = W.
Step-by-step — Finding $W^\perp$ where $W = \text{span}\left\{\begin{pmatrix}1\\2\\3\end{pmatrix}\right\}$ in $\mathbb{R}^3$
1

Set up the orthogonality condition: v=(xyz)W\mathbf{v} = \begin{pmatrix}x\\y\\z\end{pmatrix} \in W^\perp requires v(123)=0\mathbf{v} \cdot \begin{pmatrix}1\\2\\3\end{pmatrix} = 0.

x+2y+3z=0x + 2y + 3z = 0.

2

Solve the linear equation: one equation, three unknowns — a two-dimensional solution space. Set y=sy=s, z=tz=t (free variables). Then x=2s3tx = -2s - 3t.

General solution: v=s(210)+t(301)\mathbf{v} = s\begin{pmatrix}-2\\1\\0\end{pmatrix} + t\begin{pmatrix}-3\\0\\1\end{pmatrix}.

3

Identify the orthogonal complement:

W=span{(210), (301)}W^\perp = \text{span}\left\{\begin{pmatrix}-2\\1\\0\end{pmatrix},\ \begin{pmatrix}-3\\0\\1\end{pmatrix}\right\}

dim(W)=1\dim(W)=1 and dim(W)=2\dim(W^\perp)=2, summing to n=3n=3 ✓. WW is a line through the origin; WW^\perp is the plane perpendicular to that line.

❌ What Breaks — The Complement of a Complement

A common error: "the complement of the xyxy-plane in R3\mathbb{R}^3 is the zz-axis, so the complement of the zz-axis is not the xyxy-plane." In fact (W)=W(W^\perp)^\perp = W exactly — complements are involutive. The error usually arises from confusing the orthogonal complement with a geometric "remainder" in the wrong sense. The identity dim(W)+dim(W)=n\dim(W)+\dim(W^\perp)=n with (W)=W(W^\perp)^\perp=W is exact.


05 · Orthonormal Bases and the Expansion Formula

When a basis is orthonormal, coordinates are computed by inner products — no matrix inversion required.

Definition — Orthonormal Basis Expansion

If {q1,,qn}\{\mathbf{q}_1, \ldots, \mathbf{q}_n\} is an orthonormal basis for Rn\mathbb{R}^n, then every vRn\mathbf{v} \in \mathbb{R}^n has expansion:

v=v,q1q1+v,q2q2++v,qnqn=i=1nv,qiqi\mathbf{v} = \langle \mathbf{v}, \mathbf{q}_1 \rangle \mathbf{q}_1 + \langle \mathbf{v}, \mathbf{q}_2 \rangle \mathbf{q}_2 + \cdots + \langle \mathbf{v}, \mathbf{q}_n \rangle \mathbf{q}_n = \sum_{i=1}^n \langle \mathbf{v}, \mathbf{q}_i \rangle \mathbf{q}_i

The coefficient of qi\mathbf{q}_i is simply v,qi\langle \mathbf{v}, \mathbf{q}_i \rangle — computed by a single inner product, no linear system needed. This is why orthonormal bases are computationally ideal.

If QQ is the matrix with columns q1,,qn\mathbf{q}_1, \ldots, \mathbf{q}_n, then QTQ=InQ^TQ = I_n (orthonormality) and QQT=InQQ^T = I_n (since QQ is square and full rank). A square matrix with orthonormal columns is called orthogonal and satisfies Q1=QTQ^{-1} = Q^T — inversion is free.


06 · Quant Application — Orthogonal Risk Factors in PCA

In portfolio management, the covariance matrix Σ\Sigma mixes all assets' risks together. PCA changes to the eigenvector basis, producing orthogonal principal components (PCs).

Given Σ=QΛQT\Sigma = Q\Lambda Q^T with Λ=diag(λ1,,λp)\Lambda = \text{diag}(\lambda_1, \ldots, \lambda_p), the PC scores of a return vector r\mathbf{r} are f=QTr\mathbf{f} = Q^T\mathbf{r}. The covariance of PC scores:

Cov(f)=QTΣQ=Λ\text{Cov}(\mathbf{f}) = Q^T \Sigma Q = \Lambda

is diagonal. The PCs are orthogonal risk factors — knowing the value of one PC tells you nothing about the others (zero covariance). This makes risk attribution clean: the variance explained by PCk_k is exactly λk\lambda_k, and the total variance is tr(Σ)=λk\text{tr}(\Sigma) = \sum \lambda_k.

In practice, keeping only the top kk PCs (those with the largest λi\lambda_i) explains most variance with far fewer dimensions — dimensionality reduction without losing much information.


07 · Practice Exercises

EXERCISE 10.1

Compute u,vW=uTWv\langle \mathbf{u},\mathbf{v}\rangle_W = \mathbf{u}^T W \mathbf{v} by first multiplying WvW\mathbf{v}, then dotting with u\mathbf{u}.

Wv=(2113)(21)=(55)W\mathbf{v} = \begin{pmatrix}2&1\\1&3\end{pmatrix}\begin{pmatrix}2\\1\end{pmatrix} = \begin{pmatrix}5\\5\end{pmatrix}.

u,vW=uT(Wv)=(12)(55)=5+10=15\langle\mathbf{u},\mathbf{v}\rangle_W = \mathbf{u}^T(W\mathbf{v}) = \begin{pmatrix}1&2\end{pmatrix}\begin{pmatrix}5\\5\end{pmatrix} = 5+10 = 15.

Verify symmetry: Wu=(2113)(12)=(47)W\mathbf{u} = \begin{pmatrix}2&1\\1&3\end{pmatrix}\begin{pmatrix}1\\2\end{pmatrix} = \begin{pmatrix}4\\7\end{pmatrix}. v,uW=vT(Wu)=(21)(47)=8+7=15\langle\mathbf{v},\mathbf{u}\rangle_W = \mathbf{v}^T(W\mathbf{u}) = \begin{pmatrix}2&1\end{pmatrix}\begin{pmatrix}4\\7\end{pmatrix} = 8+7=15 ✓.

Let W=(2113)W = \begin{pmatrix}2&1\\1&3\end{pmatrix}, u=(12)\mathbf{u}=\begin{pmatrix}1\\2\end{pmatrix}, v=(21)\mathbf{v}=\begin{pmatrix}2\\1\end{pmatrix}. Compute u,vW=uTWv\langle\mathbf{u},\mathbf{v}\rangle_W = \mathbf{u}^TW\mathbf{v} and verify it equals v,uW\langle\mathbf{v},\mathbf{u}\rangle_W.

EXERCISE 10.2

Verify IP1 (linearity), IP2 (symmetry), and IP3 (positive definiteness) for f,g=01f(x)g(x)dx\langle f, g\rangle = \int_0^1 f(x)g(x)\,dx. For IP3, use the fact that f2(x)0f^2(x) \geq 0.

IP1 (Linearity): cf+h,g=01(cf+h)gdx=c01fgdx+01hgdx=cf,g+h,g\langle cf+h, g\rangle = \int_0^1(cf+h)g\,dx = c\int_0^1 fg\,dx + \int_0^1 hg\,dx = c\langle f,g\rangle + \langle h,g\rangle ✓ (by linearity of integration).

IP2 (Symmetry): f,g=01f(x)g(x)dx=01g(x)f(x)dx=g,f\langle f,g\rangle = \int_0^1 f(x)g(x)\,dx = \int_0^1 g(x)f(x)\,dx = \langle g,f\rangle ✓ (multiplication of reals is commutative).

IP3 (Positive definiteness): f,f=01f(x)2dx0\langle f,f\rangle = \int_0^1 f(x)^2\,dx \geq 0 since f(x)20f(x)^2 \geq 0 for all xx. Equality holds     f(x)=0\iff f(x)=0 for all x[0,1]x \in [0,1], i.e. ff is the zero function ✓.

Therefore f,g=01fgdx\langle f,g\rangle = \int_0^1 fg\,dx is an inner product on C[0,1]C[0,1].

Verify that f,g=01f(x)g(x)dx\langle f, g\rangle = \int_0^1 f(x)g(x)\,dx defines an inner product on the space C[0,1]C[0,1] of continuous functions on [0,1][0,1] by checking all three axioms IP1–IP3.

EXERCISE 10.3

Find WW^\perp by solving vw1=0\mathbf{v}\cdot\mathbf{w}_1 = 0 and vw2=0\mathbf{v}\cdot\mathbf{w}_2 = 0 simultaneously. The solution space is WW^\perp.

W=span{(101),(011)}W = \text{span}\left\{\begin{pmatrix}1\\0\\1\end{pmatrix},\begin{pmatrix}0\\1\\1\end{pmatrix}\right\}. Need v=(xyz)\mathbf{v}=\begin{pmatrix}x\\y\\z\end{pmatrix} satisfying:

Eq 1: x+z=0z=xx+z=0 \Rightarrow z=-x.

Eq 2: y+z=0y=z=xy+z=0 \Rightarrow y=-z=x.

Free variable: x=tx=t. Then y=ty=t, z=tz=-t.

W=span{(111)}W^\perp = \text{span}\left\{\begin{pmatrix}1\\1\\-1\end{pmatrix}\right\}

Check dimensions: dim(W)=2\dim(W)=2, dim(W)=1\dim(W^\perp)=1, sum =3=n=3=n ✓.

Verify: (111)(101)=1+01=0\begin{pmatrix}1\\1\\-1\end{pmatrix}\cdot\begin{pmatrix}1\\0\\1\end{pmatrix}=1+0-1=0 ✓ and (111)(011)=0+11=0\begin{pmatrix}1\\1\\-1\end{pmatrix}\cdot\begin{pmatrix}0\\1\\1\end{pmatrix}=0+1-1=0 ✓.

Find WW^\perp where W=span{(101),(011)}R3W = \text{span}\left\{\begin{pmatrix}1\\0\\1\end{pmatrix}, \begin{pmatrix}0\\1\\1\end{pmatrix}\right\} \subseteq \mathbb{R}^3. Verify using the dimension formula dim(W)+dim(W)=3\dim(W)+\dim(W^\perp)=3.

EXERCISE 10.4

{q1,q2}\{\mathbf{q}_1,\mathbf{q}_2\} is orthonormal iff q1q2=0\mathbf{q}_1\cdot\mathbf{q}_2=0 and q1=q2=1\|\mathbf{q}_1\|=\|\mathbf{q}_2\|=1. Once orthonormal, the expansion coefficients are just dot products.

q1=12(11)\mathbf{q}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}, q2=12(11)\mathbf{q}_2 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix}.

Orthogonality: q1q2=12(11+1(1))=12(0)=0\mathbf{q}_1\cdot\mathbf{q}_2 = \frac{1}{2}(1\cdot1 + 1\cdot(-1)) = \frac{1}{2}(0)=0 ✓.

Unit norms: q12=12(1+1)=1\|\mathbf{q}_1\|^2 = \frac{1}{2}(1+1)=1 ✓. q22=12(1+1)=1\|\mathbf{q}_2\|^2 = \frac{1}{2}(1+1)=1 ✓.

Expansion of v=(31)\mathbf{v}=\begin{pmatrix}3\\1\end{pmatrix}:

c1=v,q1=12(3+1)=42=22c_1 = \langle\mathbf{v},\mathbf{q}_1\rangle = \frac{1}{\sqrt{2}}(3+1) = \frac{4}{\sqrt{2}} = 2\sqrt{2}.

c2=v,q2=12(31)=22=2c_2 = \langle\mathbf{v},\mathbf{q}_2\rangle = \frac{1}{\sqrt{2}}(3-1) = \frac{2}{\sqrt{2}} = \sqrt{2}.

Verify: c1q1+c2q2=2212(11)+212(11)=(22)+(11)=(31)c_1\mathbf{q}_1 + c_2\mathbf{q}_2 = 2\sqrt{2}\cdot\frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix} + \sqrt{2}\cdot\frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix} = \begin{pmatrix}2\\2\end{pmatrix}+\begin{pmatrix}1\\-1\end{pmatrix}=\begin{pmatrix}3\\1\end{pmatrix} ✓.

Let q1=12(11)\mathbf{q}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}, q2=12(11)\mathbf{q}_2 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix}. Verify this is an orthonormal basis of R2\mathbb{R}^2, then expand v=(31)\mathbf{v}=\begin{pmatrix}3\\1\end{pmatrix} as v=c1q1+c2q2\mathbf{v}=c_1\mathbf{q}_1+c_2\mathbf{q}_2 using the inner product formula for the coefficients.

EXERCISE 10.5

The matrix QQ has orthonormal columns iff QTQ=IQ^TQ=I. Check this condition. Then verify Q1=QTQ^{-1}=Q^T by computing both products.

Q=12(1111)Q = \frac{1}{\sqrt{2}}\begin{pmatrix}1&1\\1&-1\end{pmatrix}.

QTQ=12(1111)(1111)=12(1+111111+1)=12(2002)=I2Q^TQ = \frac{1}{2}\begin{pmatrix}1&1\\1&-1\end{pmatrix}\begin{pmatrix}1&1\\1&-1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}1+1&1-1\\1-1&1+1\end{pmatrix} = \frac{1}{2}\begin{pmatrix}2&0\\0&2\end{pmatrix} = I_2 ✓.

QQT=12(1111)(1111)=I2QQ^T = \frac{1}{2}\begin{pmatrix}1&1\\1&-1\end{pmatrix}\begin{pmatrix}1&1\\1&-1\end{pmatrix} = I_2 ✓ (same calculation by symmetry).

Therefore Q1=QTQ^{-1} = Q^T — inversion is just transposition. This makes orthogonal matrices computationally valuable: no Gaussian elimination needed to invert.

det(Q)=1212det(1111)=12(11)=1\det(Q) = \frac{1}{\sqrt{2}}\cdot\frac{1}{\sqrt{2}}\cdot\det\begin{pmatrix}1&1\\1&-1\end{pmatrix} = \frac{1}{2}(-1-1) = -1. det(Q)=1|\det(Q)|=1 — orthogonal matrices preserve volumes.

Let Q=12(1111)Q = \frac{1}{\sqrt{2}}\begin{pmatrix}1&1\\1&-1\end{pmatrix}. Verify QTQ=IQ^TQ = I and QQT=IQQ^T = I, confirming QQ is an orthogonal matrix with Q1=QTQ^{-1}=Q^T. Also compute det(Q)\det(Q) and state what it means geometrically.

EXERCISE 10.6

In PC coordinates, the covariance is diagonal: Cov(f)=Λ\text{Cov}(\mathbf{f})=\Lambda. The fraction of variance in PCk_k is λk/iλi\lambda_k / \sum_i \lambda_i. Reconstruct the original returns approximately using only the top PCs.

Σ=(5445)\Sigma = \begin{pmatrix}5&4\\4&5\end{pmatrix}.

Eigenvalues: p(λ)=(5λ)216=λ210λ+9=(λ9)(λ1)p(\lambda)=(5-\lambda)^2-16=\lambda^2-10\lambda+9=(\lambda-9)(\lambda-1). So λ1=9\lambda_1=9, λ2=1\lambda_2=1.

Total variance: tr(Σ)=10\text{tr}(\Sigma) = 10.

PC1 explains 910=90%\frac{9}{10}=90\% of variance; PC2 explains 110=10%\frac{1}{10}=10\%.

Eigenvectors: For λ1=9\lambda_1=9: (Σ9I)=(4444)(Σ-9I)=\begin{pmatrix}-4&4\\4&-4\end{pmatrix}; q1=12(11)\mathbf{q}_1=\frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix}.

For λ2=1\lambda_2=1: q2=12(11)\mathbf{q}_2=\frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix} (orthogonal to q1\mathbf{q}_1).

PC1 direction q1=12(11)\mathbf{q}_1 = \frac{1}{\sqrt{2}}\begin{pmatrix}1\\1\end{pmatrix} corresponds to the equally-weighted portfolio — the "market" factor. PC2 direction q2=12(11)\mathbf{q}_2=\frac{1}{\sqrt{2}}\begin{pmatrix}1\\-1\end{pmatrix} is long asset 1, short asset 2 — a "spread" factor. 90% of the portfolio's variance is in the market direction; only 10% is in the spread.

A two-asset portfolio has covariance matrix Σ=(5445)\Sigma = \begin{pmatrix}5&4\\4&5\end{pmatrix}. Find the eigenvalues of Σ\Sigma, interpret each as a fraction of total variance, and identify which eigenvector direction corresponds to the "market" factor and which to the "spread" factor.


08 · Chapter Summary

ConceptKey Fact
Inner productBilinear, symmetric, positive definite — generalises dot product
Standard dot productu,v=uTv\langle\mathbf{u},\mathbf{v}\rangle = \mathbf{u}^T\mathbf{v} — prototype inner product on Rn\mathbb{R}^n
Weighted inner productu,vW=uTWv\langle\mathbf{u},\mathbf{v}\rangle_W = \mathbf{u}^TWv; WW positive definite
Induced normv=v,v\|\mathbf{v}\|=\sqrt{\langle\mathbf{v},\mathbf{v}\rangle}
Cauchy-Schwarzu,vuv\lvert\langle\mathbf{u},\mathbf{v}\rangle\rvert\leq\|\mathbf{u}\|\|\mathbf{v}\|; equality iff parallel
Orthogonal vectorsu,v=0\langle\mathbf{u},\mathbf{v}\rangle=0; Pythagorean theorem holds
Orthonormal setAll pairwise inner products zero, all norms one; vi,vj=δij\langle\mathbf{v}_i,\mathbf{v}_j\rangle=\delta_{ij}
Orthogonal complementWW^\perp; dim(W)+dim(W)=n\dim(W)+\dim(W^\perp)=n
Orthonormal basis expansionv=iv,qiqi\mathbf{v}=\sum_i\langle\mathbf{v},\mathbf{q}_i\rangle\mathbf{q}_i — no linear system needed
Orthogonal matrixQTQ=IQ^TQ=I; Q1=QTQ^{-1}=Q^T; detQ=1\lvert\det Q\rvert=1
PCAΣ=QΛQT\Sigma=Q\Lambda Q^T; PC scores have diagonal covariance Λ\Lambda

Next: Chapter 11 — Gram-Schmidt Process shows how to construct an orthonormal basis from any linearly independent set, the computational foundation behind QR decomposition and numerical stability in least squares.