$\langle \mathbf{u}, \mathbf{v} \rangle$angle brackets u v — inner product
The inner product of vectors u and v. Angle-bracket notation is standard for abstract inner products, distinguishing them from the specific dot product u⋅v. The dot product is one particular inner product, but not the only one.
$\mathbf{u} \perp \mathbf{v}$u perp v — orthogonal
u and v are orthogonal — their inner product is zero: ⟨u,v⟩=0. Orthogonality is the precise definition of "perpendicular" in any inner product space, including high-dimensional spaces where geometric angles are not directly visible.
$W^\perp$W perp — orthogonal complement
The set of all vectors orthogonal to every vector in subspace W: W⊥={v∣⟨v,w⟩=0 for all w∈W}. Together W and W⊥ partition Rn: every vector decomposes uniquely as v=w+w⊥ with w∈W, w⊥∈W⊥.
$\|\mathbf{v}\|_{A}$v norm A — A-weighted norm
The norm induced by a positive definite matrix A: ∥v∥A=vTAv. In finance, A=Σ−1 (precision matrix) gives the Mahalanobis norm, which accounts for correlations and scales units.
01 · From Dot Product to Inner Product
The dot product u⋅v=∑uivi works in Rn but fails to generalise to function spaces or weighted geometries. The inner product abstracts the properties that make the dot product useful — linearity, symmetry, positivity — into axioms satisfied by many different operations.
Definition — Inner Product
An inner product on a real vector space V is a function ⟨⋅,⋅⟩:V×V→R satisfying:
IP1. Linearity:⟨cu+w,v⟩=c⟨u,v⟩+⟨w,v⟩ for all c∈R.
IP2. Symmetry:⟨u,v⟩=⟨v,u⟩ for all u,v.
IP3. Positive definiteness:⟨v,v⟩≥0 for all v, with equality ⟺v=0.
A vector space equipped with an inner product is an inner product space.
✓ Example — The Standard Dot Product
The dot product ⟨u,v⟩=uTv=∑i=1nuivi on Rn satisfies all three axioms:
IP1: ⟨cu+w,v⟩=(cu+w)Tv=cuTv+wTv — linear in first argument.
IP2: uTv=vTu — symmetric (both equal ∑uivi).
IP3: vTv=∑vi2≥0, and equals zero only when all vi=0 — positive definite.
✓ Example — Weighted Inner Product
For a positive definite matrix W∈Rn×n, the weighted dot product:
⟨u,v⟩W=uTWv
is also a valid inner product. Taking W=Σ−1 (inverse of a covariance matrix) gives the Mahalanobis inner product — used in portfolio optimisation and anomaly detection to measure distances that account for correlations between assets.
❌ What Breaks — Not Every Bilinear Form Is an Inner Product
Consider f(u,v)=u1v1−u2v2 on R2. This is bilinear and symmetric, but fails IP3: take v=(01), then f(v,v)=0−1=−1<0. Negative value for a nonzero vector violates positive definiteness. This is the Minkowski metric from special relativity — a pseudo-inner-product, not an inner product.
02 · The Induced Norm and Cauchy-Schwarz
Every inner product induces a norm — a way to measure the "length" of a vector.
Definition — Induced Norm
Given an inner product ⟨⋅,⋅⟩, the induced norm is:
∥v∥=⟨v,v⟩
For the standard dot product, this recovers ∥v∥=∑vi2 — the Euclidean length from Chapter 1.
Definition — Cauchy-Schwarz Inequality
For any inner product space:
∣⟨u,v⟩∣≤∥u∥∥v∥
Equality holds ⟺u and v are parallel (u=cv for some scalar c). Cauchy-Schwarz guarantees that ∥u∥∥v∥⟨u,v⟩∈[−1,1], so the cosine angle formula cosθ=∥u∥∥v∥⟨u,v⟩ is always valid.
Step-by-step — Verifying Cauchy-Schwarz for $\mathbf{u}=\begin{pmatrix}2\\-1\end{pmatrix}$, $\mathbf{v}=\begin{pmatrix}3\\4\end{pmatrix}$
1
Compute the inner product:⟨u,v⟩=2(3)+(−1)(4)=6−4=2.
2
Compute the norms:∥u∥=4+1=5. ∥v∥=9+16=25=5.
∥u∥∥v∥=55≈11.18.
3
Check the inequality:∣⟨u,v⟩∣=2≤55≈11.18 ✓. The inequality holds with room to spare — the vectors are not parallel. Cosine of angle: cosθ=552≈0.179, so θ≈79.7°.
03 · Orthogonality
Orthogonality — zero inner product — is the most important structural property in linear algebra. Orthogonal vectors contain no "shared direction," which makes computations separable.
Definition — Orthogonal Vectors
Vectors u,v in an inner product space are orthogonal if:
⟨u,v⟩=0
Written u⊥v.
Pythagorean theorem: if u⊥v, then ∥u+v∥2=∥u∥2+∥v∥2.
Proof: ∥u+v∥2=⟨u+v,u+v⟩=∥u∥2+2⟨u,v⟩+∥v∥2=∥u∥2+∥v∥2 since ⟨u,v⟩=0.
Definition — Orthogonal Set and Orthonormal Set
A set {v1,…,vk} is:
Orthogonal if ⟨vi,vj⟩=0 for all i=j.
Orthonormal if additionally ∥vi∥=1 for all i:
⟨vi,vj⟩=δij={10i=ji=j
δij is the Kronecker delta — 1 when indices match, 0 otherwise.
✓ Example — Standard Basis Is Orthonormal
In R3, the standard basis e1=100, e2=010, e3=001 satisfies ei⋅ej=δij. Any two distinct standard basis vectors are orthogonal (dot product = 0); each has norm 1. It is the prototype orthonormal basis.
Orthogonal Sets Are Automatically Independent
Every orthogonal set of nonzero vectors is linearly independent. Suppose c1v1+⋯+ckvk=0. Take the inner product with vj: all cross terms vanish by orthogonality, leaving cj∥vj∥2=0. Since ∥vj∥=0, we get cj=0 for each j. This is why orthonormal bases are so powerful — independence is free.
04 · Orthogonal Complement
Definition — Orthogonal Complement
The orthogonal complement of a subspace W⊆Rn is:
W⊥={v∈Rn∣⟨v,w⟩=0 for all w∈W}
Key facts:
W⊥ is itself a subspace.
dim(W)+dim(W⊥)=n.
Every v∈Rn decomposes uniquely as v=w+w⊥ with w∈W, w⊥∈W⊥.
(W⊥)⊥=W.
Step-by-step — Finding $W^\perp$ where $W = \text{span}\left\{\begin{pmatrix}1\\2\\3\end{pmatrix}\right\}$ in $\mathbb{R}^3$
1
Set up the orthogonality condition:v=xyz∈W⊥ requires v⋅123=0.
x+2y+3z=0.
2
Solve the linear equation: one equation, three unknowns — a two-dimensional solution space. Set y=s, z=t (free variables). Then x=−2s−3t.
General solution: v=s−210+t−301.
3
Identify the orthogonal complement:
W⊥=span⎩⎨⎧−210,−301⎭⎬⎫
dim(W)=1 and dim(W⊥)=2, summing to n=3 ✓. W is a line through the origin; W⊥ is the plane perpendicular to that line.
❌ What Breaks — The Complement of a Complement
A common error: "the complement of the xy-plane in R3 is the z-axis, so the complement of the z-axis is not the xy-plane." In fact (W⊥)⊥=W exactly — complements are involutive. The error usually arises from confusing the orthogonal complement with a geometric "remainder" in the wrong sense. The identity dim(W)+dim(W⊥)=n with (W⊥)⊥=W is exact.
05 · Orthonormal Bases and the Expansion Formula
When a basis is orthonormal, coordinates are computed by inner products — no matrix inversion required.
Definition — Orthonormal Basis Expansion
If {q1,…,qn} is an orthonormal basis for Rn, then every v∈Rn has expansion:
The coefficient of qi is simply ⟨v,qi⟩ — computed by a single inner product, no linear system needed. This is why orthonormal bases are computationally ideal.
If Q is the matrix with columns q1,…,qn, then QTQ=In (orthonormality) and QQT=In (since Q is square and full rank). A square matrix with orthonormal columns is called orthogonal and satisfies Q−1=QT — inversion is free.
06 · Quant Application — Orthogonal Risk Factors in PCA
In portfolio management, the covariance matrix Σ mixes all assets' risks together. PCA changes to the eigenvector basis, producing orthogonal principal components (PCs).
Given Σ=QΛQT with Λ=diag(λ1,…,λp), the PC scores of a return vector r are f=QTr. The covariance of PC scores:
Cov(f)=QTΣQ=Λ
is diagonal. The PCs are orthogonal risk factors — knowing the value of one PC tells you nothing about the others (zero covariance). This makes risk attribution clean: the variance explained by PCk is exactly λk, and the total variance is tr(Σ)=∑λk.
In practice, keeping only the top k PCs (those with the largest λi) explains most variance with far fewer dimensions — dimensionality reduction without losing much information.
07 · Practice Exercises
EXERCISE 10.1
Compute ⟨u,v⟩W=uTWv by first multiplying Wv, then dotting with u.
Let q1=21(11), q2=21(1−1). Verify this is an orthonormal basis of R2, then expand v=(31) as v=c1q1+c2q2 using the inner product formula for the coefficients.
EXERCISE 10.5
The matrix Q has orthonormal columns iff QTQ=I. Check this condition. Then verify Q−1=QT by computing both products.
Let Q=21(111−1). Verify QTQ=I and QQT=I, confirming Q is an orthogonal matrix with Q−1=QT. Also compute det(Q) and state what it means geometrically.
EXERCISE 10.6
In PC coordinates, the covariance is diagonal: Cov(f)=Λ. The fraction of variance in PCk is λk/∑iλi. Reconstruct the original returns approximately using only the top PCs.
Σ=(5445).
Eigenvalues: p(λ)=(5−λ)2−16=λ2−10λ+9=(λ−9)(λ−1). So λ1=9, λ2=1.
Total variance: tr(Σ)=10.
PC1 explains 109=90% of variance; PC2 explains 101=10%.
Eigenvectors: For λ1=9: (Σ−9I)=(−444−4); q1=21(11).
For λ2=1: q2=21(1−1) (orthogonal to q1).
PC1 direction q1=21(11) corresponds to the equally-weighted portfolio — the "market" factor. PC2 direction q2=21(1−1) is long asset 1, short asset 2 — a "spread" factor. 90% of the portfolio's variance is in the market direction; only 10% is in the spread.
A two-asset portfolio has covariance matrix Σ=(5445). Find the eigenvalues of Σ, interpret each as a fraction of total variance, and identify which eigenvector direction corresponds to the "market" factor and which to the "spread" factor.
All pairwise inner products zero, all norms one; ⟨vi,vj⟩=δij
Orthogonal complement
W⊥; dim(W)+dim(W⊥)=n
Orthonormal basis expansion
v=∑i⟨v,qi⟩qi — no linear system needed
Orthogonal matrix
QTQ=I; Q−1=QT; ∣detQ∣=1
PCA
Σ=QΛQT; PC scores have diagonal covariance Λ
Next: Chapter 11 — Gram-Schmidt Process shows how to construct an orthonormal basis from any linearly independent set, the computational foundation behind QR decomposition and numerical stability in least squares.