Linear Algebra: Eigendecomposition, Transforms, and Probability Foundations
Eigendecomposition diagonalises a matrix by expressing it in terms of its eigenvectors and eigenvalues. Combined with probability foundations (expectation, variance, covariance), these form the mathematical core of most ML algorithms.
Authored By: Sthitadhee October 14, 2024 · Updated October 14, 2024 2 min read · 484 words · high confidence
where D=VTAV is a diagonal matrix of eigenvalues and V is the matrix of eigenvectors as columns.
A is similar to D: A=M−1BM (similarity transform). Two matrices are similar if they represent the same linear transformation in different bases.
Conditions for diagonalisability:
A is similar to D.
The algebraic multiplicity (AM) of each eigenvalue λ equals its geometric multiplicity (GM).
∑GM(λi)=n (the sum of geometric multiplicities equals the matrix dimension).
Positive semidefinite:⟨Ax,x⟩≥0⇔λi≥0.
Positive definite:⟨Ax,x⟩>0⇔λi>0.
For symmetric matrices A∈Rn×n: eigenvalues are real-valued, eigenvectors form an orthonormal basis, A+AT≥AAT, and A is positive semidefinite if rank(A)=n.
These relationships appear throughout ML: in Bayesian inference (Bayes’ theorem is the conditional formula), in graphical models (chain rule and independence), and in dimensionality reduction (covariance structure).