Matrix Diagonalization
Definition
A square matrix $A$ is diagonalisable if it can be written as
$$A = P D P^{-1},$$
where $P$ is an invertible matrix whose columns are eigenvectors of $A$, and $D$ is the diagonal matrix of corresponding eigenvalues. Equivalently, $P^{-1} A P = D$.
Intuition
Diagonalisation decomposes a transformation into independent “modes”: change to the eigenvector basis (via $P^{-1}$), apply a pure scaling along each axis (via $D$), then return to the original basis (via $P$). In the eigenvector basis the transformation is trivially simple — each coordinate is just scaled by its eigenvalue with no mixing between dimensions. This separation of independent modes makes repeated application, inversion, and analysis dramatically easier.
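The three-step picture can be checked numerically. A minimal NumPy sketch — the matrix and vector below are arbitrary illustrative choices (eigenvalues 5 and 2):

```python
import numpy as np

# Illustrative 2x2 matrix with real eigenvalues 5 and 2.
A = np.array([[4.0, 1.0],
              [2.0, 3.0]])
eigvals, P = np.linalg.eig(A)   # columns of P are eigenvectors
D = np.diag(eigvals)

x = np.array([1.0, -2.0])
# Change to the eigenvector basis, scale each coordinate, change back:
y = P @ (D @ (np.linalg.inv(P) @ x))
assert np.allclose(y, A @ x)    # same result as applying A directly
```

Applying $P^{-1}$, then $D$, then $P$ reproduces the action of $A$ exactly.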
Formal Description
Derivation. Collecting the eigenrelations $A v_i = \lambda_i v_i$ into a single matrix equation gives
$$A P = P D,$$
where $P = [v_1 \mid \cdots \mid v_n]$ and $D = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$. If $P$ is invertible (i.e. the eigenvectors are linearly independent), both sides can be right-multiplied by $P^{-1}$ to obtain $A = P D P^{-1}$.
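The derivation can be verified directly. A NumPy sketch, using an arbitrary symmetric example matrix so the eigenvectors are guaranteed independent:

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
eigvals, P = np.linalg.eig(A)
D = np.diag(eigvals)

# Each column satisfies A v_i = lambda_i v_i; stacked together, AP = PD.
assert np.allclose(A @ P, P @ D)
# P is invertible here, so right-multiplying by P^{-1} recovers A.
assert np.allclose(P @ D @ np.linalg.inv(P), A)
```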
Condition for diagonalisability. An $n \times n$ matrix is diagonalisable if and only if it has $n$ linearly independent eigenvectors. In particular, a matrix with $n$ distinct eigenvalues is always diagonalisable.
Eigendecomposition. More generally, the eigendecomposition of $A$ is
$$A = V \operatorname{diag}(\boldsymbol{\lambda})\, V^{-1},$$
where the columns of $V$ are eigenvectors and $\boldsymbol{\lambda}$ is the vector of eigenvalues. Not every matrix admits such a decomposition over $\mathbb{R}$, but every real symmetric matrix $A$ can be decomposed as
$$A = Q \Lambda Q^{\top},$$
where $Q$ is orthogonal ($Q^{\top} Q = Q Q^{\top} = I$) and its columns are eigenvectors of $A$. By convention eigenvalues in $\Lambda$ are sorted in descending order; the decomposition is unique (up to the signs of the eigenvectors) when all eigenvalues are distinct.
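For symmetric matrices, NumPy's `np.linalg.eigh` computes this decomposition; a small sketch with an arbitrary symmetric example (note `eigh` returns eigenvalues in ascending order, the opposite of the descending convention above):

```python
import numpy as np

S = np.array([[3.0, 1.0],
              [1.0, 3.0]])        # symmetric; eigenvalues 4 and 2
w, Q = np.linalg.eigh(S)          # eigh: for symmetric/Hermitian input

assert np.allclose(Q.T @ Q, np.eye(2))       # Q is orthogonal
assert np.allclose(Q @ np.diag(w) @ Q.T, S)  # S = Q Lambda Q^T
# eigh sorts ascending; reverse both factors for the descending convention.
w_desc, Q_desc = w[::-1], Q[:, ::-1]
assert np.allclose(Q_desc @ np.diag(w_desc) @ Q_desc.T, S)
```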
Matrix powers. Diagonalisation makes computing powers straightforward. For $A = P D P^{-1}$,
$$A^k = P D^k P^{-1},$$
where $D^k = \operatorname{diag}(\lambda_1^k, \ldots, \lambda_n^k)$. Each repeated application of $A$ simply raises the diagonal entries of $D$ to the next power.
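A quick sketch comparing the diagonal shortcut against repeated multiplication (the matrix and exponent are arbitrary illustrative choices):

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
w, P = np.linalg.eig(A)

k = 10
# A^k = P D^k P^{-1}: only the diagonal entries get raised to the power.
A_k = P @ np.diag(w ** k) @ np.linalg.inv(P)
assert np.allclose(A_k, np.linalg.matrix_power(A, k))
```

One eigendecomposition plus elementwise powers replaces $k-1$ matrix multiplications.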
Properties from eigenvalues.
- $A$ is singular if and only if at least one eigenvalue is zero.
- $A$ is positive definite if all eigenvalues are positive; positive semidefinite if all eigenvalues are non-negative. Positive semidefinite matrices satisfy $x^{\top} A x \ge 0$ for all $x$; positive definite matrices additionally have $x^{\top} A x > 0$ for all $x \neq 0$.
- The eigendecomposition of a real symmetric matrix $A$ can be used to optimise quadratic forms $x^{\top} A x$ subject to $\lVert x \rVert_2 = 1$: the maximum is the largest eigenvalue, attained at the corresponding eigenvector.
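These properties can be illustrated in a few lines of NumPy; the matrix below is a random positive definite construction chosen for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.normal(size=(4, 4))
A = B.T @ B + np.eye(4)     # symmetric positive definite by construction
w, Q = np.linalg.eigh(A)

assert np.all(w > 0)        # positive definite: every eigenvalue positive
# Maximising x^T A x over unit vectors yields the largest eigenvalue,
# attained at the corresponding eigenvector (last column: eigh sorts ascending).
x = Q[:, -1]
assert np.isclose(x @ A @ x, w[-1])
```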
Applications
- Computing matrix powers and matrix exponentials efficiently (e.g., solving linear ODEs).
- PCA and spectral methods in machine learning rely on eigendecomposition of covariance or kernel matrices.
- Decoupling coupled differential equations into independent scalar equations.
- Checking positive definiteness of matrices (e.g., for valid covariance matrices or convex quadratic forms).
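As a sketch of the decoupling idea: for $\dot{x} = A x$, each mode in the eigenbasis evolves independently as $e^{\lambda_i t}$, so $x(t) = P\, e^{Dt} P^{-1} x(0)$. The matrix below is an arbitrary example with eigenvalues $-1$ and $-2$; the result is cross-checked against a truncated Taylor series for $e^{At}$:

```python
import numpy as np

A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])     # eigenvalues -1 and -2
w, P = np.linalg.eig(A)
x0 = np.array([1.0, 0.0])
t = 0.5

# Decoupled solve: scale each eigen-coordinate by e^{lambda_i * t}.
x_t = P @ (np.exp(w * t) * (np.linalg.inv(P) @ x0))

# Cross-check against a truncated Taylor series for exp(A t).
M, term = np.eye(2), np.eye(2)
for k in range(1, 30):
    term = term @ (A * t) / k
    M = M + term
assert np.allclose(x_t, M @ x0)
```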
Trade-offs
- Not all matrices are diagonalisable: a defective matrix has a repeated eigenvalue with fewer independent eigenvectors than its algebraic multiplicity. Jordan normal form is the generalisation, but it is more complex to work with.
- Even when diagonalisable, computing the full eigendecomposition costs $O(n^3)$; for very large sparse matrices only a few dominant eigenvalues are typically computed via iterative methods (e.g., Lanczos, Arnoldi).
- Symmetric eigendecomposition ($A = Q \Lambda Q^{\top}$) is numerically better conditioned and always exists over $\mathbb{R}$; the general (non-symmetric) case can produce complex eigenvalues and an ill-conditioned $P$.
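The simplest iterative scheme is power iteration, of which Lanczos and Arnoldi are Krylov-subspace refinements. A self-contained sketch (the matrix is an arbitrary symmetric example):

```python
import numpy as np

def power_iteration(A, n_iter=500):
    """Estimate the dominant eigenpair by repeatedly applying A and
    normalising -- only matrix-vector products are needed, so this
    scales to large sparse matrices."""
    x = np.ones(A.shape[0])
    for _ in range(n_iter):
        x = A @ x
        x /= np.linalg.norm(x)
    return x @ A @ x, x          # Rayleigh quotient and eigenvector estimate

A = np.array([[4.0, 1.0],
              [1.0, 3.0]])
lam, v = power_iteration(A)
assert np.isclose(lam, np.linalg.eigvalsh(A)[-1])   # matches largest eigenvalue
```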
Links
- Eigenvalues and Eigenvectors — eigenvectors and eigenvalues are the building blocks of diagonalisation
- Special Matrices — identity matrix used in the eigenvalue problem; orthogonal matrices arise in symmetric eigendecomposition