LU decomposition

LU decomposition#

Solving linear systems of equations is among the most fundamental operations in a computer. Many problems (even nonlinear ones) eventually reduce to sequences of linear system solves.

Most students learn about Gaussian elimination, also known as row reduction, in school. The LU decomposition is simply a more organised way to perform Gaussian elimination.

Basic idea#

We consider the linear system of equations

\[ Ax = b \]

with \(A \in \mathbb{R}^{n \times n}\). Throughout, assume that \(A\) is nonsingular. The LU decomposition applies similarly to complex matrices.

Many techniques in numerical linear algebra rely on factoring a matrix into products of simpler matrices. The LU decomposition is one such factorization:

\[ A = LU \]

This represents a decomposition of \(A\) into the product of two matrices, \(L\) and \(U\). Substituting this into the system gives

\[ LUx = b \]

To solve this system, we now have to solve two matrix problems:

\[ Ly = b, \qquad Ux = y. \]

This might seem cumbersome. Instead of one linear system, we now have two. However, the trick is to choose a decomposition that makes solving linear systems with \(L\) and \(U\) particularly easy.

In LU decomposition, \(L\) is a lower triangular matrix, meaning its nonzero elements are confined to the diagonal and the entries below it. Similarly, \(U\) is an upper triangular matrix, where nonzero elements appear on the diagonal and above it.

Solving an upper triangular system#

Consider an upper triangular system \(Ux = y\). Its equations can be written as

\[ \sum_{j=i}^n u_{i,j} x_j = y_i, \]

for \(i = n, n-1, \dots, 1\). This leads to the well-known back-substitution procedure:

First, find \(x_n = y_n / u_{n,n}\).
Then, for each \(i\) from \(n-1\) down to \(1\),

\[ x_i = \frac{y_i - \sum_{j=i+1}^n u_{i,j} x_j}{u_{i,i}}. \]

The following fact ensures that there is no division by zero in the back-substitution procedure if \(U\) is invertible.

Fact: Invertible triangular systems

A triangular matrix is invertible if and only if its diagonal elements are non-zero.

What is the cost of solving this triangular system? We count the number of additions, subtractions, multiplications, and divisions.

For each \(i\), there are \(n - i\) multiplications and \(n - i - 1\) additions, plus one subtraction and one division.
Hence, the overall number of operations (\(ops\)) is

\[ ops = \sum_{i=1}^n [(n-i) + (n-i-1) + 1 + 1] = \sum_{i=1}^n [2 (n-i) + 1] = n^2. \]

Thus, triangular solves are efficient and form the backbone of methods like LU decomposition for solving large linear systems.

Gaussian elimination in matrix notation#

To derive the LU decomposition, we express Gaussian elimination operations as matrix multiplications.

At the beginning, we have \(A^{(1)} = A\) and \(b^{(1)} = b\). At each stage of Gaussian elimination, we eliminate the entries of a column below the diagonal. Let \(A^{(s)}\) and \(b^{(s)}\) be the matrix and right-hand side we obtain after eliminating the first \(s-1\) columns.

Schematically, we have the picture

\[\begin{split} A^{(s)} = \begin{pmatrix} \times & \times & \times & \times & \times\\ 0 & \times & \times & \times & \times\\ 0 & 0 & \times & \times & \times\\ 0 & 0 & \times & \times & \times\\ 0 & 0 & \times & \times & \times \end{pmatrix} \mapsto \begin{pmatrix} \times & \times & \times & \times & \times\\ 0 & \times & \times & \times & \times\\ 0 & 0 & \times & \times & \times\\ 0 & 0 & 0 & {\bf \times} & \times\\ 0 & 0 & 0 & \times & \times \end{pmatrix} = A^{(s+1)} \end{split}\]

To be more precise, given \(A^{(s)}\), subtract \(A^{(s)}_{is} / A^{(s)}_{ss}\) times row \(s\) from row \(i \in \{s+1, \ldots, n\}\):

\[ A^{(s+1)}_{ij} = A^{(s)}_{ij} - \frac{A^{(s)}_{is}}{A^{(s)}_{ss}} A^{(s)}_{sj} \]

Note that on a computer, the computation only needs to be carried out for \(j \in \{ s+1, \ldots, n\}\): for \(j \in \{1, \ldots, s-1\}\) both \(A^{(s)}_{ij}\) and \(A^{(s)}_{sj}\) vanish; for \(j = s\) the formula is precisely chosen to guarantee the cancellation of terms. Through successive elimination of columns, we reduce \(A\) to an upper triangular matrix \(A^{(n)}\) after \(n-1\) steps. We now rewrite the elimination step as matrix multiplication

\[ A^{(s+1)} = F^{(s)} \, A^{(s)} \]

where

\[\begin{split} F^{(s)} = \begin{pmatrix} 1\\ & \ddots\\ & & 1\\ & & \!\!\!\!\!\!\!\!\!\!\!\!-\ell_{s+1, s}\!\!\!\!\!\!\!\!\! & \ddots\\ & & \vdots & & \ddots\\ & & \!\!\!\!-\ell_{n, s} & & & 1\\ \end{pmatrix}, \qquad \ell_{i,s} = \frac{A^{(s)}_{is}}{A^{(s)}_{ss}}. \end{split}\]

Definition 3

A matrix where all diagonal elements are \(1\), with zeros elsewhere except for nonzero entries in column \(s\) below the diagonal is called a Frobenius matrix of index \(s\).

We denote the \(s\)th unit vector by \(e^{(s)}\), thus \(e_k^{(s)} = 1\) if \(s = k\) and \(e_k^{(s)} = 0\) otherwise. Any Frobenius matrix of index \(s\) can be written by choosing a vector

\[ f = (0, \ldots, 0, f_{s+1}, \ldots, f_n)^\top, \]

containing the subdiagonal elements:

\[ F = I - f \cdot (e^{(s)})^\top = I - f \otimes e^{(s)}. \]

Here \(I\) is the identity matrix and \(\otimes\) is the outer (or tensor) product: \(a \cdot b^\top = a \otimes b\) for \(a \in \mathbb{R}^m, b \in \mathbb{R}^n\). Observe that \(a \otimes b \in \mathbb{R}^{m \times n}\) and distinguish it from the inner product \(a^\top \cdot b = \langle a, b\rangle \in \mathbb{R}\).

Before continuing, make sure you fully understand

why \(A^{(s+1)} = F^{(s)} \, A^{(s)}\) precisely implements the Gaussian elimination in column \(s\),
why all Frobenius matrices \(F\) of index \(s\) are of the form \(I - f \otimes e^{(s)}\) where the first \(s\) elements of \(f\) vanish.

Also look at the self-check exercises below.

Deriving the LU decomposition#

So far, we have changed the description of Gaussian elimination, but have not added any substantially new mathematics. That is going to change now: the inverses of Frobenius matrices are known and efficiently computable. This is a rare property!

Lemma 2

Let \(F = I - f \otimes e^{(s)}\) be as above. Then

\[ G = I + f \otimes e^{(s)} \]

is the inverse of \(F\).

Proof. To show \(F \, G = I\), we compute the entries of the product:

\[ \sum_{k = 1}^n F_{ik} \, G_{kj} = \delta_{ij} - f_i \, e^{(s)}_j + f_i \, e^{(s)}_j - \sum_{k = 1}^n f_i e^{(s)}_k f_k e^{(s)}_j = \delta_{ij}. \]

Here \(\delta_{ij}\) is the Kronecker product.

Note that the inverse of a Frobenius matrix is also a Frobenius matrix. We now learn that products of Frobenius matrices can be computed with great ease.

Lemma 3

For \(s = 1, \ldots, n-1\) let \(G^{(s)} = I + f^{(s)} \otimes e^{(s)}\) be a Frobenius matrix of index \(s\). Then

\[\begin{split} G^{(1)} \cdots G^{(s)} = I + \sum_{r = 1}^s f^{(r)} \cdot (e^{(r)})^\top = \begin{pmatrix} 1\\ f^{(1)}_2 & 1\\ f^{(1)}_3 & f^{(2)}_3 & 1\\ \vdots & & \ddots & 1 \\ \vdots & & & f^{(s)}_n & \ddots\\ \vdots & & & \vdots & & \ddots\\ f^{(1)}_n & f^{(2)}_n & \cdots & f^{(s)}_n & & & 1 \end{pmatrix} . \end{split}\]

Proof. We begin induction in \(s\) with \(s = 1\):

\[ G^{(1)} = I + f^{(1)} \cdot (e^{(1)})^\top. \]

Suppose the statement of the lemma holds for \(s\). Then for the element in row \(i\) and column \(j\) of the product we find

\[\begin{split} \begin{align*} \Big( G^{(1)} \cdots G^{(s)} G^{(s+1)} \Big)_{ij} & = \delta_{ij} + \sum_{r=1}^s f_i^{(r)} \, e_j^{(r)} + f_i^{(s+1)} \, e_j^{(s+1)} + \sum_{k=1}^n \sum_{r=1}^s f_i^{(r)} \, e_k^{(r)} f_k^{(s+1)} \, e_j^{(s+1)}\\ & = \delta_{ij} + \sum_{r=1}^{s+1} f_i^{(r)} \, e_j^{(r)}. \end{align*} \end{split}\]

Now everything is in place to return to Gaussian elimination:

\[ U := A^{(n)} = F^{(n-1)} \, A^{(n-1)} = F^{(n-1)} \cdots F^{(1)} A^{(1)}. \]

With \(L = G^{(1)} \cdots G^{(n-1)}\), we conclude

\[ LU = G^{(1)} \cdots G^{(n-1)} F^{(n-1)} \cdots F^{(1)} A^{(1)} = A^{(1)} = A. \]

We showed that if Gaussian elimination works (without causing division by 0 or needing row or column permutations, called pivoting) then we can find an LU decomposition.

Theorem 3

Suppose \(A \in \mathbb{R}^{n \times n}\) can be transformed into an upper triangular \(U \in \mathbb{R}^{n \times n}\) by Gaussian elimination (without pivoting). Then there exists a lower triangular \(L \in \mathbb{R}^{n \times n}\) such that

\[ A = LU. \]

The complete algorithm#

Consider that we want to solve the linear system

\[ Ax = b \]

First, we compute the LU decomposition

\[ A = LU \]

using Gaussian elimination, storing the \(\ell_{ij}\) multipliers for each row in the corresponding positions of the \(L\) matrix.

Next, we solve \(Ly = b\) by forward substitution. This involves solving for \(y_1\) first, then \(y_2\), and so on. The algorithm is similar to the triangular solve discussed earlier, but we proceed from the first entry to the last entry of \(y\) since the matrix \(L\) is lower triangular. Additionally, since all diagonal entries of \(L\) are \(1\), we do not need to divide by \(L_{ii}\).

Finally, we solve \(Ux = y\) by backward substitution.

Python skills#

To perform LU decomposition in Python, we can use the scipy.linalg.lu function from the SciPy library. This function returns the permutation matrix P, lower triangular matrix L, and upper triangular matrix U such that PA = LU. We shall investigate the use of P in the next section.

import numpy as np
from scipy.linalg import lu

# Example matrix
A = np.array([[3, 1, 6], [2, 1, 3], [1, 1, 1]])

# Perform LU decomposition
P, L, U = lu(A)

print("P:\n", P)
print("L:\n", L)
print("U:\n", U)

We can use the above factors to solve the linear system with different right-hand sides. The inverse of P is its own transpose.

# Example right-hand sides
b1 = np.array([1, 2, 3])
b2 = np.array([1, -2, 3])

# Check that P is its own inverse
print(f"P * P =\n {P @ P}\n")

def trig_solve(P, L, U, b):
  # Forward substitution to solve Ly = P b
  y = np.linalg.solve(L, np.dot(P, b))

  # Backward substitution to solve Ux = y
  x = np.linalg.solve(U, y)

  return x

print(f"Solution x1: {trig_solve(P, L, U, b1)}")
print(f"Solution x2: {trig_solve(P, L, U, b2)}")

Self-check questions#

Question

Write down how to solve the below general \(3 \times 3\) system

\[\begin{split} \begin{pmatrix} a_{11} & a_{12} & a_{13}\\ a_{21} & a_{22} & a_{23}\\ a_{31} & a_{32} & a_{33} \end{pmatrix} \begin{pmatrix} x_1\\ x_2 \\ x_3 \end{pmatrix} = \begin{pmatrix}b_1 \\ b_2 \\ b_3\end{pmatrix} \end{split}\]

with Gaussian elimination. You may assume that the system is invertible.

Question

Do \(LU\) decomposition on

\[\begin{split} A = \left( \begin{array}{rrrr} 3 & 1 & 4 & 1\\ 15 & 14 & 22 & 11 \\ 15 & 32 & 31 & 31\\ 27 & 72 & 95 & 126\\ \end{array} \right). \end{split}\]

Question

A triangular matrix is invertible if and only if its diagonal elements are non-zero.

Question

Eliminate the elements below the diagonal in the first column of the matrix

\[\begin{split} A = \begin{pmatrix} 2 & 1 & 1 \\ 4 & -6 & 0 \\ -2 & 7 & 2 \end{pmatrix} \end{split}\]

using multiplication with a Frobenius matrix.

Question

The Frobenius matrix

\[\begin{split} F = \begin{pmatrix} 1 & 0 & 0 \\ -2 & 1 & 0 \\ 1 & 0 & 1 \end{pmatrix} \end{split}\]

has the form \(F = I - f \cdot g^\top\). What are \(f, g \in \mathbb{R}^3\)?

Question

Prove the Sherman-Morrison formula

\[ (A+uv^\top)^{-1} = A^{-1} - \frac{A^{-1}uv^\top A^{-1}}{1+v^\top A^{-1}u} \]

if \(1+v^\top A^{-1}u\neq 0\).

Question

Compute the LU decomposition of the following matrix:

\[\begin{split} A = \begin{pmatrix} 4 & 3 & 2 \\ 6 & 3 & 0 \\ 2 & 1 & 1 \end{pmatrix} \end{split}\]

Question

Suppose that \(A\) is symmetric and \(A = L U\) is its \(LU\) decomposition. Is \(U = L^\top\)?

Question

Given a non-singular \(A \in \mathbb{R}^{3 \times 3}\), is its \(LU\) decomposition unique?

Question

Let \(A \in \mathbb{R}^{n \times n}\). Prove that an LU decomposition of \(A\) with nonsingular factors \(L\) and \(U\) exists if and only if all leading principal minors of \(A\) (i.e. the determinants of the \(m \times m\) submatrices \(A_{:m, :m}\) for \(m = 1, \ldots, n\)) are nonzero. Determine whether such an LU decomposition exists for the matrix

\[\begin{split} \begin{pmatrix} 0.5 & -2 & 2 \\ -1 & 4 & -5 \\ 9 & -5 & 8 \end{pmatrix}. \end{split}\]

Justify your conclusion.

LU decomposition

Contents

LU decomposition#

Basic idea#

Solving an upper triangular system#

Gaussian elimination in matrix notation#

Deriving the LU decomposition#

The complete algorithm#

Python skills#

Self-check questions#