Bartels–Stewart algorithm
In numerical linear algebra, the Bartels–Stewart algorithm is used to numerically solve the Sylvester matrix equation $AX - XB = C$. Developed by R.H. Bartels and G.W. Stewart in 1972, it was the first numerically stable method that could be systematically applied to solve such equations. The algorithm works by using the Schur decompositions of $A$ and $B$ to transform $AX - XB = C$ into a triangular system that can then be solved using forward and back substitution. In 1979, G. Golub, C. Van Loan and S. Nash introduced an improved version of the algorithm, known as the Hessenberg–Schur algorithm. It remains the standard approach for solving Sylvester equations when $X$ is of small to moderate size.
The algorithm
Let $X, C \in \mathbb{R}^{m \times n}$, and assume that the eigenvalues of $A \in \mathbb{R}^{m \times m}$ are distinct from the eigenvalues of $B \in \mathbb{R}^{n \times n}$. Then, the equation $AX - XB = C$ has a unique solution $X$. The Bartels–Stewart algorithm computes $X$ by applying the following steps:
1. Compute the real Schur decompositions
$R = U^T A U$,
$S = V^T B^T V$.
The matrices $R$ and $S$ are block-upper triangular matrices, with square diagonal blocks of size no greater than $2 \times 2$.
2. Set $F = U^T C V$.
3. Solve the simplified system $RY - YS^T = F$, where $Y = U^T X V$. This can be done using forward substitution on the blocks. Specifically, if $s_{k,k-1} = 0$, then
$(R - s_{kk} I) y_k = f_k + \sum_{j=k+1}^{n} s_{kj} y_j$,
where $y_k$ is the $k$th column of $Y$ and $f_k$ is the $k$th column of $F$.
4. Set $X = U Y V^T$.
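The four steps can be illustrated with a short NumPy/SciPy sketch. For brevity it uses the complex Schur form (so every diagonal block is $1 \times 1$ and no $2 \times 2$ block handling is needed) and conjugate transposes in place of the transposes above; the function name bartels_stewart and the column-by-column solve are illustrative assumptions, not a reference implementation.

```python
import numpy as np
from scipy.linalg import schur, solve_triangular

def bartels_stewart(A, B, C):
    """Solve the Sylvester equation A X - X B = C (illustrative sketch).

    Uses the complex Schur form, so every diagonal block is 1x1 and the
    2x2-block bookkeeping of the real Schur form is not needed.
    """
    m, n = C.shape
    # Step 1: Schur decompositions A = U R U^H and B^H = V S V^H.
    R, U = schur(A, output='complex')
    S, V = schur(B.conj().T, output='complex')
    # Step 2: transform the right-hand side, F = U^H C V.
    F = U.conj().T @ C @ V
    # Step 3: solve R Y - Y S^H = F column by column, starting from the last
    # column, since column k only depends on columns k+1, ..., n of Y.
    Y = np.zeros((m, n), dtype=complex)
    I = np.eye(m)
    for k in range(n - 1, -1, -1):
        rhs = F[:, k] + Y[:, k + 1:] @ S[k, k + 1:].conj()
        # R - conj(s_kk) I is upper triangular, so a triangular solve suffices.
        Y[:, k] = solve_triangular(R - S[k, k].conj() * I, rhs)
    # Step 4: transform back, X = U Y V^H.
    X = U @ Y @ V.conj().T
    return X.real if all(map(np.isrealobj, (A, B, C))) else X
```

A production implementation would instead use the real Schur form and handle the $2 \times 2$ diagonal blocks explicitly, as described above.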
Computational cost
Using the QR algorithm, the Schur decompositions in step 1 require approximately $10(m^3 + n^3)$ flops, so that the overall computational cost is $\mathcal{O}(m^3 + n^3)$.
The Hessenberg-Schur algorithm
The Hessenberg–Schur algorithm replaces the decomposition $R = U^T A U$ in step 1 with the decomposition $H = Q^T A Q$, where $H$ is an upper-Hessenberg matrix and $S = V^T B^T V$ is still block-upper triangular. This leads to a system of the form $HY - YS^T = F$ that can be solved using forward substitution. The advantage of this approach is that $H = Q^T A Q$ can be found using Householder reflections at a cost of approximately $(5/3)m^3$ flops, compared to the roughly $10m^3$ flops required to compute the Schur decomposition of $A$.
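A minimal sketch of this modification, under the same simplifying assumptions as the sketch above (complex Schur form for $B^T$, illustrative function names), replaces the Schur decomposition of $A$ with a Hessenberg reduction:

```python
import numpy as np
from scipy.linalg import hessenberg, schur

def hessenberg_schur(A, B, C):
    """Sketch of the Hessenberg-Schur variant for A X - X B = C."""
    m, n = C.shape
    # Only A is reduced, to upper Hessenberg form: A = Q H Q^H.
    H, Q = hessenberg(A, calc_q=True)
    # B^H still receives a full (complex) Schur decomposition.
    S, V = schur(B.conj().T, output='complex')
    F = Q.conj().T @ C @ V
    Y = np.zeros((m, n), dtype=complex)
    I = np.eye(m)
    for k in range(n - 1, -1, -1):
        rhs = F[:, k] + Y[:, k + 1:] @ S[k, k + 1:].conj()
        # Each column solve involves a shifted upper-Hessenberg matrix; a
        # dedicated Hessenberg solver (e.g. Givens rotations) would cost
        # O(m^2) per column, but np.linalg.solve keeps the sketch short.
        Y[:, k] = np.linalg.solve(H - S[k, k].conj() * I, rhs)
    X = Q @ Y @ V.conj().T
    return X.real if all(map(np.isrealobj, (A, B, C))) else X
```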
Implementations and availability
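Library implementations of these methods are widely available. For example, SciPy's scipy.linalg.solve_sylvester is documented as using a Bartels–Stewart-type approach (Schur decompositions followed by LAPACK's triangular Sylvester solver *trsyl), MATLAB provides sylvester and lyap, and the Hessenberg–Schur variant appears, for example, in the SLICOT library. A brief usage sketch follows; note that SciPy's convention is $AX + XB = Q$, so the sign of $B$ is flipped to match the $AX - XB = C$ form used in this article.

```python
import numpy as np
from scipy.linalg import solve_sylvester

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
B = rng.standard_normal((3, 3))
C = rng.standard_normal((5, 3))

# SciPy solves A X + X B = Q, so negate B to solve A X - X B = C.
X = solve_sylvester(A, -B, C)
print(np.allclose(A @ X - X @ B, C))  # True up to rounding error
```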
Alternative approaches and related problems
For large systems, the $\mathcal{O}(m^3 + n^3)$ cost of the Bartels–Stewart algorithm can be prohibitive. When $A$ and $B$ are sparse or structured, so that linear solves and matrix–vector products involving them are cheap, iterative algorithms can potentially perform better. These include projection-based methods, which use Krylov subspace iterations, methods based on the alternating direction implicit (ADI) iteration, and hybridizations that involve both projection and ADI. Iterative methods can also be used to directly construct low-rank approximations to $X$ when solving $AX - XB = C$. This is important when, for instance, $X$ is too large to be stored in memory explicitly.