This is a companion piece to Chen et al. (2022). This approach has prompted a number of follow-up works that generalize it to non-symmetric problems, and block and rational algorithms Xu & Chen (2024), Simunec (2024), Adler et al. (2025).
Code to replicate the figures in the corresponding paper can be found on Github.
Introduction¶
The Lanczos method for matrix function approximation (Lanczos-FA) can be used to approximate , and in the case that and is positive definite, this approximation is optimal over Krylov subspace. This case is very well studied and a range of error bounds and estimates exist. However, for other functions, the standard bounds are often too pessimistic as they do not take into account fine grained information about the spectrum of such as outlaying or clustered eigenvalues. This makes it difficult to know when Lanczos-FA has reached a suitable accuracy for a given problem.
In this paper we show how to reduce the error of approximating with Lanczos-FA to the error of solving a certain linear system with the Lanczos-FA. This allows us to leverage the range of existing bounds for the convergence of Lanczos-FA on linear systems to easily obtain a priori and a posteriori bounds for other matrix functions including piecewise analytic functions such as the sign function. Our a posteriori bounds are highly accurate and can be used as practical stopping criteria.
The basic idea¶
The -th Lanczos-FA approximation to is defined as
where and are produced by the Lanczos method run for steps on .
Suppose is a Hermitian matrix and If is analytic in a neighborhood of the eigenvalues of and is a contour in this neighborhood containing these eigenvalues,
If the eigenvalues of are also contained in , we similarly have
Combining, these, we see that the Lanczos-FA error can be written as
For , define the -th Lanczos-FA error and residual for the linear system as,
It is a well-known fact that
Consequently, for all , where and are both invertible,
where
Thus, if is a simple closed curve or union of simple closed curves inside this neighborhood and enclosing the eigenvalues of and and a point not in ,
Applying the triangle inequality and the submultiplicitivity of matrix-norms, we can then obtain a bound
where are some suitably chosen sets and .
Note that the integral term and linear system error term in the theorem are entirely decoupled! Thus, once the integral term is computed, bounding the error of Lanczos-FA for is reduced to bounding , and if the integral term can be bounded independently of , This implies that, up to a constant factor, the Lanczos-FA approximation to converges at least as fast as .
What else is in the paper?¶
Much of the paper is focused on practical aspects regarding the use of such a bound. In particular:
we provide a detailed discussion and analysis of the validity of our bounds when the Lanczos iterate was computed using finite precision arithmetic.
we provide several analytic examples where the integral term is computed directly
we use numerical experiments to demonstrate the effectiveness of our bounds on a variety of functions such as the square root, inverse square root, and sign functions, and
we derive similar bounds for quadratic forms.
It’s worth noting that similar ideas have been previously used to get error bounds for Stieltjes functions Ilic et al., 2009Frommer et al., 2014Frommer & Schweitzer, 2015. As discussed in Section 2.2 of the paper, our bounds differ in a number of key ways. One key difference is that we reduce error bounds for Lanczos-FA a general function to error bounds for Lanczos-FA on a fixed linear system. This allows intuition about the convergence of Lanczos-FA on linear systems to be transferred to other functions. In addition, our analysis allows bounds for piecewise continuous functions like the sign function.
- Chen, T., Greenbaum, A., Musco, C., & Musco, C. (2022). Error Bounds for Lanczos-Based Matrix Function Approximation. SIAM Journal on Matrix Analysis and Applications, 43(2), 787–811. 10.1137/21m1427784
- Xu, Q., & Chen, T. (2024). A posteriori error bounds for the block-Lanczos method for matrix function approximation. Numerical Algorithms. 10.1007/s11075-024-01819-7
- Simunec, I. (2024). Error bounds for the approximation of matrix functions with rational Krylov methods. Numerical Linear Algebra with Applications, 31(5), e2571. https://doi.org/10.1002/nla.2571
- Adler, J. H., Hu, X., Pan, W., & Xue, Z. (2025). Error Estimates for the Arnoldi Approximation of a Matrix Square Root. https://arxiv.org/abs/2506.22615
- Ilic, M. D., Turner, I. W., & Simpson, D. P. (2009). A restarted Lanczos approximation to functions of a symmetric matrix. IMA Journal of Numerical Analysis, 30(4), 1044–1061. 10.1093/imanum/drp003
- Frommer, A., Güttel, S., & Schweitzer, M. (2014). Efficient and Stable Arnoldi Restarts for Matrix Functions Based on Quadrature. SIAM Journal on Matrix Analysis and Applications, 35(2), 661–683. 10.1137/13093491x
- Frommer, A., & Schweitzer, M. (2015). Error bounds and estimates for Krylov subspace approximations of Stieltjes matrix functions. BIT Numerical Mathematics, 56(3), 865–892. 10.1007/s10543-015-0596-3