Suppose is positive definite and recall the Nyström approximation
When the columns of a subset of the columns of the identity, then corresponds to subsampling the columns of . In particular, let be a tuple containing the pivots (columns of ) we will observe; i.e. so that . Then
In particular, since is symmetric, we can compute having only observed , which contains just entries of .
The key question is how best to choose the columns of . Ideally, we would like to choose columns so that the Nyström approximation is competitive with the best rank- approximation to .
Partial Cholesky factorization with pivoting¶
It turns out that, to compute (7.5) we can use a Cholesky factorization algorithm with pivoting, and stop after steps.
Note that the “textbook” Cholesky factorization algorithms maintain directly. On the other hand, anticipating that we will terminate for some , Algorithm 7.2 only computes the necessary parts of as they are needed.
Proof
See Ethan’s blog.
Adaptive pivoting¶
Nothing in Algorithm 7.2 requires that the pivot be chosen prior to step ! In particular, we can choose the -th pivot adaptively, based on the approximation , where . While computing the error would let us try to find the column that would reduce the error the most, we want to avoid looking at all of the entires of .
Amazingly, we can find good pivots without observing all of . Towards this end, note that:
The error of a Nyström approximation is positive definite.
For any positive definite , .
By computing the diagonal entries of , we can keep track of and use this to choose the pivot. One approach is to greedily choose the pivot as the largest entries of . However, this approach is has the tendency to focus on outlier entires. Instead, we can sample proprtional to the values . This results in the Randomly Pivoted Cholesky algorithm introduced in Chen et al., 2024
- Chen, Y., Epperly, E. N., Tropp, J. A., & Webber, R. J. (2024). Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations. Communications on Pure and Applied Mathematics, 78(5), 995–1041. 10.1002/cpa.22234