Sketching - Randomized Numerical Linear Algebra with Examples

The workhorse of RandNLA is a technique called sketching.^[1] Sketching is a way to take a large matrix and create a smaller matrix (called a sketch) that approximates key properties of the original matrix. Which properties are approximated depends on the specific sketching method used, and the application at hand.

Diagram showing matrix sketching process: an n×d input matrix A is multiplied by a k×d sketching matrix S to produce a smaller k×d sketch SA, where k is much smaller than n

An example of sketching is illustrated above. Here a sketching matrix $\vec{S}$ with sketching/embedding dimension $k$ is applied to a $n\times d$ input matrix $\vec{A}$ where $n\gg d$ . The resulting $k\times d$ sketch $\vec{S}\vec{A}$ is much smaller than the original matrix, and can therefore be processed easily using classical techniques.

Perhaps the two important considerations when choosing a sketching distribution are:

How fast the sketching matrix can be generated and applied to $\vec{A}$ , and
How well the sketch $\vec{S}\vec{A}$ retains relevant information about the original matrix $\vec{A}$ .

While these considerations are often at odds with each other, by choosing the sketch at random from a suitable distribution, we can often achieve a good balance between the two. In fact, in many algorithms in RandNLA, the randomness in the sketching matrix is the only source of randomness in the entire problem, and is only needed to ensure the above two considerations.

Mixing vs sampling¶

In this book, we treat sketches that mix the rows of $\vec{A}$ and sketches that subsample the rows of $\vec{A}$ as conceptually different. We primarily focus on algorithms based on mixing-based sketches, since these tend to be most suitable as a default choice algorithm in many settings. We discuss subsampling-based methods, which can provide substantial speedups in certain contexts, in Chapter 7.

Footnotes¶

In RandNLA, sketching is almost always done using a linear transform of the original matrix.
↩

2 Sketching

Mixing vs sampling¶