Subspace Embedding - Randomized Numerical Linear Algebra with Examples

An important property of a sketching matrix is that it preserves the geometry of a subspace. This is quantified by the following definition:

In particular, a subspace embedding ensures that the lengths of all vectors within a subspace are preserved by the sketch.

Equivalent Characterizations¶

It will sometimes be useful to work with the following equivalent characterizations of subspace embeddings:

Theorem 2.1

Let $\vec{V}$ be an orthonormal basis for a subspace $V$ of $\R^n$ . The following are equivalent:

$\vec{S}$ is a subspace embedding for $V$ with distortion $\varepsilon$
$\forall \vec{c}\in \R^d: (1-\varepsilon)\|\vec{V}\vec{c}\| \leq \|\vec{S}\vec{V}\vec{c}\| \leq (1+\varepsilon)\|\vec{V}\vec{c}\|$
$\forall \vec{c}\in \R^d: (1-\varepsilon)\|\vec{c}\| \leq \|\vec{S}\vec{V}\vec{c}\| \leq (1+\varepsilon)\|\vec{c}\|$
$\| \vec{V}^\T \vec{S}^\T \vec{S} \vec{V} - \vec{I} \|_2 \leq \varepsilon(2+\varepsilon)$
$1-\varepsilon \leq \smin(\vec{S}\vec{V}) \leq \smax(\vec{S}\vec{V}) \leq 1+\varepsilon$

Proof

Suppose $\vec{S}$ is a subspace embedding for $V$ . Any $\vec{x} \in V$ can be written as $\vec{x} = \vec{V}\vec{c}$ for some $\vec{c} \in \R^d$ . Thus, the subspace embedding condition can be rewritten as (this is true even if $\vec{V}$ isn’t orthonormal)

\forall \vec{c}\in \R^d: (1-\varepsilon)\|\vec{V}\vec{c}\| \leq \|\vec{S}\vec{V}\vec{c}\| \leq (1+\varepsilon)\|\vec{V}\vec{c}\|.

Since $\vec{V}$ is orthonormal, $\|\vec{V}\vec{c}\| = \|\vec{c}\|$ and hence we have the equivalent condition

\forall \vec{c}\in \R^d: (1-\varepsilon)\|\vec{c}\| \leq \|\vec{S}\vec{V}\vec{c}\| \leq (1+\varepsilon)\|\vec{c}\|.

We can rewrite this as

\smax(\vec{S}\vec{V}) = \max_{\vec{c}\in \R^d} \frac{\|\vec{S}\vec{V}\vec{c}\|}{\|\vec{c}\|} \leq 1+\varepsilon ,\qquad \smin(\vec{S}\vec{V}) = \min_{\vec{c}\in \R^d} \frac{\|\vec{S}\vec{V}\vec{c}\|}{\|\vec{c}\|} \geq 1-\varepsilon.

Finally, note that

\begin{aligned} \| \vec{V}^\T \vec{S}^\T \vec{S} \vec{V} - \vec{I} \|_2 &= \max_{i} |\lambda_i(\vec{V}^\T \vec{S}^\T \vec{S} \vec{V}) - 1|| \\&= \max_{i} |\smax(\vec{S}\vec{V})^2 - 1| \\&= \max\{ (1+\varepsilon)^2 - 1, 1 - (1-\varepsilon)^2 \} \\&= \varepsilon(2+\varepsilon). \end{aligned}

In TCS, where the rates as $\varepsilon\to0$ are important, the set of “equivalent” definitions is larger. For instance, the condition on the spectral norm is often written as $\| \vec{V}^\T \vec{S}^\T \vec{S} \vec{V} - \vec{I} \|_2 \leq \varepsilon$ . Similarly, in some cases, a subspace embedding is defined with respect to the squared norms, since $\sqrt{1+\varepsilon} = 1 + \varepsilon/2 + O(\varepsilon^2)$ as $\varepsilon \to 0$ .

Randomized Numerical Linear Algebra with Examples

Sketching

Randomized Numerical Linear Algebra with Examples

Mixing Sketches

2.1Subspace Embedding

Equivalent Characterizations¶