The active regression problem is a variant of the standard linear regression problem.

Most of the algorithms presented in Chapter 4 on linear regression require reading the entire vector $\vec{b}$. In the active regression problem, however, we measure cost by the number of entries of $\vec{b}$ that are observed, so such methods are off the table.

This section outlines basic sampling-based approaches to the active regression problem that aim to use a small number of entry evaluations.

Leverage score sampling

Recall that $\Call{leverage-dist}(\vec{A})$ is the distribution corresponding to sampling an index from $\{1, \ldots, n\}$ with probability proportional to the leverage scores $(\ell_1, \ldots, \ell_n)$ of $\vec{A}$.
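As a concrete illustration, here is a minimal NumPy sketch of this sampling distribution. It computes the leverage scores as the squared row norms of an orthonormal basis for the column span of $\vec{A}$ (via a reduced QR factorization), then draws indices proportionally; the function names are illustrative, not from the text.

```python
import numpy as np

def leverage_scores(A):
    """Leverage scores of A: squared row norms of an orthonormal
    basis Q for the column span of A (reduced QR factorization)."""
    Q, _ = np.linalg.qr(A, mode="reduced")
    return np.sum(Q**2, axis=1)

def leverage_dist_sample(A, k, seed=None):
    """Draw k row indices from {0, ..., n-1} with probability
    proportional to the leverage scores of A."""
    rng = np.random.default_rng(seed)
    ell = leverage_scores(A)
    p = ell / ell.sum()  # leverage scores of a rank-d matrix sum to d
    idx = rng.choice(A.shape[0], size=k, replace=True, p=p)
    return idx, p

# Example: for a tall full-rank 100 x 3 matrix, the scores sum to 3
A = np.random.default_rng(0).standard_normal((100, 3))
idx, p = leverage_dist_sample(A, k=20, seed=1)
```

Note that each leverage score lies in $[0, 1]$ and the scores sum to the rank of $\vec{A}$, which is why normalizing them yields a valid probability distribution.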

Analysis

Note that Algorithm 7.1 is nothing more than Algorithm 4.3 (sketch-and-solve) using the leverage-score sketch. Theorem 2.9 guarantees that the leverage-score sketch is a subspace embedding for $\vec{A}$. However, we cannot immediately apply the analysis techniques used for sketch-and-solve, because these require that the sketch is a subspace embedding for $[\vec{A}, \vec{b}]$. The standard approach is instead to make use of an approximate matrix multiplication guarantee.
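The sketch-and-solve procedure with a leverage-score sketch can be sketched in code as follows. This is a hedged illustration, not a transcription of Algorithm 7.1: the function names and the entry oracle `b_entry(i)` are assumptions introduced here to emphasize that only the sampled entries of $\vec{b}$ are ever read.

```python
import numpy as np

def active_leverage_lstsq(A, b_entry, k, seed=None):
    """Sketch-and-solve least squares with a leverage-score sketch.
    Only k entries of b are observed, through the oracle
    b_entry(i) -> b[i] (an illustrative interface, not from the text)."""
    rng = np.random.default_rng(seed)
    # Leverage scores via a reduced QR factorization of A
    Q, _ = np.linalg.qr(A, mode="reduced")
    ell = np.sum(Q**2, axis=1)
    p = ell / ell.sum()
    # Sample k rows proportional to leverage and rescale by
    # 1/sqrt(k * p_i) so the sketch is unbiased in expectation
    idx = rng.choice(A.shape[0], size=k, replace=True, p=p)
    w = 1.0 / np.sqrt(k * p[idx])
    SA = w[:, None] * A[idx]
    Sb = w * np.array([b_entry(i) for i in idx])  # k entry observations
    x, *_ = np.linalg.lstsq(SA, Sb, rcond=None)
    return x

# Example: a noiseless problem where b lies in the range of A,
# so the sketched solution recovers x_true exactly
rng = np.random.default_rng(0)
A = rng.standard_normal((200, 3))
x_true = np.array([1.0, -2.0, 0.5])
b = A @ x_true
x_hat = active_leverage_lstsq(A, lambda i: b[i], k=30, seed=1)
```

The rescaling by $1/\sqrt{k\, p_i}$ is what makes the sampled sketch match the subspace embedding guarantee of Theorem 2.9 in expectation; without it, heavily sampled rows would be overweighted.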

Proof

See Raphel’s wiki.