解超定方程 Ax=0 与奇异值分解

Reference:

解超定方程 Ax=0

文章跳转：

前文已经介绍了奇异值分解，现在来聊聊奇异值分解的一个应用。

工程中很多问题会归结为求解超定方程 $\mathbf{Ax}=0$ ， $\mathbf{A}$ 是 $m\times n$ 的矩阵，且 $m > n$ 。如SLAM中三角化地图点，PnP 等一些问题都是求解这个方程。
很显然，这个方程有一个零解，但这不是我们想要的，我们实际想求非零解。

为了求非零解，我们对 $\mathbf{x}$ 加上一个约束 $\|\mathbf{x}\|^2=1$ 。也就是限制 $\mathbf{x}$ 的长度为 1 。并构建成一个带约束的最小二乘问题：
$\tag{1} \hat{\mathbf{x}}=\arg \min \|\mathbf{A} \mathbf{x}\|^2, \text { subject to }\|\mathbf{x}\|^2=1$ 这是一个带约束的最小二乘问题，我们把拉格朗日搬出来：
$\tag{2} \begin{aligned} L(\mathbf{x}, \lambda) & =\|\mathbf{A} \mathbf{x}\|^2+\lambda\left(1-\|\mathbf{x}\|^2\right) \\ & =\mathbf{x}^T \mathbf{A}^T \mathbf{A} \mathbf{x}+\lambda\left(1-\mathbf{x}^T \mathbf{x}\right) \end{aligned}$ 为了求极值，我们分别对 $\mathbf{x}$ 和 $\lambda$ 求偏导数，令为 $0$ ：
$\tag{3} \begin{array}{l} \frac{\partial L(\mathbf{x}, \lambda)}{\partial \mathbf{x}}=2 \mathbf{A}^T \mathbf{A} \mathbf{x}-2 \lambda \mathbf{x}=0\end{array}$ $\tag{4} \begin{array}{l} \frac{\partial L(\mathbf{x}, \lambda)}{\partial \lambda}=1-\mathbf{x}^T \mathbf{x}=0 \end{array}$ 把(3)式整理一下：
$\tag{5} \begin{array}{r} \left(\mathbf{A}^T \mathbf{A}-\lambda \mathbf{I}\right) \mathbf{x}=0 \end{array}$ $\tag{6}\begin{array}{r} \mathbf{A}^T \mathbf{A} \mathbf{x}=\lambda \mathbf{x} \end{array}$ 可以看出 $\lambda$ 和 $\mathbf{x}$ 分别是 $\mathbf{A}^T \mathbf{A}$ 的特征值和特征向量。也就是说(1)式的解，就是这些特征向量中的一个。
问题来了，那么多的特征向量，应该选择哪个作为解呢？我们展开 $\|\mathbf{A x}\|^2$ 看一下：
$\tag{7} \|\mathbf{A} \mathbf{x}\|^2=\mathbf{x}^T \mathbf{A}^T \mathbf{A} \mathbf{x}=\mathbf{x}^T \lambda \mathbf{x}=\lambda \mathbf{x}^T \mathbf{x}=\lambda$

上方公式(7)的推导，利用了公式(6) 及 $\|\mathbf{x}\|^2=1$ 。

也就是说，我们想要 $\|\mathbf{A} \mathbf{x}\|^2$ 最小，就需要 $\lambda$ 最小。
那方程(1)的非零解就是 $\mathbf{A}^T\mathbf{A}$ 最小特征值 $\lambda$ 对应的特征向量，即最小奇异值对应的右奇异向量。

解超定方程 Ax=0 与奇异值分解

猜你喜欢