您的位置：首页 > 其它

凸优化：ADMM(Alternating Direction Method of Multipliers)交替方向乘子算法系列之四： General Patterns

2015-07-09 15:31 911 查看

最近开始对凸优化(convex optimization)中的ADMM(Alternating Direction Method of Multipliers)交替方向乘子算法开始感兴趣，接下来我会写一系列关于ADMM(Alternating Direction Method of Multipliers)交替方向乘子算法的内容。

凸优化：ADMM(Alternating Direction Method of Multipliers)交替方向乘子算法系列之四： General Patterns

本文地址：/article/1323062.html

4- 一般模式（General Patterns）

本章主要探讨如何加速 x-和 z-更新步骤。主要考虑三种类型：quadratic objective

terms, separable objective and constraints 和 smooth objective terms.

我们首先表示 x-更新步骤为：

其中 v=−Bz+c−u v = −Bz + c − u 是一个常量。（对称适用于 z-更新步骤）

4-1 近似算子（Proximity Operator）

考虑最简单的情况 A=IA = I，因此 x-更新步骤为

右边看做关于 u 的一个函数，标记为 proxf,ρ(v)prox_{f, ρ}(v)，叫做 f 关于 ρ 的近似算子（the proximity operator of f with penalty ρ ）。

在变分分析，

是 f 的 Moreau envelope 或 Moreau-Yosida regularization，与接近点算（proximal point algorithm ）的理论联系起来。因此接近算子（proximity operator）中的 x-最小化被称为接近端最小化（proximal minimization）。

当 f 足够简单时，x-update 就能评估分析。例如，f 是一个闭合非空凸集 C 的指示函数时，

x-update 为

其中 ΠCΠ_{C} 为 C 上的映射（Euclidean范式）。等式成立与 ρ 无关。更多例子见 [41]

[41] P. L. Combettes and J. C. Pesquet, “Proximal Splitting Methods in Signal Processing,” arXiv:0912.3522, 2009.

4-2 二次型目标项（Quadratic Objective Terms）

假设 f 为（凸）二次函数，

其中 P∈Sn+P ∈ S^{n}_{+} ，对称正半定 n × n 矩阵。

假设 P+ρATAP + ρA_{T}A 是可逆的，x+x^{+} 是 u 的仿射函数（affine function）

换句话说，计算 x-update 等于求解一个关于正定系数矩阵（positive definite coefficient matrix）P+ρATAP + ρA_{T}A 和 ρATv−qρA^{T}v − q 的线性系统。

4-2-1 直接法（Direct Methods）

求解 Fx=gFx = g, 首先分解 F=F1F2⋅⋅⋅FkF = F_{1}F_{2} ··· F_{k}， FiF_{i} 为简单矩阵，接着计算 x=F−1bx = F^{−1}b通过解一系列问题 Fizi=zi−1F_{i}z_{i} = z_{i−1} ，其中 z1=F−11gz_{1} = F_{1}^{−1}g 和 x=zkx = z_{k}。

4-2-2 利用稀疏（Exploiting Sparsity）

令 F=P+ρATAF = P + ρA^{T}A，当 F 是稀疏时，

- if P and A are diagonal n × n matrices, then both the factor and solve costs are

O(n).

- If P and A are banded, then so is F.

- If F is banded with bandwidth k, the factorization cost is O(nk2)O(nk^{2}) and the back-solve cost is O(nk). In this case, the x-update can be carried out at a cost O(nk2)O(nk^{2}), plus the cost of forming F.