Check out Tianrui Qi's Optimization Wiki page for more details about augmented Lagrangian method (ALM), alternating direction method of multipliers (ADMM), and other optimization algorithms.
Given a set of data
with
or equivalently
where
While the feasibility of the above problems depends on the separability of the data, a soft-margin SVM is employed if the data points can not be strictly separated by solving the problem:
Note that at the optimal solution
Thus, we can rewrite the equation into
Now, define
we can rewrite the primal problem as
Introduce a variable
Note that this is the standard problem that can be solved by ADMM. Note that the problem can also be solved by augmented Lagrangian method (ALM), but for this derivation, we use ADMM.
Let
The goal is to solve the dual problem
Applying the ADMM to the dual problem we formulated above, we first solve
by solving two subproblems
For
setting the gradient equal to zero, we have that
we get
setting the gradient equal to zero, we have that
i.e.
We have three cases,
Next, we solve
by setting
According to the derivation from the lecture, the primal residual is
and the dual residual is
We stop the algorithm when maximum of primal and dual residual less than tol
.
Since we do not have any sub-iteration, we set the maxit = 5000
and stop the
algorithm when the iteration is larger than maxit
.
We have three training datasets: rho02,
rho08, and spam.
Implementing the ADMM-based algorithm according to the derivation above, we test
our algorithm on these training datasets with different
The results are shown below, including the violation of primal and dual
feasibility at each outer iteration, the time needed, and final accuracy.
The accuracy increase as
- Final Project, MATP 4820 Computational Optimization by Prof. Yangyang Xu, Rensselaer Polytechnic Institute.
- Ye, Gui–Bo, Yifei Chen, and Xiaohui Xie. "Efficient variable selection in support vector machines via the alternating direction method of multipliers." Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, 2011.