Random coordinate descent
It is proposed that this article be deleted because of the following concern:
If you can address this concern by improving, copyediting, sourcing, renaming, or merging the page, please edit this page and do so. You may remove this message if you improve the article or otherwise object to deletion for any reason. Although not required, you are encouraged to explain why you object to the deletion, either in your edit summary or on the talk page. If this template is removed, do not replace it. This message has remained in place for seven days, so the article may be deleted without further notice. Find sources: "Random coordinate descent" – news · newspapers · books · scholar · JSTOR Nominator: Please consider notifying the author/project: {{subst:proposed deletion notify|Random coordinate descent|concern=Very little content or context. Barely enough context to avoid CSD A1 and that is about it.}} ~~~~ Timestamp: 20120120175213 17:52, 20 January 2012 (UTC) Administrators: delete |
Random Coordinate Descent Method is optimization technique popularized in last years. It can be seen as a generalization of Coordinate descent Method with one modification: Coorinate directions are chosen randomly. Many authors reported that randomization inprove Rate of convergence (time or number of iterations needed to get solution), but there was no theoretical analysis. The first proper analysis of this methods come from Yurii Nesterov (2010)[1]. In that paper, author proposed a randomized coordinate descent method when first order information is used.
Imagine that one wants to solve problem where is a Smooth function. Nesterov showed that if gradient of is a coordinate-wise lipschitz continuous with constants (i.e. ) than the following algorithm converge to optimal value.
Algorithm Random Coordinate Descent Method Input: //starting point Output: set x=x_0 for k=1,... do choose random coordinate update endfor;
- "←" denotes assignment. For instance, "largest ← item" means that the value of largest changes to the value of item.
- "return" terminates the algorithm and outputs the following value.
One need to realize that this algorithm is random and hence also output after iterations is a random variable. It was shown in [2] that if , where , is an optimal solution (), is our confidence level and is our target accuracy, then .
Extension to Block Coordinate Setting
One can naturaly extend this algorithm not only just to coordinates, but to blocks of coordinates. Assume that we have space . This space has 5 coorinate directions, concretely in which Random Coordinate Descent Method can move. Hovewer, one can group some coordinate directions into blocks and we can have istead of those 5 coordinate directions 3 block coordinate directions:

See also
References
- ^ Nesterov, Yurii (2010), "Efficiency of coordinate descent methods on huge-scale optimization problems", CORE Discussion Paper, no. \#2010/2
- ^ Richtárik, Peter; Takáč, Martin (2011), Iteration complexity of randomized block-coordinate descent methods for minimizing a composite function
{{citation}}
: line feed character in|title=
at position 60 (help)