Distributed Bilevel Optimization with Communication Compression
Yutong He, Jie Hu, et al.
ICML 2024
In this work, a gradient-based primal-dual method of multipliers is proposed for solving a class of linearly constrained non-convex problems. We show that with random initialization of the primal and dual variables, the algorithm is able to compute second-order stationary points (SOSPs) with probability one. Further, we present applications of the proposed method in popular signal processing and machine learning problems such as decentralized matrix factorization and decentralized training of overparameterized neural networks. One of the key steps in the analysis is to construct a new loss function for these problems such that the required convergence conditions (especially the gradient Lipschitz conditions) can be satisfied without changing the global optimal points.
Yutong He, Jie Hu, et al.
ICML 2024
Songtao Lu, Xiaodong Cui, et al.
ICASSP 2022
A Saif, Lisha Chen, et al.
INTERSPEECH 2024
Shuai Ma, Yunqi Zhang, et al.
IEEE TIFS