Accelerate distributed stochastic descent for nonconvex optimization with momentumGuojing CongTianyi Liu2020MLHPC/AI4S 2020
Fast Training of Deep Neural Networks for Speech RecognitionGuojing CongBrian Kingsburyet al.2020ICASSP 2020