Conference paperOn Application of the Simplex Type Algorithm for SCLP to Large-scale Fluid Processing Networks
Conference paperFinite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward