Learning for multi-robot cooperation in partially observable stochastic environments with macro-actionsMiao LiuKavinayan Sivakumaret al.2017IROS 2017