Learning in Factored Domains with Information-Constrained Visual Representations. Tyler Malloy, Tim Klinger, et al. 2022. NeurIPS 2022.
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. Dong Ki Kim, Miao Liu, et al. 2021. ICML 2021.