ComPhy: Compositional Physical Reasoning of Objects and Events from VideosZhenfang ChenKexin Yiet al.2022ICLR 2022
Neural-symbolic VQA: Disentangling reasoning from vision and language understandingKexin YiAntonio Torralbaet al.2018NeurIPS 2018