PTR: A Benchmark for Part-based Conceptual, Relational, and Physical ReasoningYining HongLi Yiet al.2021NeurIPS 2021
AVLnet: Learning audio-visual language representations from instructional videosAndrew RouditchenkoAngie Boggustet al.2021INTERSPEECH 2021
Self-supervised segmentation and source separation on videosAndrew RouditchenkoHang Zhaoet al.2019CVPRW 2019