Bringing Image Structure to Video via Frame-Clip Consistency of Object TokensElad Ben-AvrahamRoei Herziget al.2022NeurIPS 2022