Beyond RNNs: Positional self-attention with co-attention for video question answering. Xiangpeng Li, Jingkuan Song, et al. AAAI, 2019.
Text-instance graph: Exploring the relational semantics for text-based visual question answering. Xiangpeng Li, Bo Wu, et al. Pattern Recognition, 2022.