Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene GraphsRoi HerzigAlon Mendelsonet al.2023EMNLP 2023