Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval. Kuniaki Saito, Kihyuk Sohn, et al. CVPR 2023.
MaskSketch: Unpaired Structure-guided Masked Image Generation. Dina Bashkirova, José Lezama, et al. CVPR 2023.
Prefix Conditioning Unifies Language and Label Supervision. Kuniaki Saito, Kihyuk Sohn, et al. CVPR 2023.