%0 Journal Article %T Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone %A Dou, Zi-Yi %A Kamath, Aishwarya %A Gan, Zhe %A Zhang, Pengchuan %A Wang, Jianfeng %A Li, Linjie %A Liu, Zicheng %A Liu, Ce %A LeCun, Yann %A Peng, Nanyun %A Gao, Jianfeng %A Wang, Lijuan %J Computing Research Repository %V 2023 %N 2206 %D 2022-06-15 %~ DeepDyve