%0 Journal Article %T ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval %A Cheng, Mengjun %A Sun, Yipeng %A Wang, Longchao %A Zhu, Xiongwei %A Yao, Kun %A Chen, Jie %A Song, Guoli %A Han, Junyu %A Liu, Jingtuo %A Ding, Errui %A Wang, Jingdong %J 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) %D 2022-06-01 %I IEEE %~ DeepDyve