Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.05465
Cited By
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
10 March 2022
Jie Lei
Xinlei Chen
Ning Zhang
Meng-xing Wang
Mohit Bansal
Tamara L. Berg
Licheng Yu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval"
8 / 8 papers shown
Title
RECLIP: Resource-efficient CLIP by Training with Small Images
Runze Li
Dahun Kim
B. Bhanu
Weicheng Kuo
VLM
CLIP
22
12
0
12 Apr 2023
Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database
Ye Liu
Semih Yavuz
Rui Meng
Dragomir R. Radev
Caiming Xiong
Yingbo Zhou
21
29
0
09 Nov 2022
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
249
525
0
04 Feb 2021
SEED: Self-supervised Distillation For Visual Representation
Zhiyuan Fang
Jianfeng Wang
Lijuan Wang
Lei Zhang
Yezhou Yang
Zicheng Liu
SSL
231
190
0
12 Jan 2021
Distilling Knowledge from Reader to Retriever for Question Answering
Gautier Izacard
Edouard Grave
RALM
180
251
0
08 Dec 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
106
275
0
24 Jan 2020
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,471
0
17 Apr 2017
1