Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.05707
Cited By
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling
10 March 2023
Jiaqi Xu
Bo Liu
Yunkuo Chen
Mengli Cheng
Xing Shi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling"
2 / 2 papers shown
Title
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
309
778
0
18 Apr 2021
1