Video Corpus Moment Retrieval with Contrastive LearningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021

Hao Zhang

Aixin Sun

Wei Jing

Guoshun Nan

Liangli Zhen

Qiufeng Wang

Rick Siow Mong Goh

274

102

13 May 2021

Connecting What to Say With Where to Look by Modeling Human Attention TracesComputer Vision and Pattern Recognition (CVPR), 2021

Babak Damavandi

262

12 May 2021

VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching

Hanwang Zhang

289

12 May 2021

Language Acquisition is Embodied, Interactive, Emotive: a Research Proposal

C. Kennington

LM&Ro

106

10 May 2021

Spoken Moments: Learning Joint Audio-Visual Representations from Video DescriptionsComputer Vision and Pattern Recognition (CVPR), 2021

181

10 May 2021

Recent Advances in Deep Learning Based Dialogue Systems: A Systematic SurveyArtificial Intelligence Review (AIR), 2021

855

322

10 May 2021

A survey on VQA_Datasets and Approaches

Yeyun Zou

Qiyu Xie

277

02 May 2021

Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's HeadsInternational Joint Conference on Artificial Intelligence (IJCAI), 2021

Chenyu Gao

Qi Zhu

Peng Wang

Qi Wu

109

30 Apr 2021

Comparing Visual Reasoning in Humans and AI

Shravan Murlidaran

Wenjie Wang

Miguel P. Eckstein

197

29 Apr 2021

A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations

188

29 Apr 2021

Multimodal Contrastive Training for Visual Representation LearningComputer Vision and Pattern Recognition (CVPR), 2021

252

191

26 Apr 2021

MDETR -- Modulated Detection for End-to-End Multi-Modal UnderstandingIEEE International Conference on Computer Vision (ICCV), 2021

644

1,058

26 Apr 2021

SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and ImagesInternational Workshop on Semantic Evaluation (SemEval), 2021

Dimitar Dimitrov

Firoj Alam

Giovanni Da San Martino

150

120

25 Apr 2021

MusCaps: Generating Captions for Music AudioIEEE International Joint Conference on Neural Network (IJCNN), 2021

290

24 Apr 2021

M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with TransformersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

Jun Wang

Zuxuan Wu

221

157

24 Apr 2021

Playing Lottery Tickets with Vision and LanguageAAAI Conference on Artificial Intelligence (AAAI), 2021

Zicheng Liu

312

23 Apr 2021

Multiscale Vision TransformersIEEE International Conference on Computer Vision (ICCV), 2021

Christoph Feichtenhofer

ViT

482

1,521

22 Apr 2021

Comprehensive Multi-Modal Interactions for Referring Image SegmentationFindings (Findings), 2021

Kanishk Jain

Vineet Gandhi

237

21 Apr 2021

Understanding Synonymous Referring Expressions via Contrastive FeaturesInternational Journal of Computer Vision (IJCV), 2021

Yi-Wen Chen

Yi-Hsuan Tsai

Ming-Hsuan Yang

ObjD

185

20 Apr 2021

Detector-Free Weakly Supervised Grounding by SeparationIEEE International Conference on Computer Vision (ICCV), 2021

...

195

20 Apr 2021

Understanding Chinese Video and Language via Contrastive Multimodal Pre-TrainingACM Multimedia (ACM MM), 2021

163

19 Apr 2021

BM-NAS: Bilevel Multimodal Neural Architecture SearchAAAI Conference on Artificial Intelligence (AAAI), 2021

Yihang Yin

Siyu Huang

Xiang Zhang

232

19 Apr 2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

270

167

18 Apr 2021

CLIPScore: A Reference-free Evaluation Metric for Image CaptioningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Yejin Choi

969

2,298

18 Apr 2021

Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales

...

Antonio Torralba

181

17 Apr 2021

TransVG: End-to-End Visual Grounding with TransformersIEEE International Conference on Computer Vision (ICCV), 2021

621

442

17 Apr 2021

LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding

Spurthi Amba Hombaiah

Michael Bendersky

150

16 Apr 2021