Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval

23 August 2018

Niluthpol Chowdhury Mithun

Yikang Shen

Evangelos E. Papalexakis

Amit K. Roy-Chowdhury

ArXiv (abs)PDF HTML

Papers citing "Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval"

25 / 25 papers shown

Semi-Supervised Image Captioning Considering Wasserstein Graph Matching

Yang Yang

288

26 Mar 2024

Open-Vocabulary Camouflaged Object Segmentation

Huchuan Lu

330

19 Nov 2023

Robust Visual Question Answering: Datasets, Methods, and Future ChallengesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Pinghui Wang

Jun Liu

333

21 Jul 2023

Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos

Md Zahid Hasan

Jiajing Chen

Jiyang Wang

Mohammed Shaiqur Rahman

351

16 Jun 2023

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web VideosComputer Vision and Pattern Recognition (CVPR), 2022

Tomávs Souvcek

Jean-Baptiste Alayrac

Antoine Miech

Ivan Laptev

Josef Sivic

230

22 Mar 2022

Cross Modal Retrieval with Querybank NormalisationComputer Vision and Pattern Recognition (CVPR), 2021

Yang Liu

290

115

23 Dec 2021

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image CaptioningIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021

Yang Yang

Haoran Wei

Hengshu Zhu

Dianhai Yu

Hui Xiong

Jian Yang

SSL

100

22 Oct 2021

Multimodal Entity Linking for TweetsEuropean Conference on Information Retrieval (ECIR), 2020

161

07 Apr 2021

Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021

...

2.0K

41,259

26 Feb 2021

Decoupling the Role of Data, Attention, and Losses in Multimodal TransformersTransactions of the Association for Computational Linguistics (TACL), 2021

Lisa Anne Hendricks

John F. J. Mellor

R. Schneider

Jean-Baptiste Alayrac

Aida Nematzadeh

234

126

31 Jan 2021

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual LocalizationACM Multimedia (ACM MM), 2020

Niluthpol Chowdhury Mithun

211

12 Sep 2020

Learning Video Representations from Textual Web Supervision

245

29 Jul 2020

COBE: Contextualized Object Embeddings from Narrated Instructional VideoNeural Information Processing Systems (NeurIPS), 2020

Gedas Bertasius

Lorenzo Torresani

187

14 Jul 2020

A Feature Analysis for Multimodal News Retrieval

167

13 Jul 2020

Self-Supervised MultiModal Versatile Networks

Jean-Baptiste Alayrac

423

400

29 Jun 2020

Mitigating Gender Bias in Captioning Systems

538

15 Jun 2020

COBRA: Contrastive Bi-Modal Representation Algorithm

Vishaal Udandarao

A. Maiti

Deepak Srivatsav

Suryatej Reddy Vyalla

Yifang Yin

R. Shah

221

07 May 2020

Graph Structured Network for Image-Text MatchingComputer Vision and Pattern Recognition (CVPR), 2020

188

277

01 Apr 2020

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder FrameworkIEEE transactions on multimedia (TMM), 2020

Yaochen Zhu

Jiayi Xie

Zhenzhong Chen

28 Mar 2020

IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text RetrievalComputer Vision and Pattern Recognition (CVPR), 2020

Hui Chen

Jungong Han

193

365

08 Mar 2020

End-to-End Learning of Visual Representations from Uncurated Instructional VideosComputer Vision and Pattern Recognition (CVPR), 2019

Antoine Miech

Jean-Baptiste Alayrac

608

754

13 Dec 2019

Prediction and Description of Near-Future Activities in VideoComputer Vision and Image Understanding (CVIU), 2019

T. Mahmud

Mohammad Billah

Mahmudul Hasan

Amit K. Roy-Chowdhury

379

02 Aug 2019

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video ClipsIEEE International Conference on Computer Vision (ICCV), 2019

Antoine Miech

Dimitri Zhukov

Jean-Baptiste Alayrac

512

1,366

07 Jun 2019

Multitask Text-to-Visual Embedding with Titles and Clickthrough Data

30 May 2019

Weakly Supervised Video Moment Retrieval From Text Queries

Niluthpol Chowdhury Mithun

S. Paul

Amit K. Roy-Chowdhury

284

211

05 Apr 2019