Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation

26 November 2014

Lior Wolf

Papers citing "Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation"

Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval

325

26 Dec 2023

Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study

European Conference on Information Retrieval (ECIR), 2023

Mariya Hendriksen

Svitlana Vakulenko

E. Kuiper

Maarten de Rijke

300

12 Jan 2023

Describing Sets of Images with Textual-PCA

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Oded Hupert

Idan Schwartz

Lior Wolf

CoGe

148

21 Oct 2022

Zero-Shot Video Captioning with Evolving Pseudo-Tokens

Lior Wolf

231

22 Jul 2022

What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs

Neural Information Processing Systems (NeurIPS), 2022

Tal Shaharabany

Yoad Tewel

Lior Wolf

ObjD

254

19 Jun 2022

ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Computer Vision and Pattern Recognition (CVPR), 2021

Lior Wolf

327

235

29 Nov 2021

A Survey on Multi-modal Summarization

206

11 Sep 2021

A Better Loss for Visual-Textual Grounding

ACM Symposium on Applied Computing (SAC), 2021

175

11 Aug 2021

Efficient Algorithms for Estimating the Parameters of Mixed Linear Regression Models

142

12 May 2021

Continual learning in cross-modal retrieval

148

14 Apr 2021

Probabilistic Embeddings for Cross-Modal Retrieval

Computer Vision and Pattern Recognition (CVPR), 2021

Sanghyuk Chun

Seong Joon Oh

Rafael Sampaio de Rezende

Yannis Kalantidis

Diane Larlus

UQCV

908

259

13 Jan 2021

Learning to Scale Multilingual Representations for Vision-Language Tasks

European Conference on Computer Vision (ECCV), 2020

196

09 Apr 2020

Ladder Loss for Coherent Visual-Semantic Embedding

AAAI Conference on Artificial Intelligence (AAAI), 2019

280

18 Nov 2019

Do Cross Modal Systems Leverage Semantic Relationships?

Shah Nawaz

Muhammad Kamran Janjua

03 Sep 2019

Language Features Matter: Effective Language Representations for Vision-Language Tasks

IEEE International Conference on Computer Vision (ICCV), 2019

154

17 Aug 2019

Semi Supervised Phrase Localization in a Bidirectional Caption-Image Retrieval Framework

Deepan Das

Noor Mohammed Ghouse

Shashank Verma

Yin Li

116

08 Aug 2019

Position Focused Attention Network for Image-Text Matching

International Joint Conference on Artificial Intelligence (IJCAI), 2019

179

186

23 Jul 2019

Coherent and Controllable Outfit Generation

Kedan Li

Chen Liu

David A. Forsyth

266

17 Jun 2019

On the Behavior of the Expectation-Maximization Algorithm for Mixture Models

Babak Barazandeh

Meisam Razaviyayn

115

24 Sep 2018

Revisiting Cross Modal Retrieval

Shah Nawaz

Muhammad Kamran Janjua

Alessandro Calefati

I. Gallo

127

19 Jul 2018

iParaphrasing: Extracting Visually Grounded Paraphrases via an Image

Chenhui Chu

Mayu Otani

Yuta Nakashima

136

12 Jun 2018

Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts

Jinjun Xiong

127

29 Mar 2018

Unsupervised Textual Grounding: Linking Words to Image Concepts

Raymond A. Yeh

Minh Do

Alex Schwing

120

29 Mar 2018

Learning Type-Aware Embeddings for Fashion Compatibility

268

244

25 Mar 2018

Learning Social Image Embedding with Deep Multimodal Attention Networks

Feiran Huang

Xiaoming Zhang

Zhoujun Li

Tao Mei

Yueying He

Zhonghua Zhao

127

18 Oct 2017

Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval

IEEE International Conference on Computer Vision (ICCV), 2017

Yuming Shen

Li Liu

Ling Shao

Jingkuan Song

145

08 Aug 2017

Multimodal Machine Learning: A Survey and Taxonomy

T. Baltrušaitis

Chaitanya Ahuja

Louis-Philippe Morency

534

3,572

26 May 2017

Learning Two-Branch Neural Networks for Image-Text Matching Tasks

Yin Li

280

530

11 Apr 2017

Backpropagation Training for Fisher Vectors within Neural Networks

P. Wieschollek

F. Groh

Hendrik P. A. Lensch

FedML

120

08 Feb 2017

Learning Visual N-Grams from Web Data

IEEE International Conference on Computer Vision (ICCV), 2016

Ang Li

Allan Jabri

Armand Joulin

Laurens van der Maaten

VLM

292

149

29 Dec 2016

Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions

126

23 Jun 2016

Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016

596

1,541

06 Jun 2016

Multi-Cue Zero-Shot Learning with Strong Supervision

Zeynep Akata

Mateusz Malinowski

Mario Fritz

Bernt Schiele

209

152

29 Mar 2016

Learning Deep Structure-Preserving Image-Text Embeddings

Liwei Wang

Yin Li

Svetlana Lazebnik

479

820

19 Nov 2015

Natural Language Object Retrieval

305

569

13 Nov 2015

Learning to Answer Questions From Image Using Convolutional Neural Network

AAAI Conference on Artificial Intelligence (AAAI), 2015

Lin Ma

Zhengdong Lu

Hang Li

229

266

01 Jun 2015

Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering

Neural Information Processing Systems (NeurIPS), 2015

Jie Zhou

327

520

21 May 2015

Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models

Bryan A. Plummer

Liwei Wang

Christopher M. Cervantes

Juan C. Caicedo

Anjali Narayan-Chen

Svetlana Lazebnik

570

2,380

19 May 2015

Exploring Models and Data for Image Question Answering

Mengye Ren

Ryan Kiros

R. Zemel

346

750

08 May 2015

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images

Yi Yang

194

160

25 Apr 2015