1411.7399
Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation
26 November 2014
Benjamin Klein
Guy Lev
Gil Sadeh
Lior Wolf
Papers citing
"Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation"
40 / 40 papers shown
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval
Zeqiang Wei
Kai Jin
Xiuzhuang Zhou
MedIm
325
8
0
26 Dec 2023
Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study
European Conference on Information Retrieval (ECIR), 2023
Mariya Hendriksen
Svitlana Vakulenko
E. Kuiper
Maarten de Rijke
300
5
0
12 Jan 2023
Describing Sets of Images with Textual-PCA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Oded Hupert
Idan Schwartz
Lior Wolf
CoGe
148
1
0
21 Oct 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
231
32
0
22 Jul 2022
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Neural Information Processing Systems (NeurIPS), 2022
Tal Shaharabany
Yoad Tewel
Lior Wolf
ObjD
254
23
0
19 Jun 2022
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Computer Vision and Pattern Recognition (CVPR), 2022
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
327
235
0
29 Nov 2021
A Survey on Multi-modal Summarization
Anubhav Jangra
Sourajit Mukherjee
Adam Jatowt
S. Saha
M. Hasanuzzaman
206
77
0
11 Sep 2021
A Better Loss for Visual-Textual Grounding
ACM Symposium on Applied Computing (SAC), 2021
Davide Rigoni
Luciano Serafini
A. Sperduti
ObjD
175
3
0
11 Aug 2021
Efficient Algorithms for Estimating the Parameters of Mixed Linear Regression Models
Babak Barazandeh
Ali Ghafelebashi
Meisam Razaviyayn
Ram Sriharsha
142
3
0
12 May 2021
Continual learning in cross-modal retrieval
Kai Wang
Luis Herranz
Joost van de Weijer
CLL
148
17
0
14 Apr 2021
Probabilistic Embeddings for Cross-Modal Retrieval
Computer Vision and Pattern Recognition (CVPR), 2021
Sanghyuk Chun
Seong Joon Oh
Rafael Sampaio de Rezende
Yannis Kalantidis
Diane Larlus
UQCV
908
259
0
13 Jan 2021
Learning to Scale Multilingual Representations for Vision-Language Tasks
European Conference on Computer Vision (ECCV), 2020
Andrea Burns
Donghyun Kim
Derry Wijaya
Kate Saenko
Bryan A. Plummer
196
36
0
09 Apr 2020
Ladder Loss for Coherent Visual-Semantic Embedding
AAAI Conference on Artificial Intelligence (AAAI), 2020
Mo Zhou
Zhenxing Niu
Le Wang
Zhanning Gao
Qilin Zhang
G. Hua
280
45
0
18 Nov 2019
Do Cross Modal Systems Leverage Semantic Relationships?
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
Faisal Shafait
97
9
0
03 Sep 2019
Language Features Matter: Effective Language Representations for Vision-Language Tasks
IEEE International Conference on Computer Vision (ICCV), 2019
Andrea Burns
Reuben Tan
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
154
28
0
17 Aug 2019
Semi Supervised Phrase Localization in a Bidirectional Caption-Image Retrieval Framework
Deepan Das
Noor Mohammed Ghouse
Shashank Verma
Yin Li
116
0
0
08 Aug 2019
Position Focused Attention Network for Image-Text Matching
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
179
186
0
23 Jul 2019
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
266
15
0
17 Jun 2019
On the Behavior of the Expectation-Maximization Algorithm for Mixture Models
Babak Barazandeh
Meisam Razaviyayn
115
19
0
24 Sep 2018
Revisiting Cross Modal Retrieval
Shah Nawaz
Muhammad Kamran Janjua
Alessandro Calefati
I. Gallo
127
6
0
19 Jul 2018
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image
Chenhui Chu
Mayu Otani
Yuta Nakashima
136
8
0
12 Jun 2018
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts
Raymond A. Yeh
Jinjun Xiong
Wen-mei W. Hwu
Minh Do
Alex Schwing
127
58
0
29 Mar 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
Alex Schwing
120
44
0
29 Mar 2018
Learning Type-Aware Embeddings for Fashion Compatibility
Mariya I. Vasileva
Bryan A. Plummer
Krishna Dusad
Shreya Rajpal
Ranjitha Kumar
David A. Forsyth
268
244
0
25 Mar 2018
Learning Social Image Embedding with Deep Multimodal Attention Networks
Feiran Huang
Xiaoming Zhang
Zhoujun Li
Tao Mei
Yueying He
Zhonghua Zhao
127
20
0
18 Oct 2017
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
IEEE International Conference on Computer Vision (ICCV), 2017
Yuming Shen
Li Liu
Ling Shao
Jingkuan Song
145
51
0
08 Aug 2017
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
534
3,572
0
26 May 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
280
530
0
11 Apr 2017
Backpropagation Training for Fisher Vectors within Neural Networks
P. Wieschollek
F. Groh
Hendrik P. A. Lensch
FedML
120
2
0
08 Feb 2017
Learning Visual N-Grams from Web Data
IEEE International Conference on Computer Vision (ICCV), 2017
Ang Li
Allan Jabri
Armand Joulin
Laurens van der Maaten
VLM
292
149
0
29 Dec 2016
Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions
F. Carrara
Andrea Esuli
T. Fagni
Fabrizio Falchi
Alejandro Moreo
DiffM
126
31
0
23 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
596
1,541
0
06 Jun 2016
Multi-Cue Zero-Shot Learning with Strong Supervision
Zeynep Akata
Mateusz Malinowski
Mario Fritz
Bernt Schiele
209
152
0
29 Mar 2016
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
479
820
0
19 Nov 2015
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
305
569
0
13 Nov 2015
Learning to Answer Questions From Image Using Convolutional Neural Network
AAAI Conference on Artificial Intelligence (AAAI), 2016
Lin Ma
Zhengdong Lu
Hang Li
229
266
0
01 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Neural Information Processing Systems (NeurIPS), 2015
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
327
520
0
21 May 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Anjali Narayan-Chen
Svetlana Lazebnik
570
2,380
0
19 May 2015
Exploring Models and Data for Image Question Answering
Mengye Ren
Ryan Kiros
R. Zemel
346
750
0
08 May 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
194
160
0
25 Apr 2015