ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.7399
  4. Cited By
Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for
  Image Annotation
v1v2 (latest)

Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation

26 November 2014
Benjamin Klein
Guy Lev
Gil Sadeh
Lior Wolf
ArXiv (abs)PDFHTML

Papers citing "Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation"

40 / 40 papers shown
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report
  Retrieval
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval
Zeqiang Wei
Kai Jin
Xiuzhuang Zhou
MedIm
325
8
0
26 Dec 2023
Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A
  Reproducibility Study
Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility StudyEuropean Conference on Information Retrieval (ECIR), 2023
Mariya Hendriksen
Svitlana Vakulenko
E. Kuiper
Maarten de Rijke
300
5
0
12 Jan 2023
Describing Sets of Images with Textual-PCA
Describing Sets of Images with Textual-PCAConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Oded Hupert
Idan Schwartz
Lior Wolf
CoGe
148
1
0
21 Oct 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
231
32
0
22 Jul 2022
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding
  without Text Inputs
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsNeural Information Processing Systems (NeurIPS), 2022
Tal Shaharabany
Yoad Tewel
Lior Wolf
ObjD
254
23
0
19 Jun 2022
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticComputer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
327
235
0
29 Nov 2021
A Survey on Multi-modal Summarization
A Survey on Multi-modal Summarization
Anubhav Jangra
Sourajit Mukherjee
Adam Jatowt
S. Saha
M. Hasanuzzaman
206
77
0
11 Sep 2021
A Better Loss for Visual-Textual Grounding
A Better Loss for Visual-Textual GroundingACM Symposium on Applied Computing (SAC), 2021
Davide Rigoni
Luciano Serafini
A. Sperduti
ObjD
175
3
0
11 Aug 2021
Efficient Algorithms for Estimating the Parameters of Mixed Linear
  Regression Models
Efficient Algorithms for Estimating the Parameters of Mixed Linear Regression Models
Babak Barazandeh
Ali Ghafelebashi
Meisam Razaviyayn
Ram Sriharsha
142
3
0
12 May 2021
Continual learning in cross-modal retrieval
Continual learning in cross-modal retrieval
Kai Wang
Luis Herranz
Joost van de Weijer
CLL
148
17
0
14 Apr 2021
Probabilistic Embeddings for Cross-Modal Retrieval
Probabilistic Embeddings for Cross-Modal RetrievalComputer Vision and Pattern Recognition (CVPR), 2021
Sanghyuk Chun
Seong Joon Oh
Rafael Sampaio de Rezende
Yannis Kalantidis
Diane Larlus
UQCV
908
259
0
13 Jan 2021
Learning to Scale Multilingual Representations for Vision-Language Tasks
Learning to Scale Multilingual Representations for Vision-Language TasksEuropean Conference on Computer Vision (ECCV), 2020
Andrea Burns
Donghyun Kim
Derry Wijaya
Kate Saenko
Bryan A. Plummer
196
36
0
09 Apr 2020
Ladder Loss for Coherent Visual-Semantic Embedding
Ladder Loss for Coherent Visual-Semantic EmbeddingAAAI Conference on Artificial Intelligence (AAAI), 2019
Mo Zhou
Zhenxing Niu
Le Wang
Zhanning Gao
Qilin Zhang
G. Hua
280
45
0
18 Nov 2019
Do Cross Modal Systems Leverage Semantic Relationships?
Do Cross Modal Systems Leverage Semantic Relationships?
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
Faisal Shafait
97
9
0
03 Sep 2019
Language Features Matter: Effective Language Representations for
  Vision-Language Tasks
Language Features Matter: Effective Language Representations for Vision-Language TasksIEEE International Conference on Computer Vision (ICCV), 2019
Andrea Burns
Reuben Tan
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
154
28
0
17 Aug 2019
Semi Supervised Phrase Localization in a Bidirectional Caption-Image
  Retrieval Framework
Semi Supervised Phrase Localization in a Bidirectional Caption-Image Retrieval Framework
Deepan Das
Noor Mohammed Ghouse
Shashank Verma
Yin Li
116
0
0
08 Aug 2019
Position Focused Attention Network for Image-Text Matching
Position Focused Attention Network for Image-Text MatchingInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
179
186
0
23 Jul 2019
Coherent and Controllable Outfit Generation
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
266
15
0
17 Jun 2019
On the Behavior of the Expectation-Maximization Algorithm for Mixture
  Models
On the Behavior of the Expectation-Maximization Algorithm for Mixture Models
Babak Barazandeh
Meisam Razaviyayn
115
19
0
24 Sep 2018
Revisiting Cross Modal Retrieval
Revisiting Cross Modal Retrieval
Shah Nawaz
Muhammad Kamran Janjua
Alessandro Calefati
I. Gallo
127
6
0
19 Jul 2018
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image
Chenhui Chu
Mayu Otani
Yuta Nakashima
136
8
0
12 Jun 2018
Interpretable and Globally Optimal Prediction for Textual Grounding
  using Image Concepts
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts
Raymond A. Yeh
Jinjun Xiong
Wen-mei W. Hwu
Minh Do
Alex Schwing
127
58
0
29 Mar 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
Alex Schwing
120
44
0
29 Mar 2018
Learning Type-Aware Embeddings for Fashion Compatibility
Learning Type-Aware Embeddings for Fashion Compatibility
Mariya I. Vasileva
Bryan A. Plummer
Krishna Dusad
Shreya Rajpal
Ranjitha Kumar
David A. Forsyth
268
244
0
25 Mar 2018
Learning Social Image Embedding with Deep Multimodal Attention Networks
Learning Social Image Embedding with Deep Multimodal Attention Networks
Feiran Huang
Xiaoming Zhang
Zhoujun Li
Tao Mei
Yueying He
Zhonghua Zhao
127
20
0
18 Oct 2017
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual
  Cross Retrieval
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross RetrievalIEEE International Conference on Computer Vision (ICCV), 2017
Yuming Shen
Li Liu
Ling Shao
Jingkuan Song
145
51
0
08 Aug 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
534
3,572
0
26 May 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
280
530
0
11 Apr 2017
Backpropagation Training for Fisher Vectors within Neural Networks
Backpropagation Training for Fisher Vectors within Neural Networks
P. Wieschollek
F. Groh
Hendrik P. A. Lensch
FedML
120
2
0
08 Feb 2017
Learning Visual N-Grams from Web Data
Learning Visual N-Grams from Web DataIEEE International Conference on Computer Vision (ICCV), 2016
Ang Li
Allan Jabri
Armand Joulin
Laurens van der Maaten
VLM
292
149
0
29 Dec 2016
Picture It In Your Mind: Generating High Level Visual Representations
  From Textual Descriptions
Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions
F. Carrara
Andrea Esuli
T. Fagni
Fabrizio Falchi
Alejandro Moreo
DiffM
126
31
0
23 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual GroundingConference on Empirical Methods in Natural Language Processing (EMNLP), 2016
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
596
1,541
0
06 Jun 2016
Multi-Cue Zero-Shot Learning with Strong Supervision
Multi-Cue Zero-Shot Learning with Strong Supervision
Zeynep Akata
Mateusz Malinowski
Mario Fritz
Bernt Schiele
209
152
0
29 Mar 2016
Learning Deep Structure-Preserving Image-Text Embeddings
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
479
820
0
19 Nov 2015
Natural Language Object Retrieval
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
305
569
0
13 Nov 2015
Learning to Answer Questions From Image Using Convolutional Neural
  Network
Learning to Answer Questions From Image Using Convolutional Neural NetworkAAAI Conference on Artificial Intelligence (AAAI), 2015
Lin Ma
Zhengdong Lu
Hang Li
229
266
0
01 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image
  Question Answering
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question AnsweringNeural Information Processing Systems (NeurIPS), 2015
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
327
520
0
21 May 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for
  Richer Image-to-Sentence Models
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Anjali Narayan-Chen
Svetlana Lazebnik
570
2,380
0
19 May 2015
Exploring Models and Data for Image Question Answering
Exploring Models and Data for Image Question Answering
Mengye Ren
Ryan Kiros
R. Zemel
346
750
0
08 May 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence
  Descriptions of Images
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
194
160
0
25 Apr 2015
1