Learning Semantic Concepts and Order for Image and Sentence Matching

6 December 2017
Yan Huang
Qi Wu
Liang Wang
    VLM
arXiv: 1712.02036

Papers citing "Learning Semantic Concepts and Order for Image and Sentence Matching"

43 / 93 papers shown
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Siqi Sun
Yen-Chun Chen
Linjie Li
Shuohang Wang
Yuwei Fang
Jingjing Liu
VLM
211
89
0
16 Mar 2021
A Universal Model for Cross Modality Mapping by Relational Reasoning
Zun Li
Congyan Lang
Liqian Liang
Tao Wang
Songhe Feng
Jun Wu
Yidong Li
152
2
0
26 Feb 2021
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tsu-Jui Fu
Wenjie Wang
Daniel J. McDuff
Yale Song
304
72
0
28 Jan 2021
Similarity Reasoning and Filtration for Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2021
Haiwen Diao
Ying Zhang
Lingyun Ma
Huchuan Lu
588
397
0
05 Jan 2021
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiaopeng Lu
Tiancheng Zhao
Kyusong Lee
269
29
0
01 Jan 2021
Beyond the Deep Metric Learning: Enhance the Cross-Modal Matching with Adversarial Discriminative Domain Regularization
Li Ren
Keqin Li
Liqiang Wang
K. Hua
156
5
0
23 Oct 2020
Universal Weighting Metric Learning for Cross-Modal Matching
Jiwei Wei
Xing Xu
Yang Yang
Yanli Ji
Zheng Wang
Heng Tao Shen
164
99
0
07 Oct 2020
Learning to Represent Image and Text with Denotation Graph
Bowen Zhang
Hexiang Hu
Vihan Jain
Eugene Ie
Fei Sha
164
22
0
06 Oct 2020
Weakly supervised cross-domain alignment with optimal transport
Siyang Yuan
Ke Bai
Liqun Chen
Yizhe Zhang
Chenyang Tao
Chunyuan Li
Guoyin Wang
Ricardo Henao
Lawrence Carin
OT
164
7
0
14 Aug 2020
Graph Optimal Transport for Cross-Domain Alignment
Liqun Chen
Zhe Gan
Yu Cheng
Linjie Li
Lawrence Carin
Jingjing Liu
OT
339
179
0
26 Jun 2020
Deep Multimodal Neural Architecture Search
ACM Multimedia (ACM MM), 2020
Zhou Yu
Yuhao Cui
Jun-chen Yu
Meng Wang
Dacheng Tao
Qi Tian
165
108
0
25 Apr 2020
Transformer Reasoning Network for Image-Text Matching and Retrieval
International Conference on Pattern Recognition (ICPR), 2020
Nicola Messina
Fabrizio Falchi
Andrea Esuli
Giuseppe Amato
ViT
174
65
0
20 Apr 2020
Graph Structured Network for Image-Text Matching
Computer Vision and Pattern Recognition (CVPR), 2020
Chunxiao Liu
Zhendong Mao
Tianzhu Zhang
Hongtao Xie
Bin Wang
Yongdong Zhang
192
281
0
01 Apr 2020
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
Computer Vision and Pattern Recognition (CVPR), 2020
Hui Chen
Guiguang Ding
Xudong Liu
Zijia Lin
Ji Liu
Jungong Han
205
368
0
08 Mar 2020
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
Computer Vision and Pattern Recognition (CVPR), 2020
Shizhe Chen
Yida Zhao
Qin Jin
Qi Wu
241
359
0
01 Mar 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval
Hadi Abdi Khojasteh
Ebrahim Ansari
Parvin Razzaghi
Akbar Karimi
VLM
130
4
0
23 Feb 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2020
Tianlang Chen
Jiebo Luo
150
70
0
20 Feb 2020
MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Geondo Park
Chihye Han
Wonjun Yoon
Dae-Shik Kim
97
23
0
11 Jan 2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
European Conference on Computer Vision (ECCV), 2019
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
436
507
0
18 Dec 2019
Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences
ACM Multimedia (ACM MM), 2019
Shizhe Chen
Bei Liu
Jianlong Fu
Ruihua Song
Qin Jin
Pingping Lin
Xiaoyu Qi
Chunting Wang
Jin Zhou
DiffM
171
34
0
24 Nov 2019
HUSE: Hierarchical Universal Semantic Embeddings
P. Narayana
Aniket Pednekar
A. Krishnamoorthy
Kazoo Sone
Sugato Basu
164
11
0
14 Nov 2019
Target-Oriented Deformation of Visual-Semantic Embedding Space
Takashi Matsubara
149
7
0
15 Oct 2019
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Sijin Wang
Ruiping Wang
Ziwei Yao
Shiguang Shan
Xilin Chen
3DV
205
239
0
11 Oct 2019
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
154
40
0
22 Sep 2019
Bridging Visual Perception with Contextual Semantics for Understanding Robot Manipulation Tasks
Chen Jiang
Martin Jägersand
219
4
0
16 Sep 2019
Joint Wasserstein Autoencoders for Aligning Multimodal Embeddings
Shweta Mahajan
Teresa Botschen
Iryna Gurevych
Stefan Roth
107
8
0
14 Sep 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
IEEE International Conference on Computer Vision (ICCV), 2019
Zihao Wang
Xihui Liu
Jiaming Song
Lu Sheng
Junjie Yan
Xiaogang Wang
Jing Shao
VLM
308
338
0
12 Sep 2019
MULE: Multimodal Universal Language Embedding
AAAI Conference on Artificial Intelligence (AAAI), 2019
Donghyun Kim
Kuniaki Saito
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
198
43
0
08 Sep 2019
Visual Semantic Reasoning for Image-Text Matching
IEEE International Conference on Computer Vision (ICCV), 2019
Kunpeng Li
Yulun Zhang
Keqin Li
Yuanyuan Li
Y. Fu
VLM
290
575
0
06 Sep 2019
Adversarial Representation Learning for Text-to-Image Matching
IEEE International Conference on Computer Vision (ICCV), 2019
N. Sarafianos
Xiang Xu
I. Kakadiaris
GAN
268
217
0
28 Aug 2019
Language Features Matter: Effective Language Representations for Vision-Language Tasks
IEEE International Conference on Computer Vision (ICCV), 2019
Andrea Burns
Reuben Tan
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
165
28
0
17 Aug 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
AAAI Conference on Artificial Intelligence (AAAI), 2019
Gen Li
Nan Duan
Yuejian Fang
Ming Gong
Daxin Jiang
Ming Zhou
SSL
VLM
MLLM
809
948
0
16 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
ACM Multimedia (ACM MM), 2019
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
119
165
0
12 Aug 2019
Position Focused Attention Network for Image-Text Matching
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
187
187
0
23 Jul 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
IEEE Transactions on Image Processing (TIP), 2019
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
194
181
0
23 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Yaxian Xia
Lun Huang
Wenmin Wang
Xiao-Yong Wei
Jie Chen
204
2
0
17 Jun 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Computer Vision and Pattern Recognition (CVPR), 2019
Yale Song
M. Soleymani
265
267
0
11 Jun 2019
Saliency-Guided Attention Network for Image-Sentence Matching
Zhong Ji
Haoran Wang
Jiawei Han
Yanwei Pang
173
95
0
20 Apr 2019
Show, Translate and Tell
D. Peri
Shagan Sah
R. Ptucha
101
5
0
14 Mar 2019
Multi-task Learning of Hierarchical Vision-Language Representation
Duy-Kien Nguyen
Takayuki Okatani
259
56
0
03 Dec 2018
Pedestrian Trajectory Prediction with Structured Memory Hierarchies
Tharindu Fernando
Akila Pemasiri
Sridha Sridharan
Clinton Fookes
157
19
0
22 Jul 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
436
1,300
0
21 Mar 2018
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
519
562
0
15 Nov 2017