Multimodal Convolutional Neural Networks for Matching Image and Sentence

23 April 2015
Lin Ma
Zhengdong Lu
Lifeng Shang
Hang Li

Papers citing "Multimodal Convolutional Neural Networks for Matching Image and Sentence"

24 / 24 papers shown
PREMISE: Matching-based Prediction for Accurate Review Recommendation
Wei Han
Hui Chen
Soujanya Poria
29
0
0
02 May 2025
Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation
Palaash Goel
Dushyant Singh Chauhan
Md. Shad Akhtar
LRM
50
0
0
11 Feb 2025
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping
Bo Zhou
Jiahui Liu
Songyi Cui
Yaping Zhao
18
4
0
23 Aug 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Haoran Wang
Dongliang He
Wenhao Wu
Boyang Xia
Min Yang
Fu Li
Yunlong Yu
Zhong Ji
Errui Ding
Jingdong Wang
19
22
0
21 Aug 2022
Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Ran Chen
Hanli Wang
Lei Wang
Sam Kwong
11
9
0
10 Mar 2022
Step-Wise Hierarchical Alignment Network for Image-Text Matching
Zhong Ji
Kexin Chen
Haoran Wang
14
93
0
11 Jun 2021
Graph Structured Network for Image-Text Matching
Chunxiao Liu
Zhendong Mao
Tianzhu Zhang
Hongtao Xie
Bin Wang
Yongdong Zhang
9
231
0
01 Apr 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval
Hadi Abdi Khojasteh
Ebrahim Ansari
Parvin Razzaghi
Akbar Karimi
VLM
4
4
0
23 Feb 2020
Target-Oriented Deformation of Visual-Semantic Embedding Space
Takashi Matsubara
16
7
0
15 Oct 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
Gen Li
Nan Duan
Yuejian Fang
Ming Gong
Daxin Jiang
Ming Zhou
SSL
VLM
MLLM
24
895
0
16 Aug 2019
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain
Johanes Effendi
Andros Tjandra
S. Sakti
Satoshi Nakamura
6
3
0
03 Jun 2019
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
11
26
0
20 Apr 2019
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
W. Liu
8
316
0
30 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
8
202
0
12 Mar 2018
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
21
577
0
18 May 2017
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
Alexis Conneau
Douwe Kiela
Holger Schwenk
Loïc Barrault
Antoine Bordes
AI4TS
SSL
14
2,091
0
05 May 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
14
494
0
11 Apr 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen
Trung Bui
Chen Fang
Zhaowen Wang
Ram Nevatia
27
38
0
03 Apr 2017
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
19
171
0
12 Jan 2017
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
23
664
0
02 Nov 2016
Linking Image and Text with 2-Way Nets
Aviv Eisenschtat
Lior Wolf
11
176
0
29 Aug 2016
Learning to Answer Questions From Image Using Convolutional Neural Network
Lin Ma
Zhengdong Lu
Hang Li
13
262
0
01 Jun 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
23
154
0
25 Apr 2015
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
243
7,633
0
03 Jul 2012