ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,507 papers shown
Title
Rethinking Referring Object Removal
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
32
0
0
14 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision
  Language Navigation
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
34
4
0
13 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing
  Objects in 3D Scenes
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
40
10
0
12 Mar 2024
Enhancing Image Caption Generation Using Reinforcement Learning with
  Human Feedback
Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback
L. AdarshN
V. ArunP
L. AravindhN
21
1
0
11 Mar 2024
How to Understand Named Entities: Using Common Sense for News Captioning
How to Understand Named Entities: Using Common Sense for News Captioning
Ning Xu
Yanhui Wang
Tingting Zhang
Hongshuo Tian
Mohan S. Kankanhalli
An-An Liu
24
0
0
11 Mar 2024
Transformer based Multitask Learning for Image Captioning and Object
  Detection
Transformer based Multitask Learning for Image Captioning and Object Detection
Debolena Basak
P. K. Srijith
M. Desarkar
16
1
0
10 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video
  Generation
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
36
40
0
08 Mar 2024
Rule-driven News Captioning
Rule-driven News Captioning
Ning Xu
Tingting Zhang
Hongshuo Tian
An-An Liu
60
0
0
08 Mar 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
MeaCap: Memory-Augmented Zero-shot Image Captioning
Zequn Zeng
Yan Xie
Hao Zhang
Chiyu Chen
Zhengjue Wang
Boli Chen
VLM
25
14
0
06 Mar 2024
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach
  for Relation Classification
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification
Robert Vacareanu
F. Alam
M. Islam
Haris Riaz
Mihai Surdeanu
NAI
19
2
0
05 Mar 2024
Causal Prompting: Debiasing Large Language Model Prompting based on
  Front-Door Adjustment
Causal Prompting: Debiasing Large Language Model Prompting based on Front-Door Adjustment
Congzhi Zhang
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
CML
AI4CE
LRM
44
15
0
05 Mar 2024
Attention Guidance Mechanism for Handwritten Mathematical Expression
  Recognition
Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition
Yutian Liu
Wenjun Ke
Jianguo Wei
38
0
0
04 Mar 2024
DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable
  Causal Inference
DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal Inference
Jialong Wu
Linhai Zhang
Deyu Zhou
Guoqiang Xu
CML
19
3
0
02 Mar 2024
ELA: Efficient Local Attention for Deep Convolutional Neural Networks
ELA: Efficient Local Attention for Deep Convolutional Neural Networks
Wei Xu
Yi Wan
33
31
0
02 Mar 2024
How to Understand "Support"? An Implicit-enhanced Causal Inference
  Approach for Weakly-supervised Phrase Grounding
How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding
Jiamin Luo
Jianing Zhao
Jingjing Wang
Guodong Zhou
41
0
0
29 Feb 2024
SNE-RoadSegV2: Advancing Heterogeneous Feature Fusion and Fallibility
  Awareness for Freespace Detection
SNE-RoadSegV2: Advancing Heterogeneous Feature Fusion and Fallibility Awareness for Freespace Detection
Yi Feng
Yu Ma
Qijun Chen
Ioannis Pitas
Rui Fan
37
5
0
29 Feb 2024
Polos: Multimodal Metric Learning from Human Feedback for Image
  Captioning
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada
Kanta Kaneda
Daichi Saito
Komei Sugiura
34
24
0
28 Feb 2024
Vision Language Model-based Caption Evaluation Method Leveraging Visual
  Context Extraction
Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Koki Maeda
Shuhei Kurita
Taiki Miyanishi
Naoaki Okazaki
30
2
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
56
17
0
28 Feb 2024
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing
  Different Modalities as Different Languages
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
Minsu Kim
Jee-weon Jung
Hyeongseop Rha
Soumi Maiti
Siddhant Arora
Xuankai Chang
Shinji Watanabe
Y. Ro
28
6
0
25 Feb 2024
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
Li Mi
Syrielle Montariol
J. Castillo-Navarro
Xianjie Dai
Antoine Bosselut
D. Tuia
22
4
0
20 Feb 2024
Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid
  Federated Approach
Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach
Chengyi Ju
Jiannong Cao
Yu Yang
Zhen-Qun Yang
Ho Man Lee
13
0
0
19 Feb 2024
AICAttack: Adversarial Image Captioning Attack with Attention-Based
  Optimization
AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization
Jiyao Li
Mingze Ni
Yifei Dong
Tianqing Zhu
Wei Liu
AAML
27
2
0
19 Feb 2024
Align before Attend: Aligning Visual and Textual Features for Multimodal
  Hateful Content Detection
Align before Attend: Aligning Visual and Textual Features for Multimodal Hateful Content Detection
E. Hossain
Omar Sharif
M. M. Hoque
S. Preum
24
3
0
15 Feb 2024
On the Resurgence of Recurrent Models for Long Sequences -- Survey and
  Research Opportunities in the Transformer Era
On the Resurgence of Recurrent Models for Long Sequences -- Survey and Research Opportunities in the Transformer Era
Matteo Tiezzi
Michele Casoni
Alessandro Betti
Tommaso Guidi
Marco Gori
S. Melacci
19
9
0
12 Feb 2024
Savvy: Trustworthy Autonomous Vehicles Architecture
Savvy: Trustworthy Autonomous Vehicles Architecture
Ali Shoker
Rehana Yasmin
Paulo Esteves-Verissimo
23
0
0
08 Feb 2024
Intensive Vision-guided Network for Radiology Report Generation
Intensive Vision-guided Network for Radiology Report Generation
Fudan Zheng
Mengfei Li
Ying Wang
Weijiang Yu
Ruixuan Wang
Zhiguang Chen
Nong Xiao
Yutong Lu
23
1
0
06 Feb 2024
Revisiting Generative Adversarial Networks for Binary Semantic
  Segmentation on Imbalanced Datasets
Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets
Lei Xu
M. Gabbouj
GAN
16
1
0
03 Feb 2024
Image Fusion via Vision-Language Model
Image Fusion via Vision-Language Model
Zixiang Zhao
Lilun Deng
Haowen Bai
Yukun Cui
Zhipeng Zhang
...
Haotong Qin
Dongdong Chen
Jiangshe Zhang
Peng Wang
Luc Van Gool
VLM
24
18
0
03 Feb 2024
MLIP: Enhancing Medical Visual Representation with Divergence Encoder
  and Knowledge-guided Contrastive Learning
MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning
Zhe Li
Laurence T. Yang
Bocheng Ren
Xin Nie
Zhangyang Gao
Cheng Tan
Stan Z. Li
VLM
13
12
0
03 Feb 2024
Streaming Sequence Transduction through Dynamic Compression
Streaming Sequence Transduction through Dynamic Compression
Weiting Tan
Yunmo Chen
Tongfei Chen
Guanghui Qin
Haoran Xu
Heidi C. Zhang
Benjamin Van Durme
Philipp Koehn
19
1
0
02 Feb 2024
Attention-based Dynamic Multilayer Graph Neural Networks for Loan
  Default Prediction
Attention-based Dynamic Multilayer Graph Neural Networks for Loan Default Prediction
Sahab Zandi
Kamesh Korangi
María Óskarsdóttir
Christophe Mues
Cristián Bravo
14
5
0
01 Feb 2024
GQHAN: A Grover-inspired Quantum Hard Attention Network
GQHAN: A Grover-inspired Quantum Hard Attention Network
Ren-Xin Zhao
Jinjing Shi
Xuelong Li
22
3
0
25 Jan 2024
MAST: Video Polyp Segmentation with a Mixture-Attention Siamese
  Transformer
MAST: Video Polyp Segmentation with a Mixture-Attention Siamese Transformer
Geng Chen
Junqing Yang
Xiaozhou Pu
Ge-Peng Ji
Huan Xiong
Yongsheng Pan
Hengfei Cui
Yong-quan Xia
MedIm
ViT
38
2
0
23 Jan 2024
Unsupervised Learning of Graph from Recipes
Unsupervised Learning of Graph from Recipes
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
SSL
15
0
0
22 Jan 2024
Collaborative Position Reasoning Network for Referring Image
  Segmentation
Collaborative Position Reasoning Network for Referring Image Segmentation
Jianjian Cao
Beiya Dai
Yulin Li
Xiameng Qin
Jingdong Wang
25
0
0
22 Jan 2024
Spatial-temporal Forecasting for Regions without Observations
Spatial-temporal Forecasting for Regions without Observations
Xinyu Su
Jianzhong Qi
E. Tanin
Yanchuan Chang
Majid Sarvi
AI4TS
30
2
0
19 Jan 2024
Enhancing Scalability in Recommender Systems through Lottery Ticket
  Hypothesis and Knowledge Distillation-based Neural Network Pruning
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning
R. Rajaram
Manoj Bharadhwaj
VS Vasan
N. Pervin
19
1
0
19 Jan 2024
Supervised Fine-tuning in turn Improves Visual Foundation Models
Supervised Fine-tuning in turn Improves Visual Foundation Models
Xiaohu Jiang
Yixiao Ge
Yuying Ge
Dachuan Shi
Chun Yuan
Ying Shan
VLM
CLIP
38
8
0
18 Jan 2024
Jewelry Recognition via Encoder-Decoder Models
Jewelry Recognition via Encoder-Decoder Models
José M. Alcalde-Llergo
Enrique Yeguas-Bolivar
Andrea Zingoni
Alejandro Fuerte-Jurado
27
0
0
15 Jan 2024
Survey of Natural Language Processing for Education: Taxonomy,
  Systematic Review, and Future Trends
Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends
Yunshi Lan
Xinyuan Li
Hanyue Du
Xuesong Lu
Ming Gao
Weining Qian
Aoying Zhou
33
1
0
15 Jan 2024
HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced
  Diffusion Models
HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models
Hanzhang Wang
Haoran Wang
Jinze Yang
Zhongrui Yu
Zeke Xie
Lei Tian
Xinyan Xiao
Junjun Jiang
Xianming Liu
Mingming Sun
DiffM
17
1
0
11 Jan 2024
Complementary Information Mutual Learning for Multimodality Medical
  Image Segmentation
Complementary Information Mutual Learning for Multimodality Medical Image Segmentation
Chuyun Shen
Wenhao Li
Haoqing Chen
Xiaoling Wang
Fengping Zhu
Yuxin Li
Xiangfeng Wang
Bo Jin
38
3
0
05 Jan 2024
Object-oriented backdoor attack against image captioning
Object-oriented backdoor attack against image captioning
Meiling Li
Nan Zhong
Xinpeng Zhang
Zhenxing Qian
Sheng Li
11
8
0
05 Jan 2024
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via
  Text-Only Training
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu
Shan Ning
Xuming He
VLM
33
3
0
04 Jan 2024
Short-Term Multi-Horizon Line Loss Rate Forecasting of a Distribution
  Network Using Attention-GCN-LSTM
Short-Term Multi-Horizon Line Loss Rate Forecasting of a Distribution Network Using Attention-GCN-LSTM
Jie Liu
Yijia Cao
Yong Li
Yixiu Guo
Wei Deng
11
1
0
19 Dec 2023
Satellite Captioning: Large Language Models to Augment Labeling
Satellite Captioning: Large Language Models to Augment Labeling
Grant Rosario
David A. Noever
82
0
0
18 Dec 2023
Dual Branch Network Towards Accurate Printed Mathematical Expression
  Recognition
Dual Branch Network Towards Accurate Printed Mathematical Expression Recognition
Yuqing Wang
Zhenyu Weng
Zhaokun Zhou
Shuaijian Ji
Zhongjie Ye
Yuesheng Zhu
16
2
0
14 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
30
18
0
13 Dec 2023
Pain Analysis using Adaptive Hierarchical Spatiotemporal Dynamic Imaging
Pain Analysis using Adaptive Hierarchical Spatiotemporal Dynamic Imaging
Issam Serraoui
Eric Granger
Abdenour Hadid
Abdelmalik Taleb-Ahmed
16
0
0
12 Dec 2023
Previous
123456...697071
Next