v1v2v3v4 (latest)

Improved Image Captioning via Policy Gradient optimization of SPIDEr

1 December 2016

Papers citing "Improved Image Captioning via Policy Gradient optimization of SPIDEr"

50 / 232 papers shown

Enhanced Modality Transition for Image Captioning

Ziwei Wang

Yadan Luo

Zi Huang

23 Feb 2021

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2021

430

274

20 Feb 2021

Image Captioning using Multiple Transformers for Self-Attention Mechanism

14 Feb 2021

The Role of Syntactic Planning in Compositional Image CaptioningConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Emanuele Bugliarello

Desmond Elliott

CoGe

102

28 Jan 2021

Text-Free Image-to-Speech Synthesis Using Learned Segmental UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

175

31 Dec 2020

WEmbSim: A Simple yet Effective Metric for Image CaptioningInternational Conference on Digital Image Computing: Techniques and Applications (DICTA), 2020

Wei Liu

104

24 Dec 2020

LCEval: Learned Composite Metric for Caption EvaluationInternational Journal of Computer Vision (IJCV), 2019

Wei Liu

120

24 Dec 2020

Image Captioning with Context-Aware Auxiliary GuidanceAAAI Conference on Artificial Intelligence (AAAI), 2020

205

10 Dec 2020

Understanding Guided Image Captioning Performance across DomainsConference on Computational Natural Language Learning (CoNLL), 2020

369

04 Dec 2020

DORB: Dynamically Optimizing Multiple Rewards with BanditsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

189

15 Nov 2020

Dual Attention on Pyramid Feature Maps for Image CaptioningIEEE transactions on multimedia (TMM), 2020

Litao Yu

Jian Zhang

Qiang Wu

335

02 Nov 2020

Boost Image Captioning with Knowledge ReasoningMachine-mediated learning (ML), 2020

115

02 Nov 2020

DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual ExplanationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

...

206

01 Nov 2020

WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information

An Tran

Konstantinos Drossos

Maria Sandsten

197

21 Oct 2020

New Ideas and Trends in Deep Multimodal Content Understanding: A ReviewNeurocomputing (Neurocomputing), 2020

329

16 Oct 2020

Adversarial Grammatical Error CorrectionFindings (Findings), 2020

Vipul Raheja

Dimitrios Alikaniotis

148

06 Oct 2020

Teacher-Critical Training Strategies for Image Captioning

Yiqing Huang

Jiansheng Chen

VLM

147

30 Sep 2020

Where is the Model Looking At?--Concentrate and Explain the Network Attention

Wenjia Xu

123

29 Sep 2020

Effects of Word-frequency based Pre- and Post- Processings for Audio CaptioningWorkshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2020

171

24 Sep 2020

Towards Unique and Informative Captioning of ImagesEuropean Conference on Computer Vision (ECCV), 2020

168

08 Sep 2020

A Survey of Evaluation Metrics Used for NLG SystemsACM Computing Surveys (ACM CSUR), 2020

Ananya B. Sai

Akash Kumar Mohankumar

Mitesh M. Khapra

ELM

446

288

27 Aug 2020

Learn to Talk via Proactive Knowledge Transfer

Qing Sun

James Cross

23 Aug 2020

Assisting Scene Graph Generation with Self-Supervision

Sandeep Inuganti

V. Balasubramanian

SSL

177

08 Aug 2020

Neural Language Generation: Formulation, Methods, and Evaluation

Cristina Garbacea

Qiaozhu Mei

353

31 Jul 2020

A Unified Framework of Surrogate Loss by Refactoring and InterpolationEuropean Conference on Computer Vision (ECCV), 2020

Lanlan Liu

Mingzhe Wang

Gaowen Liu

154

27 Jul 2020

Multi-task Regularization Based on Infrequent Classes for Audio CaptioningWorkshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2020

Emre Çakir

Konstantinos Drossos

Maria Sandsten

108

09 Jul 2020

Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning

K. Nguyen

Konstantinos Drossos

Maria Sandsten

106

06 Jul 2020

Listen carefully and tell: an audio captioning system based on residual learning and gammatone audio representation

Sergi Perez-Castanos

Javier Naranjo-Alcazar

P. Zuccarello

M. Cobos

157

27 Jun 2020

Evaluation of Text Generation: A Survey

327

420

26 Jun 2020

Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation

Mingjie Li

Fuyu Wang

Xiaojun Chang

Xiaodan Liang

MedIm

201

132

06 Jun 2020

MLE-guided parameter search for task loss minimization in neural sequence modeling

Sean Welleck

Dong Wang

193

04 Jun 2020

Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models

169

27 May 2020

ALBA : Reinforcement Learning for Video Object SegmentationBritish Machine Vision Conference (BMVC), 2020

Shreyank N. Gowda

Panagiotis Eustratiadis

Timothy M. Hospedales

Laura Sevilla-Lara

VOS

173

26 May 2020

Learning a Reinforced Agent for Flexible Exposure Bracketing SelectionComputer Vision and Pattern Recognition (CVPR), 2020

Ping Luo

147

26 May 2020

Self-Supervised and Controlled Multi-Document Opinion SummarizationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2020

126

30 Apr 2020

Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray ReportsAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Baoyu Jing

Zeya Wang

Eric Xing

268

167

26 Apr 2020

Context-Aware Group Captioning via Self-Attention and Contrastive FeaturesComputer Vision and Pattern Recognition (CVPR), 2020

168

07 Apr 2020

Learning Compact Reward for Image Captioning

Nannan Li

Zhenzhong Chen

149

24 Mar 2020

Visual Question Answering for Cultural Heritage

841

22 Mar 2020

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene GraphsComputer Vision and Pattern Recognition (CVPR), 2020

Shizhe Chen

Qin Jin

Peng Wang

Qi Wu

DiffM

327

238

01 Mar 2020

Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework

C. Sur

226

16 Feb 2020

MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)Multimedia tools and applications (MTA), 2020

C. Sur

159

15 Feb 2020

aiTPR: Attribute Interaction-Tensor Product Representation for Image CaptionNeural Processing Letters (NPL), 2020

C. Sur

119

27 Jan 2020

Nested-Wasserstein Self-Imitation Learning for Sequence GenerationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Lawrence Carin

204

20 Jan 2020

Spatio-Temporal Ranked-Attention Networks for Video CaptioningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

117

17 Jan 2020

Adaptive Correlated Monte Carlo for Contextual Categorical Sequence GenerationInternational Conference on Learning Representations (ICLR), 2019

Xinjie Fan

Yizhe Zhang

Zhendong Wang

Mingyuan Zhou

BDL

139

31 Dec 2019

Vision and Language: from Visual Perception to Content CreationAPSIPA Transactions on Signal and Information Processing (APSIPA TSIP), 2019

Tao Mei

Wei Zhang

Ting Yao

VLM

182

26 Dec 2019

Meshed-Memory Transformer for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2019

Marcella Cornia

Matteo Stefanini

Lorenzo Baraldi

Rita Cucchiara

262

1,025

17 Dec 2019

Learning to Relate from Captions and Bounding BoxesAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Sarthak Garg

Joel Ruben Antony Moniz

Anshu Aviral

Priyatham Bollimpalli

161

01 Dec 2019

CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTMMultimedia tools and applications (MTA), 2019

C. Sur

BDL

188

22 Nov 2019