Aligning Linguistic Words and Visual Semantic Units for Image Captioning

ACM Multimedia (ACM MM), 2019

6 August 2019

Jing Liu

Papers citing "Aligning Linguistic Words and Visual Semantic Units for Image Captioning"

27 / 27 papers shown

SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioningAAAI Conference on Artificial Intelligence (AAAI), 2025

178

01 Dec 2025

A Comprehensive Analysis of Real-World Image Captioning and Scene Identification

Sai Suprabhanu Nallapaneni

Subrahmanyam Konakanchi

227

05 Aug 2023

Semantic Composition in Visually Grounded Language Models

Rohan Pandey

CoGe

254

15 May 2023

Graph Neural Networks in Vision-Language Image Understanding: A SurveyThe Visual Computer (TVC), 2023

357

07 Mar 2023

Cross-modal Attention Congruence Regularization for Vision-Language Relation AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Louis-Philippe Morency

253

20 Dec 2022

A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and MultimodalIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

577

256

12 Dec 2022

Controllable Image Captioning

Luka Maxwell

420

28 Apr 2022

Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets

Marcella Cornia

Lorenzo Baraldi

G. Fiameni

Rita Cucchiara

388

24 Nov 2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation

259

19 Oct 2021

Domain Adaptive Semantic Segmentation without Source Data

Zi Huang

264

13 Oct 2021

Geometry-Entangled Visual Semantic Transformer for Image Captioning

241

29 Sep 2021

Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for
Stylized Image Captioning

327

26 Aug 2021

Scene Designer: a Unified Model for Scene Search and Synthesis from Sketch

Leo Sampaio Ferraz Ribeiro

242

16 Aug 2021

OSCAR-Net: Object-centric Scene Graph Attention for Image AttributionIEEE International Conference on Computer Vision (ICCV), 2021

197

07 Aug 2021

ReFormer: The Relational Transformer for Image CaptioningACM Multimedia (ACM MM), 2021

268

29 Jul 2021

X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question AnsweringACM Multimedia (ACM MM), 2021

290

24 Jul 2021

From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Lorenzo Baraldi

575

368

14 Jul 2021

Productivity, Portability, Performance: Data-Centric Python

456

116

01 Jul 2021

LayoutGMN: Neural Graph Matching for Structural Layout SimilarityComputer Vision and Pattern Recognition (CVPR), 2020

316

11 Dec 2020

DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction DetectionAAAI Conference on Artificial Intelligence (AAAI), 2020

347

02 Oct 2020

Dynamic Context-guided Capsule Network for Multimodal Machine TranslationACM Multimedia (ACM MM), 2020

Jie Zhou

258

04 Sep 2020

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph GenerationACM Multimedia (ACM MM), 2020

371

12 Aug 2020

Improving Image Captioning with Better Use of Captions

Zhan Shi

Xu Zhou

Xipeng Qiu

Xiao-Dan Zhu

209

155

21 Jun 2020

Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning

Jing Liu

201

10 May 2020

Image Captioning through Image TransformerAsian Conference on Computer Vision (ACCV), 2020

310

116

29 Apr 2020

More Grounded Image Captioning by Distilling Image-Text Matching ModelComputer Vision and Pattern Recognition (CVPR), 2020

Yuanen Zhou

Meng Wang

Daqing Liu

Zhenzhen Hu

Hanwang Zhang

270

145

01 Apr 2020

Normalized and Geometry-Aware Self-Attention Network for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2020

Jing Liu

362

221

19 Mar 2020