Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2007.11731
Cited By
Comprehensive Image Captioning via Scene Graph Decomposition
23 July 2020
Yiwu Zhong
Liwei Wang
Jianshu Chen
Dong Yu
Yin Li
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Comprehensive Image Captioning via Scene Graph Decomposition"
47 / 47 papers shown
Title
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement
Shaoqing Lin
Chong Teng
Fei Li
Donghong Ji
Lizhen Qu
Z. Li
72
0
0
18 Jun 2025
PRISM-0: A Predicate-Rich Scene Graph Generation Framework for Zero-Shot Open-Vocabulary Tasks
Abdelrahman Elskhawy
Mengze Li
Nassir Navab
Benjamin Busam
VLM
132
2
0
01 Apr 2025
SuperCap: Multi-resolution Superpixel-based Image Captioning
Henry Senior
Luca Rossi
Gregory Slabaugh
Shanxin Yuan
VLM
139
0
0
11 Mar 2025
ROOT: VLM based System for Indoor Scene Understanding and Beyond
Yonghui Wang
Shi-Yong Chen
Zhenxing Zhou
Siyi Li
Haoran Li
Wengang Zhou
Haoyang Li
VLM
179
4
0
24 Nov 2024
VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
Daniel Ekpo
Mara Levy
Saksham Suri
Chuong Huynh
Abhinav Shrivastava
144
2
0
15 Nov 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
136
0
0
09 Aug 2024
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning
Yiwu Zhong
Zi-Yuan Hu
Michael R. Lyu
Liwei Wang
88
3
0
27 Mar 2024
Joint Generative Modeling of Grounded Scene Graphs and Images via Diffusion Models
Bicheng Xu
Qi Yan
Renjie Liao
Lele Wang
Leonid Sigal
DiffM
136
2
0
02 Jan 2024
Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting
Hejie Cui
Xinyu Fang
Zihan Zhang
Ran Xu
Xuan Kan
Xin Liu
Yue Yu
Manling Li
Yangqiu Song
Carl Yang
VLM
81
4
0
28 Oct 2023
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
Kibum Kim
Kanghoon Yoon
Jaeyeong Jeon
Yeonjun In
Jinyoung Moon
Donghyun Kim
Chanyoung Park
226
22
0
16 Oct 2023
Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction
Jiankai Li
Yunhong Wang
Weixin Li
118
3
0
07 Sep 2023
Head-Tail Cooperative Learning Network for Unbiased Scene Graph Generation
Lei Wang
Zejian Yuan
Yao Lu
Badong Chen
101
0
0
23 Aug 2023
Video Captioning with Aggregated Features Based on Dual Graphs and Gated Fusion
Yutao Jin
Bin Liu
Jing Wang
97
1
0
13 Aug 2023
Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph Generation
Yu Min
Aming Wu
Cheng Deng
139
9
0
07 Aug 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
109
23
0
17 Jul 2023
FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing
Zhuang Li
Yuyang Chai
Terry Yue Zhuo
Zhuang Li
Gholamreza Haffari
Fei Li
Donghong Ji
Quan Hung Tran
156
42
0
27 May 2023
Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation
Zhenxing Zhang
Lambert Schomaker
73
4
0
17 May 2023
Transforming Visual Scene Graphs to Image Captions
Xu Yang
Jiawei Peng
Zihua Wang
Haiyang Xu
Qinghao Ye
Chenliang Li
Mingshi Yan
Feisi Huang
Zhangzikang Li
Yu Zhang
119
24
0
03 May 2023
Devil's on the Edges: Selective Quad Attention for Scene Graph Generation
Deunsol Jung
Sanghyun Kim
Wonhui Kim
Minsu Cho
3DPC
GNN
97
35
0
07 Apr 2023
Learning Combinatorial Prompts for Universal Controllable Image Captioning
Zhen Wang
Jun Xiao
Yueting Zhuang
Fei Gao
Jian Shao
Long Chen
124
8
0
11 Mar 2023
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Henry Senior
Greg Slabaugh
Shanxin Yuan
Luca Rossi
GNN
141
25
0
07 Mar 2023
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Xinze Wang
William Yang Wang
CoGe
297
342
0
09 Dec 2022
CLID: Controlled-Length Image Descriptions with Limited Data
Elad Hirsch
A. Tal
VLM
3DV
98
5
0
27 Nov 2022
Visual Semantic Parsing: From Images to Abstract Meaning Representation
M. A. Abdelsalam
Zhan Shi
Federico Fancellu
Kalliopi Basioti
Dhaivat Bhatt
Vladimir Pavlovic
Afsaneh Fazly
GNN
111
5
0
26 Oct 2022
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation
Yu Zhao
Jianguo Wei
Zhichao Lin
Yueheng Sun
Meishan Zhang
Hao Fei
101
16
0
20 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
178
13
0
18 Oct 2022
SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation
Woo Suk Choi
Y. Heo
Byoung-Tak Zhang
GNN
88
3
0
17 Oct 2022
Contextual Modeling for 3D Dense Captioning on Point Clouds
Yufeng Zhong
Longdao Xu
Jiebo Luo
Lin Ma
106
17
0
08 Oct 2022
A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
Chaoqi Chen
Yushuang Wu
Qiyuan Dai
Hong-Yu Zhou
Mutian Xu
Sibei Yang
Xiaoguang Han
Yizhou Yu
ViT
MedIm
AI4CE
167
93
0
27 Sep 2022
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation
Liguang Zhou
Yuhongze Zhou
Tin Lun Lam
Yangsheng Xu
EDL
MoE
159
2
0
15 Aug 2022
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection
VS Vibashan
Poojan Oza
Vishal M. Patel
189
76
0
29 Mar 2022
Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation
Xingning Dong
Tian Gan
Xuemeng Song
Yue Yu
Yuan Cheng
Liqiang Nie
153
97
0
18 Mar 2022
Taking an Emotional Look at Video Paragraph Captioning
Qinyu Li
Tengpeng Li
Hanli Wang
Changan Chen
85
6
0
12 Mar 2022
Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
Tengpeng Li
Hanli Wang
Bin He
Changan Chen
DiffM
106
11
0
10 Mar 2022
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
229
110
0
31 Jan 2022
Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation
Chao Chen
Yibing Zhan
Baosheng Yu
Liu Liu
Yong Luo
Bo Du
98
44
0
18 Jan 2022
Representing Videos as Discriminative Sub-graphs for Action Recognition
Dong Li
Zhaofan Qiu
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
127
28
0
11 Jan 2022
DVCFlow: Modeling Information Flow Towards Human-like Video Captioning
Xu Yan
Zhengcong Fei
Shuhui Wang
Qingming Huang
Qi Tian
VGen
113
4
0
19 Nov 2021
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Ali Furkan Biten
L. G. I. Bigorda
Dimosthenis Karatzas
230
68
0
04 Oct 2021
Learning to Generate Scene Graph from Natural Language Supervision
Yiwu Zhong
Jing Shi
Jianwei Yang
Chenliang Xu
Yin Li
SSL
116
80
0
06 Sep 2021
ReFormer: The Relational Transformer for Image Captioning
Xuewen Yang
Yingru Liu
Xin Wang
ViT
115
60
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
231
285
0
14 Jul 2021
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
135
77
0
22 Mar 2021
A Comprehensive Survey of Scene Graphs: Generation and Application
Xiaojun Chang
Pengzhen Ren
Pengfei Xu
Zhihui Li
Xiaojiang Chen
Alexander G. Hauptmann
3DV
231
251
0
17 Mar 2021
Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
Gengcong Yang
Jingyi Zhang
Yong Zhang
Baoyuan Wu
Yujiu Yang
109
68
0
09 Mar 2021
In Defense of Scene Graphs for Image Captioning
Kien Nguyen
Subarna Tripathi
Bang Du
T. Guha
Truong Thao Nguyen
98
50
0
09 Feb 2021
UNISON: Unpaired Cross-lingual Image Captioning
Jiahui Gao
Yi Zhou
Philip L. H. Yu
Shafiq Joty
Jiuxiang Gu
86
17
0
03 Oct 2020
1