1812.02378
Cited By
v3 (latest)
Auto-Encoding Scene Graphs for Image Captioning
6 December 2018
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
Papers citing "Auto-Encoding Scene Graphs for Image Captioning"
50 / 311 papers shown
OSIC: A New One-Stage Image Captioner Coined
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Bo Wang
Zhao Zhang
Ming Zhao
Xiaojie Jin
Mingliang Xu
Meng Wang
VLM
226
6
0
04 Nov 2022
Leveraging commonsense for object localisation in partial scenes
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Francesco Giuliari
Geri Skenderi
Marco Cristani
Alessio Del Bue
Yiming Wang
203
3
0
01 Nov 2022
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
ACM Transactions on Knowledge Discovery from Data (TKDD), 2021
Fenglin Liu
Xian Wu
Shen Ge
Xuancheng Ren
Wei Fan
Xu Sun
Yuexian Zou
VLM
209
13
0
28 Oct 2022
Visual Semantic Parsing: From Images to Abstract Meaning Representation
Conference on Computational Natural Language Learning (CoNLL), 2022
M. A. Abdelsalam
Zhan Shi
Federico Fancellu
Kalliopi Basioti
Dhaivat Bhatt
Vladimir Pavlovic
Afsaneh Fazly
GNN
270
5
0
26 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Neural Information Processing Systems (NeurIPS), 2022
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
232
52
0
19 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
369
17
0
18 Oct 2022
Explore Contextual Information for 3D Scene Graph Generation
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2022
Yu-An Liu
Chengjiang Long
Zhaoxuan Zhang
Bo Liu
Qiang Zhang
Baocai Yin
Xin Yang
371
16
0
12 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
International Journal of Computer Vision (IJCV), 2022
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
274
10
0
04 Oct 2022
Unbiased Scene Graph Generation using Predicate Similarities
IEEE Access, 2022
Misaki Ohashi
Yusuke Matsui
264
1
0
03 Oct 2022
A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Chaoqi Chen
Yushuang Wu
Qiyuan Dai
Hong-Yu Zhou
Mutian Xu
Sibei Yang
Xiaoguang Han
Yizhou Yu
ViT
MedIm
AI4CE
388
133
0
27 Sep 2022
Learning Distinct and Representative Styles for Image Captioning
Neural Information Processing Systems (NeurIPS), 2022
Qi Chen
Chaorui Deng
Qi Wu
VLM
187
27
0
17 Sep 2022
Scene Graph Modification as Incremental Structure Expanding
International Conference on Computational Linguistics (COLING), 2022
Xuming Hu
Zhijiang Guo
Yuwei Fu
Lijie Wen
Philip S. Yu
205
3
0
15 Sep 2022
Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning
European Conference on Computer Vision (ECCV), 2022
Tao He
Lianli Gao
Jingkuan Song
Yuan-Fang Li
VLM
276
69
0
17 Aug 2022
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation
Liguang Zhou
Yuhongze Zhou
Tin Lun Lam
Yangsheng Xu
EDL
MoE
312
3
0
15 Aug 2022
Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning
BigData Congress [Services Society] (BSS), 2022
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
456
30
0
13 Aug 2022
K-UNN: k-Space Interpolation With Untrained Neural Network
Zhuoxu Cui
Seng Jia
Qingyong Zhu
Congcong Liu
Zhilang Qiu
Yuanyuan Liu
Jing Cheng
Haifeng Wang
Yanjie Zhu
Dong Liang
143
1
0
11 Aug 2022
Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation
Lin Li
Long Chen
Hanrong Shi
Wenxiao Wang
Jian Shao
Yi Yang
Jun Xiao
VLM
263
29
0
07 Aug 2022
Rethinking the Evaluation of Unbiased Scene Graph Generation
British Machine Vision Conference (BMVC), 2022
Xingchen Li
Long Chen
Jian Shao
Shaoning Xiao
Songyang Zhang
Jun Xiao
303
15
0
03 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation
ACM Multimedia (ACM MM), 2022
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
196
30
0
03 Aug 2022
Iterative Scene Graph Generation
Neural Information Processing Systems (NeurIPS), 2022
Siddhesh Khandelwal
Leonid Sigal
OCL
221
37
0
27 Jul 2022
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Lin Li
Jun Xiao
Hanrong Shi
Hanwang Zhang
Yi Yang
Wen Liu
Long Chen
216
31
0
27 Jul 2022
Retrieval-Augmented Transformer for Image Captioning
International Conference on Content-Based Multimedia Indexing (CBMI), 2022
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
202
69
0
26 Jul 2022
Rethinking the Reference-based Distinctive Image Captioning
ACM Multimedia (ACM MM), 2022
Yangjun Mao
Long Chen
Zhihong Jiang
Dong Zhang
Zhimeng Zhang
Jian Shao
Jun Xiao
DiffM
228
23
0
22 Jul 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
233
33
0
22 Jul 2022
GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features
European Conference on Computer Vision (ECCV), 2022
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
ViT
218
148
0
20 Jul 2022
Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xinyu Lyu
Lianli Gao
Pengpeng Zeng
Hengtao Shen
Jingkuan Song
224
21
0
11 Jul 2022
Exploring the sequence length bottleneck in the Transformer for Image Captioning
Jiapeng Hu
Roberto Cavicchioli
Alessandro Capotondi
ViT
330
4
0
07 Jul 2022
Comprehending and Ordering Semantics for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2022
Yehao Li
Yingwei Pan
Ting Yao
Tao Mei
193
114
0
14 Jun 2022
Visual Transformer for Object Detection
M. Yang
ViT
127
10
0
01 Jun 2022
Importance Weighted Structure Learning for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Daqing Liu
M. Bober
J. Kittler
319
8
0
14 May 2022
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2022
Chia-Wen Kuo
Z. Kira
266
81
0
09 May 2022
Controllable Image Captioning
Luka Maxwell
364
0
0
28 Apr 2022
Attention Mechanism based Cognition-level Scene Understanding
Xuejiao Tang
Tai Le Quy
LRM
345
0
0
17 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
188
23
0
08 Apr 2022
Fine-Grained Predicates Learning for Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Xinyu Lyu
Lianli Gao
Yuyu Guo
Zhou Zhao
Hao Huang
Hengtao Shen
Jingkuan Song
286
45
0
06 Apr 2022
Fine-Grained Scene Graph Generation with Data Transfer
European Conference on Computer Vision (ECCV), 2022
Ao Zhang
Yuan Yao
Qián Chen
Wei Ji
Zhiyuan Liu
Maosong Sun
Tat-Seng Chua
315
111
0
22 Mar 2022
Self-Supervised Road Layout Parsing with Graph Auto-Encoding
Chenyang Lu
Gijs Dubbelman
SSL
255
1
0
21 Mar 2022
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
European Conference on Computer Vision (ECCV), 2022
Youming Deng
Yansheng Li
Yongjun Zhang
Xiang Xiang
Jian Wang
Jingdong Chen
Jiayi Ma
317
26
0
14 Mar 2022
Spatial Commonsense Graph for Object Localisation in Partial Scenes
Computer Vision and Pattern Recognition (CVPR), 2022
Francesco Giuliari
Geri Skenderi
Marco Cristani
Yiming Wang
Alessio Del Bue
259
20
0
10 Mar 2022
Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Ran Chen
Hanli Wang
Lei Wang
Sam Kwong
162
10
0
10 Mar 2022
Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Tengpeng Li
Hanli Wang
Bin He
Changan Chen
DiffM
247
16
0
10 Mar 2022
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
European Conference on Computer Vision (ECCV), 2022
Yang Jiao
Shaoxiang Chen
Zequn Jie
Wenke Huang
Lin Ma
Yu-Gang Jiang
3DPC
298
59
0
10 Mar 2022
Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition
IEEE Transactions on Multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Yong Luo
Zhenglong Sun
Wei-Shi Zheng
Yaowei Wang
Chen Chen
217
15
0
07 Mar 2022
CaMEL: Mean Teacher Learning for Image Captioning
International Conference on Pattern Recognition (ICPR), 2022
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
200
37
0
21 Feb 2022
What Functions Can Graph Neural Networks Generate?
Mohammad Fereydounian
Hamed Hassani
Amin Karbasi
194
5
0
17 Feb 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
211
238
0
11 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
ACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
485
155
0
31 Jan 2022
A Frustratingly Simple Approach for End-to-End Image Captioning
Ziyang Luo
Yadong Xi
Rongsheng Zhang
Jing Ma
VLM
MLLM
244
19
0
30 Jan 2022
Constrained Structure Learning for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Daqing Liu
M. Bober
J. Kittler
3DV
CML
BDL
OCL
257
9
0
27 Jan 2022
RelTR: Relation Transformer for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuren Cong
M. Yang
Bodo Rosenhahn
ViT
467
184
0
27 Jan 2022