1511.07571
Cited By
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
24 November 2015
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
Papers citing
"DenseCap: Fully Convolutional Localization Networks for Dense Captioning"
50 / 468 papers shown
Title
Bypass Network for Semantics Driven Image Paragraph Captioning
Computer Vision and Image Understanding (CVIU), 2022
Qinjie Zheng
Chaoyue Wang
Dadong Wang
206
1
0
21 Jun 2022
FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNs
International Conference on Pattern Recognition (ICPR), 2022
Hui Li
Zihao Li
Rui Ma
Tieru Wu
FAtt
127
13
0
17 Jun 2022
Language Models Can See: Plugging Visual Controls in Text Generation
Yixuan Su
Tian Lan
Yahui Liu
Fangyu Liu
Dani Yogatama
Yan Wang
Lingpeng Kong
Nigel Collier
VLM
MLLM
223
111
0
05 May 2022
Diverse Image Captioning with Grounded Style
German Conference on Pattern Recognition (GCPR), 2022
Franz Klein
Shweta Mahajan
S. Roth
171
8
0
03 May 2022
CapOnImage: Context-driven Dense-Captioning on Image
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yiqi Gao
Xinglin Hou
Yuanmeng Zhang
Bo Xiao
Yuning Jiang
Peifeng Wang
178
13
0
27 Apr 2022
"It Feels Like Being Locked in A Cage": Understanding Blind or Low Vision Streamers' Perceptions of Content Curation Algorithms
Ethan Z. Rong
Mo Morgana Zhou
Zhicong Lu
Mingming Fan
84
28
0
24 Apr 2022
Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Heng Wang
Chaoyi Zhang
Jianhui Yu
Weidong (Tom) Cai
3DPC
178
48
0
22 Apr 2022
Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection
Arshi Parvaiz
Muhammad Anwaar Khalid
Rukhsana Zafar
Huma Ameer
M. Ali
M. Fraz
MedIm
216
94
0
29 Mar 2022
ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
The Web Conference (WWW), 2022
Kohei Uehara
Yusuke Mori
Yusuke Mukuta
Tatsuya Harada
293
6
0
15 Feb 2022
Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs
International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP), 2022
Daniel Louzada Fernandes
Marcos Henrique Fonseca Ribeiro
F. Cerqueira
Michel Melo Silva
101
7
0
10 Feb 2022
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
European Conference on Computer Vision (ECCV), 2022
Jack Hessel
Jena D. Hwang
Jinho Park
Rowan Zellers
Chandra Bhagavatula
Anna Rohrbach
Kate Saenko
Yejin Choi
ReLM
446
59
0
10 Feb 2022
Robotic Grasping from Classical to Modern: A Survey
Hanbo Zhang
Jian Tang
Shiguang Sun
Xuguang Lan
200
54
0
08 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
ACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
350
143
0
31 Jan 2022
Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
IEEE International Conference on Image Processing (ICIP), 2021
Philipp Harzig
Moritz Einfalt
Rainer Lienhart
ViT
149
2
0
28 Dec 2021
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
Ayush Jain
N. Gkanatsios
Ishita Mediratta
Katerina Fragkiadaki
ObjD
425
145
0
16 Dec 2021
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
DiffM
223
48
0
13 Dec 2021
Magnifying Networks for Images with Billions of Pixels
Neofytos Dimitriou
Ognjen Arandjelovic
329
3
0
12 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
168
49
0
02 Dec 2021
Object-Centric Unsupervised Image Captioning
Zihang Meng
David Yang
Xuefei Cao
Ashish Shah
Ser-Nam Lim
OCL
VLM
174
14
0
02 Dec 2021
ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with Genetics
Computer Vision and Pattern Recognition (CVPR), 2021
Aiham Taleb
Matthias Kirchler
Remo Monti
Christoph Lippert
SSL
MedIm
422
69
0
26 Nov 2021
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot
Pradip Pramanick
Chayan Sarkar
Snehasis Banerjee
Brojeshwar Bhowmick
168
18
0
22 Nov 2021
ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Laurynas Karazija
Iro Laina
Christian Rupprecht
3DV
VOS
274
100
0
19 Nov 2021
Single-Modal Entropy based Active Learning for Visual Question Answering
British Machine Vision Conference (BMVC), 2021
Dong-Jin Kim
Jae-Won Cho
Jinsoo Choi
Yunjae Jung
In So Kweon
178
14
0
21 Oct 2021
Integrating Visuospatial, Linguistic and Commonsense Structure into Story Visualization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
A. Maharana
Joey Tianyi Zhou
222
68
0
21 Oct 2021
A Self-Explainable Stylish Image Captioning Framework via Multi-References
Chengxi Li
Brent Harrison
187
0
0
20 Oct 2021
AUTO-DISCERN: Autonomous Driving Using Common Sense Reasoning
Suraj Kothawade
Vinaya Khandelwal
Kinjal Basu
Huaduo Wang
Gopal Gupta
LRM
111
26
0
17 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
IEEE International Conference on Computer Vision (ICCV), 2021
Wenbin Wang
R. Wang
X. Chen
DiffM
208
16
0
12 Oct 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong Liu
Chunyan Miao
ViT
154
3
0
29 Sep 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
139
38
0
28 Sep 2021
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
181
49
0
21 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
264
67
0
15 Sep 2021
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
97
1
0
08 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
222
32
0
07 Sep 2021
Improving Object Detection and Attribute Recognition by Feature Entanglement Reduction
IEEE International Conference on Image Processing (ICIP), 2021
Zhao-Heng Zheng
Arka Sadhu
Ramkant Nevatia
76
2
0
25 Aug 2021
INVIGORATE: Interactive Visual Grounding and Grasping in Clutter
Hanbo Zhang
Yunfan Lu
Cunjun Yu
David Hsu
Xuguang Lan
Nanning Zheng
LM&Ro
183
70
0
25 Aug 2021
Caption Generation on Scenes with Seen and Unseen Object Categories
Image and Vision Computing (IVC), 2021
B. Demirel
R. G. Cinbis
VLM
262
2
0
13 Aug 2021
Neural Twins Talk & Alternative Calculations
International Journal of Semantic Computing (IJSC), 2021
Zanyar Zohourianshahzadi
Jugal Kalita
126
0
0
05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
ACM Multimedia (ACM MM), 2021
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
249
71
0
05 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ACM Multimedia (ACM MM), 2021
Xuewen Yang
Yingru Liu
Xin Wang
ViT
199
62
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
379
342
0
14 Jul 2021
Leveraging Explainability for Comprehending Referring Expressions in the Real World
Fethiye Irmak Dogan
G. I. Melsión
Iolanda Leite
174
8
0
12 Jul 2021
Controlled Caption Generation for Images Through Adversarial Attacks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
M. Shah
Lin Wang
AAML
111
12
0
07 Jul 2021
Morphological Classification of Galaxies in S-PLUS using an Ensemble of Convolutional Networks
N. M. Cardoso
G. B. O. Schwarz
L. O. Dias
C. R. Bom
L. Sodré
C. Mendes de Oliveira
80
0
0
05 Jul 2021
Pre-Trained Models: Past, Present and Future
AI Open (AO), 2021
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
371
976
0
14 Jun 2021
Check It Again: Progressive Visual Question Answering via Visual Entailment
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Q. Si
Zheng Lin
Mingyu Zheng
Peng Fu
Weiping Wang
139
52
0
08 Jun 2021
Giving Commands to a Self-Driving Car: How to Deal with Uncertain Situations?
Engineering applications of artificial intelligence (EAAI), 2021
Thierry Deruyttere
Victor Milewski
Marie-Francine Moens
177
15
0
08 Jun 2021
An End-to-End Breast Tumour Classification Model Using Context-Based Patch Modelling- A BiLSTM Approach for Image Classification
S. Tripathi
S. Singh
H. Lee
115
49
0
05 Jun 2021
Connecting What to Say With Where to Look by Modeling Human Attention Traces
Computer Vision and Pattern Recognition (CVPR), 2021
Zihang Meng
Licheng Yu
Ning Zhang
Tamara L. Berg
Babak Damavandi
Vikas Singh
Amy Bearman
248
31
0
12 May 2021
Analyzing Online Political Advertisements
Findings of the Association for Computational Linguistics (Findings), 2021
Danae Sánchez Villegas
S. Mokaram
Nikolaos Aletras
198
12
0
09 May 2021
Towards Accurate Text-based Image Captioning with Content Diversity Exploration
Computer Vision and Pattern Recognition (CVPR), 2021
Guanghui Xu
Shuaicheng Niu
Zhuliang Yu
Yucheng Luo
Qing Du
Qi Wu
DiffM
212
67
0
23 Apr 2021