1511.07571
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
24 November 2015
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
Papers citing "DenseCap: Fully Convolutional Localization Networks for Dense Captioning"
50 / 452 papers shown
Title
Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards
Junjie Zhang
Qi Wu
Chunhua Shen
Jian Andrew Zhang
Jianfeng Lu
A. Hengel
LRM
29
29
0
21 Nov 2017
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
19
50
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
A. Hengel
ObjD
19
134
0
17 Nov 2017
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
20
8
0
07 Nov 2017
BENCHIP: Benchmarking Intelligence Processors
Jinhua Tao
Zidong Du
Qi Guo
Huiying Lan
Lei Zhang
...
Allen Rush
Willian Chen
Shaoli Liu
Yunji Chen
Tianshi Chen
20
35
0
23 Oct 2017
Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions
Jun Hatori
Yuta Kikuchi
Sosuke Kobayashi
K. Takahashi
Yuta Tsuboi
Y. Unno
W. Ko
Jethro Tan
22
160
0
17 Oct 2017
iVQA: Inverse Visual Question Answering
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
31
47
0
10 Oct 2017
What Does Explainable AI Really Mean? A New Conceptualization of Perspectives
Derek Doran
Sarah Schulz
Tarek R. Besold
XAI
24
436
0
02 Oct 2017
Semantic Segmentation from Limited Training Data
Anton Milan
Trung T. Pham
B. V. Kumar
D. Morrison
Adam W. Tow
...
Christopher F. Lehnert
G. Lin
Ian Reid
Peter Corke
Jurgen Leitner
16
51
0
22 Sep 2017
Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li
Nan Duan
Bolei Zhou
Xiao Chu
Wanli Ouyang
Xiaogang Wang
29
165
0
21 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CML
BDL
20
107
0
15 Sep 2017
Joint Learning of Set Cardinality and State Distribution
S. Hamid Rezatofighi
Anton Milan
Javen Qinfeng Shi
A. Dick
Ian Reid
SSL
BDL
21
16
0
13 Sep 2017
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
Yuming Shen
Li Liu
Ling Shao
Jingkuan Song
20
49
0
08 Aug 2017
Scene Graph Generation from Objects, Phrases and Region Captions
Yikang Li
Wanli Ouyang
Bolei Zhou
Kun Wang
Xiaogang Wang
21
499
0
31 Jul 2017
Weakly-supervised learning of visual relations
Julia Peyre
Ivan Laptev
Cordelia Schmid
Josef Sivic
11
193
0
29 Jul 2017
Deep Interactive Region Segmentation and Captioning
Ali Sharifi Boroujerdi
M. Khanian
M. Breuß
16
7
0
26 Jul 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
29
4,180
0
25 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
22
1
0
20 Jul 2017
Video Question Answering via Attribute-Augmented Attention Network Learning
Yunan Ye
Zhou Zhao
Yimeng Li
Long Chen
Jun Xiao
Yueting Zhuang
6
107
0
20 Jul 2017
Grounding Spatio-Semantic Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
ObjD
11
20
0
18 Jul 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
L. Yang
MedIm
13
301
0
08 Jul 2017
Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation
Jean-Benoit Delbrouck
Stéphane Dupont
Omar Seddati
17
8
0
04 Jul 2017
Pedestrian Alignment Network for Large-scale Person Re-identification
Zhedong Zheng
Liang Zheng
Yi Yang
19
478
0
03 Jul 2017
Where to Play: Retrieval of Video Segments using Natural-Language Queries
Sangkuk Lee
Daesik Kim
Myunggi Lee
Jihye Hwang
Nojun Kwak
25
3
0
02 Jul 2017
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention
Marcella Cornia
Lorenzo Baraldi
Giuseppe Serra
Rita Cucchiara
11
79
0
26 Jun 2017
Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation
Satoshi Tsutsui
David J. Crandall
11
19
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
14
180
0
19 Jun 2017
Who Will Share My Image? Predicting the Content Diffusion Path in Online Social Networks
Wenjian Hu
Krishna Kumar Singh
Fanyi Xiao
Jinyoung Han
Chen-Nee Chuah
Yong Jae Lee
GNN
DiffM
14
1
0
25 May 2017
Deep image representations using caption generators
Konda Reddy Mopuri
Vishal B. Athreya
R. Venkatesh Babu
VLM
SSL
14
1
0
25 May 2017
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases
Xiaosong Wang
Yifan Peng
Le Lu
Zhiyong Lu
M. Bagheri
Ronald M. Summers
LM&MA
9
2,467
0
05 May 2017
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Fanyi Xiao
Leonid Sigal
Yong Jae Lee
19
138
0
03 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
48
1,214
0
02 May 2017
AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture
Suman Saha
Gurkirt Singh
Fabio Cuzzolin
16
70
0
17 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong-Ming Zhang
M. Shah
9
12
0
15 Apr 2017
Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen
Abhinav Gupta
ObjD
19
164
0
13 Apr 2017
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Y. Zhang
Luyao Yuan
Yijie Guo
Zhiyuan He
I-An Huang
Honglak Lee
ObjD
28
57
0
12 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li-Jia Li
26
324
0
12 Apr 2017
What's in a Question: Using Visual Questions as a Form of Supervision
Siddha Ganju
Olga Russakovsky
Abhinav Gupta
11
16
0
12 Apr 2017
Creativity: Generating Diverse Questions using Variational Autoencoders
Unnat Jain
Ziyu Zhang
A. Schwing
17
152
0
11 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
27
494
0
11 Apr 2017
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering
V. Kazemi
Ali Elqursh
OOD
26
183
0
11 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People
Anna Rohrbach
Marcus Rohrbach
Siyu Tang
Seong Joon Oh
Bernt Schiele
314
72
0
05 Apr 2017
Weakly Supervised Dense Video Captioning
Zhiqiang Shen
Jianguo Li
Zhou Su
Minjun Li
Yurong Chen
Yu-Gang Jiang
Xiangyang Xue
21
134
0
05 Apr 2017
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
Tanmay Gupta
Kevin J. Shih
Saurabh Singh
Derek Hoiem
29
26
0
02 Apr 2017
Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention
Jinkyu Kim
John F. Canny
FAtt
XAI
OOD
MILM
CML
30
333
0
30 Mar 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation
Albert Gatt
E. Krahmer
LM&MA
ELM
18
809
0
29 Mar 2017
Neural Ctrl-F: Segmentation-free Query-by-String Word Spotting in Handwritten Manuscript Collections
T. Wilkinson
Jonas Lindström
Anders Brun
14
38
0
22 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning
Fan Wu
Zhongwen Xu
Yi Yang
ObjD
23
11
0
22 Mar 2017
Recurrent Topic-Transition GAN for Visual Paragraph Generation
Xiaodan Liang
Zhiting Hu
H. M. Zhang
Chuang Gan
Eric P. Xing
GAN
19
200
0
21 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
31
423
0
20 Mar 2017