ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.07571
  4. Cited By
DenseCap: Fully Convolutional Localization Networks for Dense Captioning

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

24 November 2015
Justin Johnson
A. Karpathy
Li Fei-Fei
    VLM
ArXiv (abs)PDFHTML

Papers citing "DenseCap: Fully Convolutional Localization Networks for Dense Captioning"

50 / 468 papers shown
Title
Visual Entailment Task for Visually-Grounded Language Learning
Visual Entailment Task for Visually-Grounded Language Learning
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
121
59
0
26 Nov 2018
Senti-Attend: Image Captioning using Sentiment and Attention
Senti-Attend: Image Captioning using Sentiment and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
VLM
101
18
0
24 Nov 2018
Object-oriented Targets for Visual Navigation using Rich Semantic
  Representations
Object-oriented Targets for Visual Navigation using Rich Semantic Representations
Jean-Benoit Delbrouck
Stéphane Dupont
143
3
0
22 Nov 2018
Intention Oriented Image Captions with Guiding Objects
Intention Oriented Image Captions with Guiding ObjectsComputer Vision and Pattern Recognition (CVPR), 2018
Yue Zheng
Yali Li
Shengjin Wang
165
56
0
19 Nov 2018
Revisiting Image-Language Networks for Open-ended Phrase Detection
Revisiting Image-Language Networks for Open-ended Phrase Detection
Bryan A. Plummer
Kevin J. Shih
Yichen Li
Ke Xu
Svetlana Lazebnik
Stan Sclaroff
Kate Saenko
ObjDSSeg
126
4
0
17 Nov 2018
Image Captioning as Neural Machine Translation Task in SOCKEYE
Image Captioning as Neural Machine Translation Task in SOCKEYE
Loris Bazzani
Tobias Domhan
Felix Hieber
VLM
154
2
0
09 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM3DV
294
840
0
06 Oct 2018
Team NimbRo at MBZIRC 2017: Autonomous Valve Stem Turning using a Wrench
Team NimbRo at MBZIRC 2017: Autonomous Valve Stem Turning using a Wrench
Max Schwarz
David Droeschel
Christian Lenz
Arul Selvam Periyasamy
En Yen Puang
Jan Razlaw
Diego Rodriguez
Sebastian Schüller
M. Schreiber
Sven Behnke
118
17
0
06 Oct 2018
RGB-D Object Detection and Semantic Segmentation for Autonomous
  Manipulation in Clutter
RGB-D Object Detection and Semantic Segmentation for Autonomous Manipulation in Clutter
Max Schwarz
Anton Milan
Arul Selvam Periyasamy
Sven Behnke
3DPC
168
169
0
01 Oct 2018
Vector Learning for Cross Domain Representations
Vector Learning for Cross Domain RepresentationsInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2017
Shagan Sah
Chi Zhang
Thang Nguyen
D. Peri
Ameya Shringi
R. Ptucha
GAN
91
3
0
27 Sep 2018
Object Detection from Scratch with Deep Supervision
Object Detection from Scratch with Deep SupervisionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018
Zhiqiang Shen
Zhuang Liu
Jianguo Li
Yu-Gang Jiang
Yurong Chen
Xiangyang Xue
ObjD
201
89
0
25 Sep 2018
Image Reassembly Combining Deep Learning and Shortest Path Problem
Image Reassembly Combining Deep Learning and Shortest Path Problem
Marie-Morgane Paumard
David Picard
Hedi Tabia
OCL3DV
96
28
0
04 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
Alex Schwing
122
67
0
03 Sep 2018
Wavelet based edge feature enhancement for convolutional neural networks
Wavelet based edge feature enhancement for convolutional neural networks
Dedimuni D. De Silva
Subha Fernando
I. Piyatilake
A. Karunarathne
176
16
0
29 Aug 2018
Multimodal Differential Network for Visual Question Generation
Multimodal Differential Network for Visual Question Generation
Badri N. Patro
Sandeep Kumar
V. Kurmi
Vinay P. Namboodiri
205
40
0
12 Aug 2018
Community Regularization of Visually-Grounded Dialog
Community Regularization of Visually-Grounded Dialog
Akshat Agarwal
Swaminathan Gurumurthy
Vasu Sharma
M. Lewis
Katia Sycara
133
10
0
10 Aug 2018
Improving Deep Visual Representation for Person Re-identification by
  Global and Local Image-language Association
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association
Dapeng Chen
Jiaming Song
Xihui Liu
Yantao Shen
Zejian Yuan
Xiaogang Wang
165
148
0
05 Aug 2018
Equal But Not The Same: Understanding the Implicit Relationship Between
  Persuasive Images and Text
Equal But Not The Same: Understanding the Implicit Relationship Between Persuasive Images and TextBritish Machine Vision Conference (BMVC), 2018
Ruotong Wang
R. Hwa
Adriana Kovashka
154
59
0
21 Jul 2018
Presentation Attack Detection for Cadaver Iris
Presentation Attack Detection for Cadaver Iris
Mateusz Trokielewicz
A. Czajka
P. Maciejewicz
CVBM
133
27
0
11 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language
  queries
Dynamic Multimodal Instance Segmentation guided by natural language queriesEuropean Conference on Computer Vision (ECCV), 2018
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
256
187
0
06 Jul 2018
Face-Cap: Image Captioning using Facial Expression Analysis
Face-Cap: Image Captioning using Facial Expression Analysis
Omid Mohamad Nezami
Mark Dras
Peter Anderson
Len Hamey
CVBM
97
30
0
06 Jul 2018
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph
  Generation
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph GenerationEuropean Conference on Computer Vision (ECCV), 2018
Yikang Li
Wanli Ouyang
Bolei Zhou
Jianping Shi
Yawen Cui
Xiaogang Wang
GNN
215
280
0
29 Jun 2018
Learning Multimodal Representations for Unseen Activities
Learning Multimodal Representations for Unseen Activities
A. Piergiovanni
Michael S. Ryoo
SSL
171
4
0
21 Jun 2018
Part-Aware Fine-grained Object Categorization using Weakly Supervised
  Part Detection Network
Part-Aware Fine-grained Object Categorization using Weakly Supervised Part Detection Network
Yabin Zhang
Kui Jia
Zhixin Wang
128
24
0
16 Jun 2018
Interactive Visual Grounding of Referring Expressions for Human-Robot
  Interaction
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
152
153
0
11 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Lin Wang
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
477
100
0
01 Jun 2018
GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story
  Generation
GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story Generation
Taehyeong Kim
Min-Oh Heo
Seonil Son
Kyoung-Wha Park
Byoung-Tak Zhang
172
82
0
28 May 2018
Identifying Object States in Cooking-Related Images
Identifying Object States in Cooking-Related Images
Ahmad Babaeian Jelodar
Md Sirajus Salekin
Yu Sun
248
39
0
17 May 2018
Deep Perm-Set Net: Learn to predict sets with unknown permutation and
  cardinality using deep neural networks
Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks
S. Hamid Rezatofighi
Roman Kaskman
F. Motlagh
Javen Qinfeng Shi
Zorah Lähner
Laura Leal-Taixé
Ian Reid
SSL
189
24
0
02 May 2018
Large-Scale Visual Relationship Understanding
Large-Scale Visual Relationship Understanding
Ji Zhang
Yannis Kalantidis
Marcus Rohrbach
Manohar Paluri
Ahmed Elgammal
Mohamed Elhoseiny
374
172
0
27 Apr 2018
Customized Image Narrative Generation via Interactive Visual Question
  Generation and Answering
Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
177
8
0
27 Apr 2018
Entity-aware Image Caption Generation
Entity-aware Image Caption GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Di Lu
Spencer Whitehead
Lifu Huang
Heng Ji
Shih-Fu Chang
VLM
124
85
0
21 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
198
345
0
13 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Christian Schroeder de Witt
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
147
18
0
11 Apr 2018
Decoupled Novel Object Captioner
Decoupled Novel Object Captioner
Yuehua Wu
Linchao Zhu
Lu Jiang
Yi Yang
254
63
0
11 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
283
244
0
07 Apr 2018
Guess Where? Actor-Supervision for Spatiotemporal Action Localization
Guess Where? Actor-Supervision for Spatiotemporal Action Localization
Victor Escorcia
Cuong Duc Dao
Mihir Jain
Guohao Li
Cees G. M. Snoek
196
33
0
05 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory
  Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
211
206
0
04 Apr 2018
Guide Me: Interacting with Deep Networks
Guide Me: Interacting with Deep Networks
Christian Rupprecht
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
HAI
135
39
0
30 Mar 2018
A New Target-specific Object Proposal Generation Method for Visual
  Tracking
A New Target-specific Object Proposal Generation Method for Visual Tracking
Guanjun Guo
Hanzi Wang
Yan Yan
H. Liao
Yue Liu
97
5
0
27 Mar 2018
Neural Baby Talk
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
331
456
0
27 Mar 2018
Explicit Reasoning over End-to-End Neural Architectures for Visual
  Question Answering
Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering
Somak Aditya
Yezhou Yang
Chitta Baral
LRMNAIReLM
148
54
0
23 Mar 2018
EVA$^2$: Exploiting Temporal Redundancy in Live Computer Vision
EVA2^22: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
255
85
0
16 Mar 2018
Object Captioning and Retrieval with Natural Language
Object Captioning and Retrieval with Natural Language
A. Nguyen
Thanh-Toan Do
Ian Reid
D. Caldwell
Nikos G. Tsagarakis
3DV
95
21
0
16 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis
  Tool
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis ToolIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
165
30
0
16 Mar 2018
Approximate Query Matching for Image Retrieval
Approximate Query Matching for Image Retrieval
Abhijit Suprem
Polo Chau
75
1
0
14 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Feiyu Xiong
Qingming Huang
149
207
0
05 Mar 2018
Joint Event Detection and Description in Continuous Video Streams
Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu
Boyang Albert Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko
128
58
0
28 Feb 2018
Neural Aesthetic Image Reviewer
Neural Aesthetic Image ReviewerIET Computer Vision (ICV), 2018
Wenshan Wang
Su Yang
Weishan Zhang
Jiulong Zhang
113
41
0
28 Feb 2018
Teaching Machines to Code: Neural Markup Generation with Visual
  Attention
Teaching Machines to Code: Neural Markup Generation with Visual Attention
Sumeet S. Singh
129
9
0
15 Feb 2018
Previous
123...106789
Next