Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

23 February 2016
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
Joshua Kravitz
Stephanie Chen
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li

Papers citing "Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations"

50 / 903 papers shown
Title
Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey
W. Zhang
Quan Z. Sheng
A. Alhazmi
Chenliang Li
AAML
21
57
0
21 Jan 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
31
321
0
20 Jan 2019
Evaluating Text-to-Image Matching using Binary Image Selection (BISON)
Hexiang Hu
Ishan Misra
L. V. D. van der Maaten
24
22
0
19 Jan 2019
Using Scene Graph Context to Improve Image Generation
Subarna Tripathi
Anahita Bhiwandiwalla
A. Bastidas
Hanlin Tang
GNN
45
32
0
11 Jan 2019
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
13
466
0
20 Dec 2018
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
27
190
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
19
362
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
41
476
0
12 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
21
692
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
14
159
0
06 Dec 2018
Image Generation from Layout
Bo-Lu Zhao
Lili Meng
Weidong Yin
Leonid Sigal
11
208
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
27
865
0
27 Nov 2018
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation
Matteo Tomei
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
17
76
0
26 Nov 2018
Object-oriented Targets for Visual Navigation using Rich Semantic Representations
Jean-Benoit Delbrouck
Stéphane Dupont
26
3
0
22 Nov 2018
Scene Graph Generation via Conditional Random Fields
Weilin Cong
W. Wang
Wang-Chien Lee
GNN
16
22
0
20 Nov 2018
Explicit Bias Discovery in Visual Question Answering Models
Varun Manjunatha
Nirat Saini
L. Davis
CML
FAtt
19
92
0
19 Nov 2018
LinkNet: Relational Embedding for Scene Graph
Sanghyun Woo
Dahun Kim
Donghyeon Cho
In So Kweon
GNN
13
147
0
15 Nov 2018
Hybrid Knowledge Routed Modules for Large-scale Object Detection
Chenhan Jiang
Hang Xu
Xiaodan Liang
Liang Lin
VLM
ObjD
31
86
0
30 Oct 2018
Visual Semantic Navigation using Scene Priors
Wei Yang
X. Wang
Ali Farhadi
Abhinav Gupta
Roozbeh Mottaghi
LM&Ro
22
320
0
15 Oct 2018
The Focus-Aspect-Polarity Model for Predicting Subjective Noun Attributes in Images
Tushar Karayil
Philipp Blandfort
Jörn Hees
Andreas Dengel
19
0
0
15 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md. Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
28
760
0
06 Oct 2018
Context-Dependent Diffusion Network for Visual Relationship Detection
Zhen Cui
Chunyan Xu
Wenming Zheng
Jian Yang
GNN
14
50
0
11 Sep 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
Shivang Agarwal
Jean Ogier du Terrail
F. Jurie
ObjD
18
123
0
10 Sep 2018
Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
M. Pietikäinen
ObjD
VLM
OOD
70
2,419
0
06 Sep 2018
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
13
402
0
06 Sep 2018
Interpretable Visual Question Answering by Reasoning on Dependency Trees
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
27
55
0
06 Sep 2018
OCNet: Object Context Network for Scene Parsing
Yuhui Yuan
Lang Huang
Jianyuan Guo
Chao Zhang
Xilin Chen
Jingdong Wang
23
599
0
04 Sep 2018
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Houfeng Wang
Xu Sun
95
65
0
27 Aug 2018
Context-Aware Visual Policy Network for Sequence-Level Image Captioning
Daqing Liu
Zhengjun Zha
Hanwang Zhang
Yongdong Zhang
Feng Wu
CLIP
33
103
0
16 Aug 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
28
836
0
01 Aug 2018
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
Xu Yang
Hanwang Zhang
Jianfei Cai
42
74
0
01 Aug 2018
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning
P. Gay
Stuart James
Alessio Del Bue
OCL
42
31
0
16 Jul 2018
Object Relation Detection Based on One-shot Learning
Li Zhou
Jian-jun Zhao
Jianshu Li
Li-xin Yuan
Jiashi Feng
ObjD
14
23
0
16 Jul 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
Guojun Yin
Lu Sheng
Bin Liu
Nenghai Yu
Xiaogang Wang
Jing Shao
Chen Change Loy
ObjD
30
156
0
13 Jul 2018
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
22
170
0
06 Jul 2018
Long Activity Video Understanding using Functional Object-Oriented Network
Ahmad Babaeian Jelodar
D. Paulius
Yu Sun
23
35
0
03 Jul 2018
Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su
Chen Zhu
Yinpeng Dong
Dongqi Cai
Yurong Chen
Jianguo Li
34
62
0
13 Jun 2018
Visual Relationship Detection Based on Guided Proposals and Semantic Knowledge Distillation
François Plesse
A. Gînsca
Bertrand Delezoide
F. Prêteux
13
29
0
28 May 2018
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
Pan Lu
Lei Ji
Wei Zhang
Nan Duan
M. Zhou
Jianyong Wang
CoGe
17
79
0
24 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
22
19
0
14 May 2018
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
Zhou Yu
Jun-chen Yu
Chenchao Xiang
Zhou Zhao
Q. Tian
Dacheng Tao
ObjD
18
138
0
09 May 2018
Automatic Metric Validation for Grammatical Error Correction
Leshem Choshen
Omri Abend
14
30
0
30 Apr 2018
Zero-Shot Object Detection
Ankan Bansal
Karan Sikka
Gaurav Sharma
Rama Chellappa
Ajay Divakaran
VLM
ObjD
35
359
0
12 Apr 2018
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen
Takayuki Okatani
22
279
0
03 Apr 2018
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
22
94
0
28 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
19
202
0
12 Mar 2018
Neural Aesthetic Image Reviewer
Wenshan Wang
Su Yang
Weishan Zhang
Jiulong Zhang
19
38
0
28 Feb 2018
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction
Roei Herzig
Moshiko Raboh
Gal Chechik
Jonathan Berant
Amir Globerson
GNN
OCL
24
133
0
15 Feb 2018
From BoW to CNN: Two Decades of Texture Representation for Texture Classification
Li Liu
Jie Chen
Paul Fieguth
Guoying Zhao
Rama Chellappa
M. Pietikäinen
3DV
39
332
0
31 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
33
363
0
24 Jan 2018