OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts

Papers citing "OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts"

VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Computer Vision and Pattern Recognition (CVPR), 2021
184
73
0
28 Jan 2021
Language-Mediated, Object-Centric Representation Learning
Findings (Findings), 2020
Ruocheng Wang
Jiayuan Mao
S. Gershman
Jiajun Wu
203
13
0
31 Dec 2020
On Modality Bias in the TVQA Dataset
British Machine Vision Conference (BMVC), 2020
153
44
0
18 Dec 2020
Interpretable Neural Computation for Real-World Compositional Visual Question Answering
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2020
70
2
0
10 Oct 2020
Comprehensive Image Captioning via Scene Graph Decomposition
European Conference on Computer Vision (ECCV), 2020
191
136
0
23 Jul 2020
Image Captioning with Unseen Objects
British Machine Vision Conference (BMVC), 2019
199
17
0
31 Jul 2019
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Pranava Madhyastha
Josiah Wang
Lucia Specia
130
38
0
22 Jul 2019
Video Question Generation via Cross-Modal Self-Attention Networks Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
135
12
0
05 Jul 2019
Object Counts! Bringing Explicit Detections Back into Image Captioning
North American Chapter of the Association for Computational Linguistics (NAACL), 2018
Josiah Wang
Pranava Madhyastha
Lucia Specia
116
38
0
23 Apr 2018