A Survey of Current Datasets for Vision and Language Research

23 June 2015

Francis Ferraro

N. Mostafazadeh

Ting-Hao 'Kenneth' Huang

Lucy Vanderwende

Margaret Mitchell

Papers citing "A Survey of Current Datasets for Vision and Language Research"

Title | Authors | Tags | Counts | Date
Multi-VQG: Generating Engaging Questions for Multiple Images | Min-Hsuan Yeh, Vincent Chen, Ting-Hao Huang, Lun-Wei Ku | CoGe | 18 7 0 | 14 Nov 2022
VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models | Felix Vogel, Nina Shvetsova, Leonid Karlinsky, Hilde Kuehne | VLM | 63 7 0 | 12 Sep 2022
Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks | Tianwei Chen, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Hajime Nagahara | VLM | 38 0 0 | 23 Aug 2022
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning | Paul Pu Liang, Yiwei Lyu, Xiang Fan, Zetian Wu, Yun Cheng, ..., Peter Wu, Michelle A. Lee, Yuke Zhu, Ruslan Salakhutdinov, Louis-Philippe Morency | VLM | 29 159 0 | 15 Jul 2021
Evaluating Text-to-Image Matching using Binary Image Selection (BISON) | Hexiang Hu, Ishan Misra, L. van der Maaten | | 24 22 0 | 19 Jan 2019
Paris-Lille-3D: a large and high-quality ground truth urban point cloud dataset for automatic segmentation and classification | Xavier Roynard, Jean-Emmanuel Deschaud, F. Goulette | 3DPC 3DV | 16 280 0 | 30 Nov 2017
An Analysis of Action Recognition Datasets for Language and Vision Tasks | Spandana Gella, Frank Keller | ObjD | 14 11 0 | 24 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering | Y. Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim | | 32 546 0 | 14 Apr 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation | Albert Gatt, E. Krahmer | LM&MA ELM | 27 808 0 | 29 Mar 2017
Visual Storytelling | Ting-Hao 'Kenneth' Huang, Francis Ferraro, N. Mostafazadeh, Ishan Misra, ..., C. L. Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell | VGen | 13 464 0 | 13 Apr 2016
Multimodal Pivots for Image Caption Translation | Julian Hitschler, Shigehiko Schamoni, Stefan Riezler | | 25 97 0 | 15 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures | Raffaella Bernardi, Ruken Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, A. Muscat, Barbara Plank | EGVM VLM | 21 363 0 | 15 Jan 2016