Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.02779
Cited By
Unifying Vision-and-Language Tasks via Text Generation
4 February 2021
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unifying Vision-and-Language Tasks via Text Generation"
5 / 5 papers shown
Title
A Large Vision-Language Model based Environment Perception System for Visually Impaired People
Zezhou Chen
Zhaoxiang Liu
Kai Wang
Kohou Wang
Shiguo Lian
37
0
0
25 Apr 2025
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
235
147
0
02 Jan 2021
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
223
1,649
0
31 Dec 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
231
815
0
24 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
267
6,003
0
20 Apr 2018
1