Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.17043
Cited By
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
28 November 2023
Yanwei Li
Chengyao Wang
Jiaya Jia
VLM
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models"
2 / 202 papers shown
Title
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Previous
1
2
3
4
5