2312.17432
Cited By
Video Understanding with Large Language Models: A Survey
29 December 2023
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
Teng Wang
Daoan Zhang
Jie An
Jingyang Lin
Rongyi Zhu
Ali Vosoughi
Chao Huang
Zeliang Zhang
Pinxin Liu
Mingqian Feng
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
VLM
HuggingFace (3 upvotes)
Github (2325★)
Papers citing
"Video Understanding with Large Language Models: A Survey"
4 / 104 papers shown
Title
LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Yunxin Li
Xinyu Chen
Baotian Hu
Min Zhang
223
9
0
21 Feb 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao
N. B. Gundavarapu
Liangzhe Yuan
Hao Zhou
Shen Yan
...
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Ting Liu
Boqing Gong
VGen
337
62
0
20 Feb 2024
Tri²-plane: Thinking Head Avatar via Feature Pyramid
European Conference on Computer Vision (ECCV), 2024
Luchuan Song
Pinxin Liu
Lele Chen
Guojun Yin
Chenliang Xu
3DH
232
14
0
17 Jan 2024
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo
Ziwang Zhao
Min Yang
Junwei Dong
Da Li
Pengcheng Lu
Tao Wang
Linmei Hu
Ming-Hui Qiu
MLLM
423
247
0
12 Jun 2023