arXiv: 2312.17432
Video Understanding with Large Language Models: A Survey

29 December 2023
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
Teng Wang
Daoan Zhang
Jie An
Jingyang Lin
Rongyi Zhu
Ali Vosoughi
Chao Huang
Zeliang Zhang
Pinxin Liu
Mingqian Feng
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
    VLM
ArXiv (abs) | PDF | HTML | HuggingFace (3 upvotes) | GitHub (2325★)

Papers citing "Video Understanding with Large Language Models: A Survey"

4 / 104 papers shown
Title
LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Yunxin Li
Xinyu Chen
Baotian Hu
Min Zhang
223
9
0
21 Feb 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao
N. B. Gundavarapu
Liangzhe Yuan
Hao Zhou
Shen Yan
...
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Ting Liu
Boqing Gong
VGen
337
62
0
20 Feb 2024
Tri$^{2}$-plane: Thinking Head Avatar via Feature Pyramid
European Conference on Computer Vision (ECCV), 2024
Luchuan Song
Pinxin Liu
Lele Chen
Guojun Yin
Chenliang Xu
3DH
232
14
0
17 Jan 2024
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo
Ziwang Zhao
Min Yang
Junwei Dong
Da Li
Pengcheng Lu
Tao Wang
Linmei Hu
Ming-Hui Qiu
MLLM
423
247
0
12 Jun 2023