ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.16038
  4. Cited By
A Survey on Generative AI and LLM for Video Generation, Understanding,
  and Streaming

A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming

30 January 2024
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
    VGen
ArXiv (abs)PDFHTML

Papers citing "A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming"

20 / 20 papers shown
On-device System of Compositional Multi-tasking in Large Language Models
On-device System of Compositional Multi-tasking in Large Language Models
Ondrej Bohdal
Konstantinos Theodosiadis
Asterios Mpatziakas
Dimitris Filippidis
Iro Spyrou
...
Kyeng-Hun Lee
J. Moon
Hyeonmok Ko
Mete Ozay
Umberto Michieli
117
0
0
11 Oct 2025
Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions
Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future DirectionsExpert systems with applications (ESWA), 2025
Smita Khapre
Melkamu Mersha
Hassan Shakil
Jonali Baruah
Jugal Kalita
127
2
0
29 Sep 2025
SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation
SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation
Peng Hu
Yu Gu
Liang Luo
Fuji Ren
DiffMVGen
108
0
0
23 Aug 2025
DeepFleet: Multi-Agent Foundation Models for Mobile Robots
DeepFleet: Multi-Agent Foundation Models for Mobile Robots
Ameya Agaskar
Sriram Siva
William Pickering
Kyle O'Brien
Charles Kekeh
...
Tamir Hegazy
Scott Niekum
Usman A. Khan
Federico Pecora
Joseph W. Durham
149
1
0
12 Aug 2025
AI-Generated Video Detection via Perceptual Straightening
AI-Generated Video Detection via Perceptual Straightening
Christian Internò
Robert Geirhos
Markus Olhofer
Sunny Liu
Barbara Hammer
David Klindt
334
1
0
01 Jul 2025
Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing
Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing
Zehua Liu
Xiaolou Li
Li Guo
Lantian Li
D. Wang
154
1
0
27 May 2025
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design
Benjamin Schneider
Dongfu Jiang
Chao Du
Tianyu Pang
Wenhu Chen
VLM
231
4
0
22 May 2025
Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI
Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI
Danaja Rutar
Alva Markelius
Konstantinos Voudouris
José Hernández-Orallo
Lucy G. Cheke
OCLELM
489
1
0
27 Mar 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Philip S. Yu
Xuming Hu
Qingsong Wen
712
21
0
23 Mar 2025
Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object IdentificationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Vishnu Kabir Chhabra
Ding Zhu
Mohammad Mahdi Khalili
325
5
0
27 Feb 2025
LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation
Junchen Fu
Xuri Ge
Kaiwen Zheng
Ioannis Arapakis
Xin Xin
J. Jose
336
1
0
20 Feb 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in MedicineIEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CELM&MAVLM
766
69
0
17 Jan 2025
Generative AI for Cel-Animation: A Survey
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
Xuzhao Li
Zeliang Zhang
Chenliang Xu
VGen
695
17
0
08 Jan 2025
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
Ting Zhang
Zhiqiang Yuan
Yeshuang Zhu
Jinchao Zhang
DiffM
325
0
0
31 Dec 2024
Do Language Models Understand Time?
Do Language Models Understand Time?The Web Conference (WWW), 2024
Xi Ding
Lei Wang
919
10
0
18 Dec 2024
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
Yiming Zhang
Zhuokai Zhao
Zhaorun Chen
Zenghui Ding
Xianjun Yang
Yining Sun
1.1K
9
0
21 Nov 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
523
155
0
09 Jul 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video
  Generation
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVMVGen
274
66
0
08 Mar 2024
Video Understanding with Large Language Models: A Survey
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
Chenliang Xu
VLM
711
163
0
29 Dec 2023
Valley: Video Assistant with Large Language model Enhanced abilitY
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo
Ziwang Zhao
Min Yang
Junwei Dong
Da Li
Pengcheng Lu
Tao Wang
Linmei Hu
Ming-Hui Qiu
MLLM
516
253
0
12 Jun 2023
1