Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2404.16038
Cited By

A Survey on Generative AI and LLM for Video Generation, Understanding,
and Streaming

A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming

30 January 2024

Lin Wang

ArXiv (abs)PDF HTML

Papers citing "A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming"

20 / 20 papers shown

On-device System of Compositional Multi-tasking in Large Language Models

On-device System of Compositional Multi-tasking in Large Language Models

Konstantinos Theodosiadis

Asterios Mpatziakas

Dimitris Filippidis

...

Umberto Michieli

117

0

0

11 Oct 2025

Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions

Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future DirectionsExpert systems with applications (ESWA), 2025

127

2

0

29 Sep 2025

SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

108

0

0

23 Aug 2025

DeepFleet: Multi-Agent Foundation Models for Mobile Robots

DeepFleet: Multi-Agent Foundation Models for Mobile Robots

William Pickering

...

Federico Pecora

Joseph W. Durham

149

1

0

12 Aug 2025

AI-Generated Video Detection via Perceptual Straightening

AI-Generated Video Detection via Perceptual Straightening

Christian Internò

334

1

0

01 Jul 2025

Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing

Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing

154

1

0

27 May 2025

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Benjamin Schneider

231

4

0

22 May 2025

Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI

Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI

Konstantinos Voudouris

José Hernández-Orallo

489

1

0

27 Mar 2025

MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection

MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection

712

21

0

23 Mar 2025

Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object IdentificationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Vishnu Kabir Chhabra

Mohammad Mahdi Khalili

325

5

0

27 Feb 2025

LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation

Ioannis Arapakis

336

1

0

20 Feb 2025

A Comprehensive Survey of Foundation Models in Medicine

A Comprehensive Survey of Foundation Models in MedicineIEEE Reviews in Biomedical Engineering (RBME), 2024

AI4CE LM&MA VLM

766

69

0

17 Jan 2025

Generative AI for Cel-Animation: A Survey

Generative AI for Cel-Animation: A Survey

...

695

17

0

08 Jan 2025

ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation

ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation

325

0

0

31 Dec 2024

Do Language Models Understand Time?

Do Language Models Understand Time?The Web Conference (WWW), 2024

919

10

0

18 Dec 2024

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

1.1K

9

0

21 Nov 2024

Mobile Edge Intelligence for Large Language Models: A Contemporary Survey

Mobile Edge Intelligence for Large Language Models: A Contemporary Survey

Wei Wei

Zheng Lin

Kaibin Huang

523

155

0

09 Jul 2024

Sora as an AGI World Model? A Complete Survey on Text-to-Video
Generation

Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation

Fachrina Dewi Puspitasari

Lik-Hang Lee

Choong Seon Hong

274

66

0

08 Mar 2024

Video Understanding with Large Language Models: A Survey

Video Understanding with Large Language Models: A Survey

...

711

163

0

29 Dec 2023

Valley: Video Assistant with Large Language model Enhanced abilitY

Valley: Video Assistant with Large Language model Enhanced abilitY

516

253

0

12 Jun 2023