2408.08093
Cited By
When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
17 February 2025
Pingping Zhang
Jinlong Li
Kecheng Chen
Meng Wang
Long Xu
Haoliang Li
N. Sebe
Sam Kwong
Shiqi Wang
VGen
Papers citing "When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding"
3 / 3 papers shown
GIViC: Generative Implicit Video Compression
Ge Gao
Siyue Teng
Tianhao Peng
Fan Zhang
David Bull
DiffM
VGen
36
0
0
25 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
67
0
0
20 Mar 2025
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
115
269
0
17 Jan 2024