Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.09596
Cited By
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
12 December 2024
Pan Zhang
Xiaoyi Dong
Yuhang Cao
Yuhang Zang
Rui Qian
Xilin Wei
Lin Chen
Y. Li
Junbo Niu
Shuangrui Ding
Qipeng Guo
Haodong Duan
Xin Chen
Han Lv
Zheng Nie
Min-Xia Zhang
Bin Wang
W. Zhang
Xinyue Zhang
Jiaye Ge
Wei Li
Jingwen Li
Zhongying Tu
Conghui He
X. Zhang
K. Chen
Yu Qiao
D. Lin
Jiaqi Wang
KELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions"
3 / 3 papers shown
Title
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video
Shuhang Xun
Sicheng Tao
J. Li
Yibo Shi
Zhixin Lin
...
Shikang Wang
Y. Liu
H. Zhang
Ying Ma
Xuming Hu
VLM
LRM
32
0
0
04 May 2025
Towards Understanding Camera Motions in Any Video
Zhiqiu Lin
Siyuan Cen
Daniel Jiang
Jay Karhade
Hewei Wang
...
Rushikesh Zawar
Xue Bai
Yilun Du
Chuang Gan
Deva Ramanan
VGen
21
0
0
21 Apr 2025
Large Language Models as Attribution Regularizers for Efficient Model Training
Davor Vukadin
Marin Šilić
Goran Delač
31
0
0
27 Feb 2025
1