Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.14168
Cited By
M
3
^3
3
AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
21 March 2024
Zhe Chen
Heyang Liu
Wenyi Yu
Guangzhi Sun
Hongcheng Liu
Ji Wu
Chao Zhang
Yu Wang
Yanfeng Wang
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset"
1 / 1 papers shown
Title
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
Guangzhi Sun
Yudong Yang
Jimin Zhuang
Changli Tang
Y. Li
W. Li
Z. Ma
Chao Zhang
LRM
MLLM
VLM
64
3
0
17 Feb 2025
1