Title |
---|
Towards Event-oriented Long Video Understanding Yifan Du Kun Zhou Yuqi Huo Yifan Li Wayne Xin Zhao Haoyu Lu Zijia Zhao Bingning Wang Weipeng Chen Ji-Rong Wen |
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning Zebang Cheng Zhi-Qi Cheng Jun-Yan He Jingdong Sun Kai Wang Yuxiang Lin Zheng Lian Xiaojiang Peng Alexander G. Hauptmann |
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Xuehai He Weixi Feng Kaizhi Zheng Yujie Lu Wanrong Zhu ... Zhengyuan Yang Kevin Lin William Yang Wang Lijuan Wang Xin Eric Wang |
LVBench: An Extreme Long Video Understanding Benchmark Weihan Wang Zehai He Wenyi Hong Yean Cheng Xiaohan Zhang ... Shiyu Huang Bin Xu Yuxiao Dong Ming Ding Jie Tang |
Needle In A Multimodal Haystack Weiyun Wang Shuibo Zhang Yiming Ren Yuchen Duan Tiantong Li ... Ping Luo Yu Qiao Jifeng Dai Wenqi Shao Wenhai Wang |