Title |
---|
Long Context Transfer from Language to Vision. Peiyuan Zhang, Kaichen Zhang, Bo Li, Guangtao Zeng, Jingkang Yang, Yuanhan Zhang, Ziyue Wang, Haoran Tan, Chunyuan Li, Ziwei Liu |
Towards Event-oriented Long Video Understanding. Yifan Du, Kun Zhou, Yuqi Huo, Yifan Li, Wayne Xin Zhao, Haoyu Lu, Zijia Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen |
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos. Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, ..., Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang |