Title |
---|
![]() MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? Yi-Fan Zhang Huanyu Zhang Haochen Tian Chaoyou Fu Shuangqing Zhang ...Qingsong Wen Zhang Zhang L. Wang Rong Jin Tieniu Tan |
![]() Crossing New Frontiers: Knowledge-Augmented Large Language Model
Prompting for Zero-Shot Text-Based De Novo Molecule Design Sakhinana Sagar Srinivas Venkataramana Runkana |
![]() AI-Assisted Generation of Difficult Math Questions Vedant Shah Dingli Yu Kaifeng Lyu Simon Park Nan Rosemary Ke ...Yoshua Bengio Sanjeev Arora Anirudh Goyal Sanjeev Arora Anirudh Goyal |
![]() Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs
and Topological Graphs Hao-Tien Lewis Chiang Zhuo Xu Zipeng Fu M. Jacob Tingnan Zhang ...Carolina Parada Chelsea Finn Peng Xu Sergey Levine Jie Tan |
![]() Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based
Speech Recognition Ye Bai Jingping Chen Jitong Chen Wei Chen Zhuo Chen ...Wanyi Zhang Yang Zhang Yawei Zhang Yijie Zheng Ming Zou |