Title |
---|
![]() Focus Anywhere for Fine-grained Multi-page Document Understanding Chenglong Liu Haoran Wei Jinyue Chen Lingyu Kong Zheng Ge Zining Zhu Liang Zhao Jian‐Yuan Sun Chunrui Han Xiangyu Zhang |
![]() OneChart: Purify the Chart Structural Extraction via One Auxiliary Token Jinyue Chen Lingyu Kong Haoran Wei Chenglong Liu Zheng Ge Liang Zhao Jian‐Yuan Sun Chunrui Han Xiangyu Zhang |
![]() Merlin:Empowering Multimodal LLMs with Foresight Minds En Yu Liang Zhao Yana Wei Jinrong Yang Dongming Wu ...Haoran Wei Tiancai Wang Zheng Ge Xiangyu Zhang Wenbing Tao |
![]() DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong Chunrui Han Yuang Peng Zekun Qi Zheng Ge ...Hao-Ran Wei Xiangwen Kong Xiangyu Zhang Kaisheng Ma Li Yi |