Title |
---|
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation Chuanyang Zheng Yihang Gao Han Shi Jing Xiong Jiankai Sun ...Xiaozhe Ren Michael Ng Xin Jiang Zhenguo Li Yu Li |
LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Duy M. H. Nguyen N. T. Diep Trung Q. Nguyen Hoang-Bao Le Tai Nguyen ...Pengtao Xie Roger Wattenhofer James Zhou Daniel Sonntag Mathias Niepert |
Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation Nithin Rao Koluguri Travis M. Bartley Hainan Xu Oleksii Hrinchuk Jagadeesh Balam Boris Ginsburg Georg Kucsko |
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models Zhiyuan Hu Yuliang Liu Jinman Zhao Suyuchen Wang Yan Wang ...Qing Gu Anh Tuan Luu See-Kiong Ng Zhiwei Jiang Bryan Hooi |
Qwen2 Technical Report An Yang Baosong Yang Binyuan Hui Bo Zheng Bowen Yu ...Yuqiong Liu Zeyu Cui Zhenru Zhang Zhifang Guo Zhi-Wei Fan |