Title | Authors |
---|---|
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation | Chuanyang Zheng, Yihang Gao, Han Shi, Jing Xiong, Jiankai Sun, ... Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li |
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, Yan Wang, ... Qing Gu, Anh Tuan Luu, See-Kiong Ng, Zhiwei Jiang, Bryan Hooi |