Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.12423
Cited By
PSC: Extending Context Window of Large Language Models via Phase Shift Calibration
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
18 May 2025
Wenqiao Zhu
Chao Xu
Lulu Wang
Jun Wu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PSC: Extending Context Window of Large Language Models via Phase Shift Calibration"
21 / 21 papers shown
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning
Wenqiao Zhu
Ji Liu
Rongjuncheng Zhang
Haipang Wu
Yulun Zhang
OffRL
LRM
211
1
0
21 Aug 2025
SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenqiao Zhu
Ji Liu
Lulu Wang
Jun Wu
Yulun Zhang
372
2
0
18 May 2025
LoRA Learns Less and Forgets Less
D. Biderman
Jose Javier Gonzalez Ortiz
Jacob P. Portes
Mansheej Paul
Philip Greengard
...
Sam Havens
Vitaliy Chiley
Jonathan Frankle
Cody Blakeney
John P. Cunningham
CLL
344
230
0
15 May 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Yiran Ding
Li Lyna Zhang
Chengruidong Zhang
Yuanyuan Xu
Ning Shang
Jiahang Xu
Fan Yang
Mao Yang
RALM
228
261
0
21 Feb 2024
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM
ALM
464
2,786
0
24 Aug 2023
L-Eval: Instituting Standardized Evaluation for Long Context Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chen An
Shansan Gong
Ming Zhong
Xingjian Zhao
Mukai Li
Jun Zhang
Lingpeng Kong
Xipeng Qiu
ELM
ALM
468
202
0
20 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
8.3K
15,302
0
18 Jul 2023
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
International Conference on Learning Representations (ICLR), 2023
Tri Dao
LRM
430
2,070
0
17 Jul 2023
Extending Context Window of Large Language Models via Positional Interpolation
Shouyuan Chen
Sherman Wong
Liangjian Chen
Yuandong Tian
436
684
0
27 Jun 2023
Landmark Attention: Random-Access Infinite Context Length for Transformers
Neural Information Processing Systems (NeurIPS), 2023
Amirkeivan Mohtashami
Martin Jaggi
LLMAG
323
195
0
25 May 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
6.8K
17,868
0
27 Feb 2023
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Neural Information Processing Systems (NeurIPS), 2022
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
845
3,353
0
27 May 2022
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Stephanie C. Lin
Jacob Hilton
Owain Evans
HILM
1.6K
2,692
0
08 Sep 2021
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
International Conference on Learning Representations (ICLR), 2021
Ofir Press
Noah A. Smith
M. Lewis
835
1,010
0
27 Aug 2021
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
842
4,005
0
20 Apr 2021
Measuring Massive Multitask Language Understanding
International Conference on Learning Representations (ICLR), 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM
RALM
2.3K
6,617
0
07 Sep 2020
Compressive Transformers for Long-Range Sequence Modelling
International Conference on Learning Representations (ICLR), 2019
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
297
774
0
13 Nov 2019
HellaSwag: Can a Machine Really Finish Your Sentence?
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
638
3,460
0
19 May 2019
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
987
3,777
0
14 Mar 2018
Attention Is All You Need
Neural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
4.2K
162,388
0
12 Jun 2017
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
3.7K
217,813
0
10 Dec 2015
1