2308.10882
Giraffe: Adventures in Expanding Context Lengths in LLMs
21 August 2023
Arka Pal
Deep Karkhanis
Manley Roberts
Samuel Dooley
Arvind Sundararajan
Siddartha Naidu
Papers citing
"Giraffe: Adventures in Expanding Context Lengths in LLMs"
11 / 11 papers shown
Taxonomy-Guided Zero-Shot Recommendations with LLMs
Yueqing Liang
Liangwei Yang
Chen Wang
Xiongxiao Xu
Philip S. Yu
Kai Shu
72
6
0
21 Feb 2025
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
Cheng Luo
Zefan Cai
Hanshi Sun
Jinqi Xiao
Bo Yuan
Wen Xiao
Junjie Hu
Jiawei Zhao
Beidi Chen
Anima Anandkumar
69
1
0
18 Feb 2025
Large Language Models for In-File Vulnerability Localization Can Be "Lost in the End"
Francesco Sovrano
Adam Bauer
Alberto Bacchelli
54
0
0
09 Feb 2025
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
Wei Wu
Zhuoshi Pan
Chao Wang
L. Chen
Y. Bai
Kun Fu
Zehua Wang
Hui Xiong
LLMAG
36
5
0
05 Nov 2024
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
Minzheng Wang
Longze Chen
Cheng Fu
Shengyi Liao
Xinghua Zhang
...
Run Luo
Yunshui Li
Min Yang
Fei Huang
Yongbin Li
RALM
54
44
0
25 Jun 2024
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
Longze Chen
Ziqiang Liu
Wanwei He
Yunshui Li
Run Luo
Min Yang
42
9
0
28 May 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
Zefan Wang
Zichuan Liu
Yingying Zhang
Aoxiao Zhong
Lunting Fan
Lingfei Wu
Qingsong Wen
41
24
0
25 Oct 2023
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model
Nolan Dey
Daria Soboleva
Faisal Al-Khateeb
Bowen Yang
Ribhu Pathria
...
Robert Myers
Jacob Robert Steeves
Natalia Vassilieva
Marvin Tom
Joel Hestness
MoE
24
14
0
20 Sep 2023
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
253
695
0
27 Aug 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
279
1,996
0
31 Dec 2020