Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.06951
Cited By
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
13 January 2024
Jiaheng Liu
Zhiqi Bai
Yuanxing Zhang
Chenchen Zhang
Yu Zhang
Ge Zhang
Jiakai Wang
Haoran Que
Yukang Chen
Wenbo Su
Tiezheng Ge
Jie Fu
Wenhu Chen
Bo Zheng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"E^2-LLM: Efficient and Extreme Length Extension of Large Language Models"
6 / 6 papers shown
Title
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
53
16
0
14 Oct 2024
How to Train Long-Context Language Models (Effectively)
Tianyu Gao
Alexander Wettig
Howard Yen
Danqi Chen
RALM
55
36
0
03 Oct 2024
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Ziyan Jiang
Xueguang Ma
Wenhu Chen
RALM
24
47
0
21 Jun 2024
OWL: A Large Language Model for IT Operations
Hongcheng Guo
Jian Yang
Jiaheng Liu
Liqun Yang
Linzheng Chai
...
Tieqiao Zheng
Liangfan Zheng
Bo-Wen Zhang
Ke Xu
Zhoujun Li
VLM
57
40
0
17 Sep 2023
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
234
690
0
27 Aug 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
236
1,508
0
31 Dec 2020
1