arXiv: 2303.01421
Semiparametric Language Models Are Scalable Continual Learners
2 March 2023
Guangyue Peng
Tao Ge
Si-Qing Chen
Furu Wei
Houfeng Wang
KELM
Papers citing
"Semiparametric Language Models Are Scalable Continual Learners"
10 / 10 papers shown
Pre-training Limited Memory Language Models with Internal and External Knowledge
Linxi Zhao
Sofian Zalouk
Christian K. Belardi
Justin Lovelace
Jin Peng Zhou
Ryan Thomas Noonan
Dongyoung Go
Kilian Q. Weinberger
Yoav Artzi
Jennifer J. Sun
KELM
HILM
461
0
0
21 May 2025
A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhihao Wang
Shiyu Liu
Jianheng Huang
Zheng Wang
Yixuan Liao
Xiaoxin Chen
Junfeng Yao
Jinsong Su
263
2
0
05 Oct 2024
Collaborative Evolving Strategy for Automatic Data-Centric Development
Xu Yang
Haotian Chen
Wenjun Feng
Haoxue Wang
Zeqi Ye
Xinjie Shen
Xiao Yang
Shizhao Sun
Yuante Li
Jiang Bian
357
3
0
26 Jul 2024
Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Junhao Zheng
Shengjie Qiu
Qianli Ma
455
14
0
13 Dec 2023
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zeyuan Yang
Peng Li
Yang Liu
LRM
274
31
0
24 Oct 2023
Heterogenous Memory Augmented Neural Networks
Zihan Qiu
Zhen Liu
Shuicheng Yan
Shanghang Zhang
Jie Fu
235
0
0
17 Oct 2023
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Luiza Amador Pozzobon
Beyza Ermis
Patrick Lewis
Sara Hooker
321
29
0
11 Oct 2023
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zihan Zhang
Meng Fang
Lingxi Chen
Mohammad-Reza Namazi-Rad
Jun Wang
KELM
286
45
0
11 Oct 2023
In-context Autoencoder for Context Compression in a Large Language Model
International Conference on Learning Representations (ICLR), 2024
Tao Ge
Jing Hu
Lei Wang
Xun Wang
Si-Qing Chen
Furu Wei
RALM
500
137
0
13 Jul 2023
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
1.2K
4,864
0
28 Feb 2017