2404.00213
Cited By
v1
v2 (latest)
Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning
30 March 2024
Nick Mecklenburg
Yiyou Lin
Xiaoxiao Li
Daniel Holstein
Leonardo Nunes
Sara Malvar
B. Silva
Ranveer Chandra
Vijay Aski
Pavan Kumar Reddy Yannam
Tolga Aktas
Todd Hendry
Papers citing "Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning"
30 / 30 papers shown
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
Pei-Fu Guo
Yun-Da Tsai
Chun-Chia Hsu
Kai-Xin Chen
Y. Tsai
Kai-Wei Chang
Nanyun Peng
Mi-Yen Yeh
Shou-De Lin
168
0
0
03 Nov 2025
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
Kailin Jiang
Hongbo Jiang
Ning Jiang
Zhi Gao
Jinhe Bi
Yuchen Ren
B. Li
Yuntao Du
L. J. Liu
Qing Li
CLL
OffRL
KELM
VLM
219
1
0
22 Oct 2025
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Xu Pan
Ely Hahami
Jingxuan Fan
Ziqian Xie
H. Sompolinsky
148
0
0
10 Oct 2025
Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
Amal Chebbi
Babajide Kolade
116
1
0
08 Sep 2025
Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs
Sewon Kim
Jiwon Kim
Seungwoo Shin
Hyejin Chung
Daeun Moon
Yejin Kwon
Hyunsoo Yoon
116
0
0
23 Aug 2025
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Bolei He
Xinran He
Run Shao
Shanfu Shu
Xianwei Xue
Mingquan Cheng
Haifeng Li
Zhenhua Ling
RALM
LRM
249
1
0
21 Aug 2025
GeoGPT-RAG Technical Report
Fei Huang
Fan Wu
Zeqing Zhang
Qihao Wang
Long Zhang
Grant Michael Boquet
Hongyang Chen
VLM
156
0
0
18 Aug 2025
LMAR: Language Model Augmented Retriever for Domain-specific Knowledge Indexing
Yao Zhao
Yantian Ding
Zhiyue Zhang
Dapeng Yao
Yanxun Xu
RALM
311
1
0
04 Aug 2025
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Lu Ma
Hao Liang
Meiyi Qiang
Lexiang Tang
Xiaochen Ma
...
Chengyu Shen
Runming He
Bin Cui
Wentao Zhang
ReLM
OffRL
LRM
270
39
0
09 Jun 2025
Data Doping or True Intelligence? Evaluating the Transferability of Injected Knowledge in LLMs
Essa Jan
Moiz Ali
Muhammad Saram Hassan
Fareed Zaffar
Yasir Zaki
KELM
153
1
0
22 May 2025
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment
Chenlin Ming
Chendi Qu
Mengzhang Cai
Qizhi Pei
Zhuoshi Pan
Yu Li
Xiaoming Duan
Lijun Wu
Bin Wang
197
2
0
19 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
Wenhao Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
292
4
0
18 May 2025
Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models
Xuhui Jiang
Shengjie Ma
Chengjin Xu
Cehao Yang
Liyu Zhang
Jian Guo
SyDa
395
3
0
02 May 2025
Memorization and Knowledge Injection in Gated LLMs
Xu Pan
Ely Hahami
Zechen Zhang
H. Sompolinsky
KELM
CLL
RALM
317
3
0
30 Apr 2025
InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bowen Cao
Deng Cai
W. Lam
CLL
408
3
0
02 Apr 2025
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yixin Ou
Yunzhi Yao
Ningyu Zhang
Hui Jin
Jiacheng Sun
Shumin Deng
Hao Sun
Ningyu Zhang
KELM
CLL
340
12
0
16 Feb 2025
On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Elita Lobo
Chirag Agarwal
Himabindu Lakkaraju
LRM
463
24
0
22 Nov 2024
On the Way to LLM Personalization: Learning to Remember User Conversations
Lucie Charlotte Magister
Katherine Metcalf
Yizhe Zhang
Maartje ter Hoeve
296
11
0
20 Nov 2024
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
International Conference on Learning Representations (ICLR), 2024
Tong Chen
Hao Fang
Patrick Xia
Xiaodong Liu
Benjamin Van Durme
Luke Zettlemoyer
Jianfeng Gao
Hao Cheng
KELM
320
7
0
08 Nov 2024
Transfer Learning for Finetuning Large Language Models
Tobias Strangmann
Lennart Purucker
Jörg Franke
Ivo Rapant
Fabio Ferreira
Katharina Eggensperger
225
4
0
02 Nov 2024
Learning and Unlearning of Fabricated Knowledge in Language Models
Chen Sun
Nolan Miller
A. Zhmoginov
Max Vladymyrov
Mark Sandler
KELM
MU
241
4
0
29 Oct 2024
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Raviraj Joshi
Kanishk Singla
Anusha Kamath
Raunak Kalani
Rakesh Paul
Utkarsh Vaidya
Sanjay Singh Chauhan
Niranjan Wartikar
Eileen Long
SyDa
CLL
376
22
0
18 Oct 2024
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains
International Conference on Learning Representations (ICLR), 2024
Yein Park
Chanwoong Yoon
Jungwoo Park
Donghyeon Lee
Minbyul Jeong
Jaewoo Kang
KELM
473
3
0
13 Oct 2024
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiaxin Zhang
Wendi Cui
Yiran Huang
Kamalika Das
Sricharan Kumar
KELM
SyDa
237
7
0
12 Oct 2024
Synthetic continued pretraining
International Conference on Learning Representations (ICLR), 2024
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
340
35
0
11 Sep 2024
DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models
Yuanhao Zeng
Fei Ren
Xinpeng Zhou
Yihang Wang
Yingxia Shao
ALM
195
0
0
19 Aug 2024
Structure-aware Domain Knowledge Injection for Large Language Models
Kai-Chun Liu
Ze Chen
Zhihang Fu
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Yue-bo Wu
Yue Wu
Jieping Ye
161
0
0
23 Jul 2024
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiaoying Zhang
Baolin Peng
Ye Tian
Jingyan Zhou
Yipeng Zhang
Haitao Mi
Helen Meng
CLL
KELM
436
12
0
10 Jun 2024
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
Neural Information Processing Systems (NeurIPS), 2024
Zhihua Wen
Zhiliang Tian
Z. Jian
Zhen Huang
Pei Ke
Yifu Gao
Shiyu Huang
Dongsheng Li
262
25
0
23 May 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
D. Yin
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
239
10
0
17 Feb 2024