ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.00213
  4. Cited By
Injecting New Knowledge into Large Language Models via Supervised
  Fine-Tuning
v1v2 (latest)

Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

30 March 2024
Nick Mecklenburg
Yiyou Lin
Xiaoxiao Li
Daniel Holstein
Leonardo Nunes
Sara Malvar
B. Silva
Ranveer Chandra
Vijay Aski
Pavan Kumar Reddy Yannam
Tolga Aktas
Todd Hendry
ArXiv (abs)PDFHTML

Papers citing "Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning"

30 / 30 papers shown
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
Pei-Fu Guo
Yun-Da Tsai
Chun-Chia Hsu
Kai-Xin Chen
Y. Tsai
Kai-Wei Chang
Nanyun Peng
Mi-Yen Yeh
Shou-De Lin
168
0
0
03 Nov 2025
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
Kailin Jiang
Hongbo Jiang
Ning Jiang
Zhi Gao
Jinhe Bi
Yuchen Ren
B. Li
Yuntao Du
L. J. Liu
Qing Li
CLLOffRLKELMVLM
219
1
0
22 Oct 2025
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
Xu Pan
Ely Hahami
Jingxuan Fan
Ziqian Xie
H. Sompolinsky
148
0
0
10 Oct 2025
Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
Towards EnergyGPT: A Large Language Model Specialized for the Energy Sector
Amal Chebbi
Babajide Kolade
116
1
0
08 Sep 2025
Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs
Being Kind Isn't Always Being Safe: Diagnosing Affective Hallucination in LLMs
Sewon Kim
Jiwon Kim
Seungwoo Shin
Hyejin Chung
Daeun Moon
Yejin Kwon
Hyunsoo Yoon
116
0
0
23 Aug 2025
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Bolei He
Xinran He
Run Shao
Shanfu Shu
Xianwei Xue
Mingquan Cheng
Haifeng Li
Zhenhua Ling
RALMLRM
249
1
0
21 Aug 2025
GeoGPT-RAG Technical Report
GeoGPT-RAG Technical Report
Fei Huang
Fan Wu
Zeqing Zhang
Qihao Wang
Long Zhang
Grant Michael Boquet
Hongyang Chen
VLM
156
0
0
18 Aug 2025
LMAR: Language Model Augmented Retriever for Domain-specific Knowledge Indexing
LMAR: Language Model Augmented Retriever for Domain-specific Knowledge Indexing
Yao Zhao
Yantian Ding
Zhiyue Zhang
Dapeng Yao
Yanxun Xu
RALM
311
1
0
04 Aug 2025
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Lu Ma
Hao Liang
Meiyi Qiang
Lexiang Tang
Xiaochen Ma
...
Chengyu Shen
Runming He
Bin Cui
Wentao Zhang
Wentao Zhang
ReLMOffRLLRM
270
39
0
09 Jun 2025
Data Doping or True Intelligence? Evaluating the Transferability of Injected Knowledge in LLMs
Data Doping or True Intelligence? Evaluating the Transferability of Injected Knowledge in LLMs
Essa Jan
Moiz Ali
Muhammad Saram Hassan
Fareed Zaffar
Yasir Zaki
KELM
153
1
0
22 May 2025
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment
Chenlin Ming
Chendi Qu
Mengzhang Cai
Qizhi Pei
Zhuoshi Pan
Yu Li
Xiaoming Duan
Lijun Wu
Bin Wang
197
2
0
19 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
Wenhao Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
292
4
0
18 May 2025
Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models
Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models
Xuhui Jiang
Shengjie Ma
Chengjin Xu
Cehao Yang
Liyu Zhang
Jian Guo
SyDa
395
3
0
02 May 2025
Memorization and Knowledge Injection in Gated LLMs
Memorization and Knowledge Injection in Gated LLMs
Xu Pan
Ely Hahami
Zechen Zhang
H. Sompolinsky
KELMCLLRALM
317
3
0
30 Apr 2025
InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation
InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory TransformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Bowen Cao
Deng Cai
W. Lam
CLL
408
3
0
02 Apr 2025
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yixin Ou
Yunzhi Yao
Ningyu Zhang
Hui Jin
Jiacheng Sun
Shumin Deng
Hao Sun
Ningyu Zhang
KELMCLL
340
12
0
16 Feb 2025
On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
On the Impact of Fine-Tuning on Chain-of-Thought ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Elita Lobo
Chirag Agarwal
Himabindu Lakkaraju
LRM
463
24
0
22 Nov 2024
On the Way to LLM Personalization: Learning to Remember User
  Conversations
On the Way to LLM Personalization: Learning to Remember User Conversations
Lucie Charlotte Magister
Katherine Metcalf
Yizhe Zhang
Maartje ter Hoeve
296
11
0
20 Nov 2024
Generative Adapter: Contextualizing Language Models in Parameters with A
  Single Forward Pass
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward PassInternational Conference on Learning Representations (ICLR), 2024
Tong Chen
Hao Fang
Patrick Xia
Xiaodong Liu
Benjamin Van Durme
Luke Zettlemoyer
Jianfeng Gao
Hao Cheng
KELM
320
7
0
08 Nov 2024
Transfer Learning for Finetuning Large Language Models
Transfer Learning for Finetuning Large Language Models
Tobias Strangmann
Lennart Purucker
Jörg Franke
Ivo Rapant
Fabio Ferreira
Katharina Eggensperger
225
4
0
02 Nov 2024
Learning and Unlearning of Fabricated Knowledge in Language Models
Learning and Unlearning of Fabricated Knowledge in Language Models
Chen Sun
Nolan Miller
A. Zhmoginov
Max Vladymyrov
Mark Sandler
KELMMU
241
4
0
29 Oct 2024
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Raviraj Joshi
Kanishk Singla
Anusha Kamath
Raunak Kalani
Rakesh Paul
Utkarsh Vaidya
Sanjay Singh Chauhan
Niranjan Wartikar
Eileen Long
SyDaCLL
376
22
0
18 Oct 2024
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple DomainsInternational Conference on Learning Representations (ICLR), 2024
Yein Park
Chanwoong Yoon
Jungwoo Park
Donghyeon Lee
Minbyul Jeong
Jaewoo Kang
KELM
473
3
0
13 Oct 2024
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and
  Injection for Enhancing Large Language Models
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiaxin Zhang
Wendi Cui
Yiran Huang
Kamalika Das
Sricharan Kumar
KELMSyDa
237
7
0
12 Oct 2024
Synthetic continued pretraining
Synthetic continued pretrainingInternational Conference on Learning Representations (ICLR), 2024
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLLSyDa
340
35
0
11 Sep 2024
DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large
  Language Models
DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models
Yuanhao Zeng
Fei Ren
Xinpeng Zhou
Yihang Wang
Yingxia Shao
ALM
195
0
0
19 Aug 2024
Structure-aware Domain Knowledge Injection for Large Language Models
Structure-aware Domain Knowledge Injection for Large Language Models
Kai-Chun Liu
Ze Chen
Zhihang Fu
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Yue-bo Wu
Yue Wu
Jieping Ye
161
0
0
23 Jul 2024
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-TeachingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiaoying Zhang
Baolin Peng
Ye Tian
Jingyan Zhou
Yipeng Zhang
Haitao Mi
Helen Meng
CLLKELM
436
12
0
10 Jun 2024
Perception of Knowledge Boundary for Large Language Models through
  Semi-open-ended Question Answering
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question AnsweringNeural Information Processing Systems (NeurIPS), 2024
Zhihua Wen
Zhiliang Tian
Z. Jian
Zhen Huang
Pei Ke
Yifu Gao
Shiyu Huang
Dongsheng Li
262
25
0
23 May 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
D. Yin
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
239
10
0
17 Feb 2024
1