Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2211.00635
Cited By
v1
v2
v3 (latest)
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
International Conference on Learning Representations (ICLR), 2022
1 November 2022
Yihan Wang
Si Si
Daliang Li
Michal Lukasik
Felix X. Yu
Cho-Jui Hsieh
Inderjit S Dhillon
Sanjiv Kumar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Two-stage LLM Fine-tuning with Less Specialization and More Generalization"
24 / 24 papers shown
Title
SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification
Xiangyu Li
Zhaomiao Guo
83
0
0
18 Nov 2025
MedPAO: A Protocol-Driven Agent for Structuring Medical Reports
Shrish Shrinath Vaidya
Gowthamaan Palani
Sidharth Ramesh
Velmurugan Balasubramanian
Minmini Selvam
Gokulraja Srinivasaraja
Ganapathy Krishnamurthi
LM&MA
76
0
0
06 Oct 2025
Correct Reasoning Paths Visit Shared Decision Pivots
Dongkyu Cho
Amy B.Z. Zhang
Bilel Fehri
Sheng Wang
Rumi Chunara
R. Song
Hengrui Cai
LRM
156
0
0
25 Sep 2025
RL Fine-Tuning Heals OOD Forgetting in SFT
Hangzhan Jin
Sitao Luan
Sicheng Lyu
Guillaume Rabusseau
Reihaneh Rabbany
Doina Precup
Mohammad Hamdaqa
CLL
LRM
131
2
0
08 Sep 2025
Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions
Kun Zhang
Le Wu
Kui Yu
Guangyi Lv
Dacao Zhang
AAML
ELM
274
1
0
08 Jun 2025
ReqBrain: Task-Specific Instruction Tuning of LLMs for AI-Assisted Requirements Generation
Mohammad Kasra Habib
Daniel Graziotin
Stefan Wagner
306
0
0
23 May 2025
HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models
Zheng Lin
Yuxin Zhang
Zhe Chen
Zihan Fang
Xianhao Chen
Praneeth Vepakomma
Wei Ni
Jun Luo
Yue Gao
MoE
312
15
0
05 May 2025
ML For Hardware Design Interpretability: Challenges and Opportunities
Raymond Baartmans
Andrew Ensinger
Victor Agostinelli
Lizhong Chen
151
0
0
11 Apr 2025
Boosting LLM-based Relevance Modeling with Distribution-Aware Robust Learning
International Conference on Information and Knowledge Management (CIKM), 2024
Hong Liu
Saisai Gong
Yixin Ji
Hai Ye
Jia Xu
Jinjie Gu
313
5
0
17 Dec 2024
Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning
Spoken Language Technology Workshop (SLT), 2024
Yingyi Ma
Zhe Liu
Ozlem Kalinli
315
1
0
09 Dec 2024
Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Donghoon Kim
Gusang Lee
Kyuhong Shim
B. Shim
242
5
0
29 Oct 2024
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations
International Conference on Learning Representations (ICLR), 2024
Litu Rout
Yujia Chen
Nataniel Ruiz
Constantine Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
172
0
0
14 Oct 2024
DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models
Yuanhao Zeng
Fei Ren
Xinpeng Zhou
Yihang Wang
Yingxia Shao
ALM
171
0
0
19 Aug 2024
PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning
Jiaru Zou
Mengyu Zhou
Tao Li
Shi Han
Dongmei Zhang
183
18
0
02 Jul 2024
Large Scale Transfer Learning for Tabular Data via Language Modeling
Josh Gardner
Juan C. Perdomo
Ludwig Schmidt
LMTD
178
45
0
17 Jun 2024
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs
Chen Zheng
Ke Sun
Xun Zhou
MoE
146
1
0
12 Jun 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
Shuyang Jiang
Yusheng Liao
Ya Zhang
Yu Wang
Yanfeng Wang
171
7
0
30 May 2024
Unveiling the Generalization Power of Fine-Tuned Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Haoran Yang
Yumeng Zhang
Jiaqi Xu
Hongyuan Lu
Pheng Ann Heng
Wai Lam
270
53
0
14 Mar 2024
Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF
Chen Zheng
Ke Sun
Hang Wu
Chenguang Xi
Xun Zhou
173
15
0
04 Mar 2024
Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Jianheng Huang
Leyang Cui
Ante Wang
Chengyi Yang
Xinting Liao
Linfeng Song
Junfeng Yao
Jinsong Su
KELM
CLL
196
78
0
02 Mar 2024
Aalap: AI Assistant for Legal & Paralegal Functions in India
Aman Tiwari
Prathamesh Kalamkar
Atreyo Banerjee
S. Karn
V. Hemachandran
Smita Gupta
AILaw
ELM
VLM
184
2
0
30 Jan 2024
The ART of LLM Refinement: Ask, Refine, and Trust
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Kumar Shridhar
Koustuv Sinha
Andrew Cohen
Tianlu Wang
Ping Yu
Ramakanth Pasunuru
Mrinmaya Sachan
Jason Weston
Asli Celikyilmaz
LLMAG
ReLM
LRM
175
31
0
14 Nov 2023
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Bo Pan
ALM
320
205
0
30 May 2023
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Yunhu Ye
Binyuan Hui
Min Yang
Binhua Li
Fei Huang
Yongbin Li
LMTD
ReLM
LRM
259
211
0
31 Jan 2023
1