AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023
Lichang Chen
Shiyang Li
Jun Yan
Hai Wang
Kalpa Gunaratna
Vikas Yadav
Zheng Tang
Vijay Srinivasan
Wanrong Zhu
Heng-Chiao Huang
Hongxia Jin
    ALM

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

50 / 189 papers shown
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Cehao Yang
Xueyuan Lin
Chengjin Xu
Xuhui Jiang
Xiaojun Wu
Honghao Liu
Hui Xiong
Jian Guo
LRM
307
3
0
24 Dec 2025
Towards Active Synthetic Data Generation for Finetuning Language Models
Samuel Kessler
Menglin Xia
Daniel Madrigal Diaz
Dongge Han
Helia Heshemi
Saravan Rajmohan
Victor Ruehle
Jordan T. Ash
SyDa
182
0
0
30 Nov 2025
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data
Sultan AlRashed
Chadi Helwe
Francesco Orabona
MoE
108
0
0
23 Nov 2025
Learning from the Undesirable: Robust Adaptation of Language Models without Forgetting
Yunhun Nam
Jaehyung Kim
Jongheon Jeong
118
0
0
17 Nov 2025
PrAda-GAN: A Private Adaptive Generative Adversarial Network with Bayes Network Structure
Ke Jia
Yuheng Ma
Yang Li
Feifei Wang
124
2
0
11 Nov 2025
Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains
P. Wang
Hongcheng Liu
Yusheng Liao
Ziqing Fan
Yaxin Du
Shuo Tang
Y. Wang
Y Samuel Wang
129
1
0
10 Nov 2025
LM-mixup: Text Data Augmentation via Language Model based Mixup
Zhijie Deng
Zhouan Shen
Ling Li
Yao Zhou
Zhaowei Zhu
Yanji He
Wei Wang
Jiaheng Wei
100
0
0
23 Oct 2025
See the Text: From Tokenization to Visual Reading
Ling Xing
Alex Jinpeng Wang
Rui Yan
Hongyu Qu
Zechao Li
Jinhui Tang
VLM
159
1
0
21 Oct 2025
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Cancheng Zhang
Xiangdong Zhang
Mingquan Feng
Jingzhi Wang
Junchi Yan
145
1
0
21 Oct 2025
Computational Budget Should Be Considered in Data Selection
Weilin Wan
Weizhong Zhang
Cheng Jin
205
0
0
19 Oct 2025
Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
Heming Zou
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
178
1
0
19 Oct 2025
Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning
Ling Zhang
Xianliang Yang
Juwon Yu
Park Cheonyoung
Lei Song
Jiang Bian
75
0
0
16 Oct 2025
Towards Understanding Valuable Preference Data for Large Language Model Alignment
Zizhuo Zhang
Qizhou Wang
Shanshan Ye
Jianing Zhu
Jiangchao Yao
Bo Han
Masashi Sugiyama
112
0
0
15 Oct 2025
The Harder The Better: Maintaining Supervised Fine-tuning Generalization with Less but Harder Data
Zhaoyang Shang
Sibo Wei
Jianbin Guo
Rui Zhou
Lifeng Dong
Yin Luo
ALM
81
0
0
14 Oct 2025
Evolution of Meta's LLaMA Models and Parameter-Efficient Fine-Tuning of Large Language Models: A Survey
Abdulhady Abas Abdullah
Arkaitz Zubiaga
Seyedali Mirjalili
Amir Gandomi
Fatemeh Daneshfar
Mohammadsadra Amini
Alan Salam Mohammed
Hadi Veisi
ALM
193
0
0
14 Oct 2025
CoIDO: Efficient Data Selection for Visual Instruction Tuning via Coupled Importance-Diversity Optimization
Yichen Yan
Ming Zhong
Qi Zhu
Xiaoling Gu
Jinpeng Chen
Huan Li
119
0
0
11 Oct 2025
TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Chanjoo Jung
Jaehyung Kim
151
0
0
06 Oct 2025
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning
Ziyan Wang
Zheng Wang
Jie Fu
Xingwei Qu
Qi Cheng
Shengpu Tang
Minjia Zhang
Xiaoming Huo
LRM
244
1
0
05 Oct 2025
Increasing LLM response trustworthiness using voting ensembles
Aparna Nair-Kanneganti
Trevor J. Chan
Shir Goldfinger
Emily Mackay
Brian Anthony
Alison M. Pouch
142
0
0
05 Oct 2025
Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Nilay Naharas
Dang Nguyen
Nesihan Bulut
M. Bateni
Vahab Mirrokni
Baharan Mirzasoleiman
104
0
0
01 Oct 2025
Large-Scale Constraint Generation - Can LLMs Parse Hundreds of Constraints?
Matteo Boffa
Jiaxuan You
179
0
0
28 Sep 2025
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning
Shaobo Wang
Jiaming Wang
Jiajun Zhang
C. Wang
Yue Min
...
Fei Huang
Huiqiang Jiang
Junyang Lin
Dayiheng Liu
Linfeng Zhang
147
5
0
28 Sep 2025
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
Min-Hsuan Yeh
Yixuan Li
191
1
0
28 Sep 2025
TsqLoRA: Towards Sensitivity and Quality Low-Rank Adaptation for Efficient Fine-Tuning
Yu Chen
Yifei Han
Long Zhang
Yue Du
Bin Li
168
0
0
23 Sep 2025
A method for improving multilingual quality and diversity of instruction fine-tuning datasets
Chunguang Zhao
Yilun Liu
Pufan Zeng
Yuanchang Luo
Shimin Tao
...
Chen Liu
Hongxia Ma
Li Zhang
Boxing Chen
Daimeng Wei
109
0
0
19 Sep 2025
Generating High-Quality Datasets for Code Editing via Open-Source Language Models
Zekai Zhang
Xin Peng
Z. Chen
Linxi Liang
Yuxuan Chen
Guangsheng Ou
Yanlin Wang
Dan Li
Xin Peng
Zibin Zheng
SyDa
201
0
0
19 Sep 2025
Teaching According to Talents! Instruction Tuning LLMs with Competence-Aware Curriculum Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Yangning Li
Tingwei Lu
Yinghui Li
Yankai Chen
Wei-Chieh Huang
Wenhao Jiang
Hui Wang
Hai-Tao Zheng
Philip S. Yu
234
0
0
17 Sep 2025
DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression
Wei Huang
Huang Wei
Yinggui Wang
213
0
0
01 Sep 2025
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang
Xin Gao
Qizhi Pei
Zhuoshi Pan
Mengzhang Cai
Jiang Wu
Conghui He
Lijun Wu
SyDa
324
2
0
29 Aug 2025
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation
Harold Haodong Chen
Haojian Huang
Qifeng Chen
Harry Yang
Ser-Nam Lim
DiffM VGen
125
10
0
14 Aug 2025
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Ping Yu
Jack Lanchantin
Tianlu Wang
Weizhe Yuan
O. Yu. Golovneva
I. Kulikov
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
SyDa ReLM LRM
286
12
0
31 Jul 2025
Trust the Model: Compact VLMs as In-Context Judges for Image-Text Data Quality
Daulet Toibazar
Kesen Wang
Sherif Mohamed
Abdulaziz Al-Badawi
Abdulrahman Alfulayt
Pedro J. Moreno
VLM
194
0
0
27 Jul 2025
RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models
Yeongtak Oh
J. Mok
Juhyeon Shin
Juhyeon Shin
Sangha Park
J. Mok
Sungroh Yoon
VLM
403
1
0
23 Jun 2025
FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE
Khiem Le
Tuan V. Tran
Ting Hua
Nitesh Chawla
MoE
270
0
0
19 Jun 2025
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
Zixuan Huang
Yikun Ban
Lean Fu
Xiaojie Li
Zhongxiang Dai
Jianxin Li
Deqing Wang
335
1
0
08 Jun 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
Jiarui Jin
Yuwei Wu
Haoxuan Li
Xiaoting He
Weinan Zhang
Y. Yang
Yong Yu
Jun Wang
Mengyue Yang
282
2
0
06 Jun 2025
Understanding the Impact of Sampling Quality in Direct Preference Optimization
Kyung Rok Kim
Yumo Bai
Chonghuan Wang
Guanting Chen
276
0
0
03 Jun 2025
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
Yanjun Fu
Faisal Hamman
Sanghamitra Dutta
ALM
343
6
0
02 Jun 2025
Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning
Qihuang Zhong
Liang Ding
Fei Liao
Juhua Liu
Bo Du
Dacheng Tao
242
0
0
28 May 2025
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan
Vincent Cohen-Addad
Dan Alistarh
Vahab Mirrokni
TDI
329
4
0
25 May 2025
Not All Documents Are What You Need for Extracting Instruction Tuning Data
Chi Zhang
Huaping Zhong
Hongtao Li
Chengliang Chai
Jiawei Hong
...
Jiantao Qiu
Ye Yuan
Guoren Wang
Bin Wang
Lei Cao
SyDa
236
0
0
18 May 2025
Large Language Models for Computer-Aided Design: A Survey
Licheng Zhang
Bach Le
Naveed Akhtar
Siew-Kei Lam
Tuan Ngo
3DV AI4CE
390
9
0
13 May 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
Liu Liu
...
Jianfeng Gao
Weizhu Chen
Shuaiqiang Wang
Simon Shaolei Du
Haoran Pan
OffRL ReLM LRM
812
160
0
29 Apr 2025
Data-efficient LLM Fine-tuning for Code Generation
Weijie Lv
X. Xia
Sheng-Jun Huang
ALM SyDa
176
4
0
17 Apr 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
Xinze Wang
Zhiyong Yang
Chao Feng
Hongjin Lu
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
Furong Huang
Lijuan Wang
OODD ReLM LRM VLM
591
73
0
10 Apr 2025
Adversarial Training of Reward Models
Alexander Bukharin
Haifeng Qian
Shengyang Sun
Adithya Renduchintala
Soumye Singhal
Liang Luo
Oleksii Kuchaiev
Olivier Delalleau
T. Zhao
AAML
438
6
0
08 Apr 2025
CONGRAD: Conflicting Gradient Filtering for Multilingual Preference Alignment
Jiangnan Li
Thuy-Trang Vu
Christian Herold
Amirhossein Tebbifakhr
Shahram Khadivi
Gholamreza Haffari
444
0
0
31 Mar 2025
MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Jinfa Huang
Jie Lou
Debing Zhang
Rongrong Ji
501
6
0
26 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Hui Yuan
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
325
5
0
04 Mar 2025
Advancing MAPF towards the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)
Jingtian Yan
Zhifei Li
William Kang
Kevin Zheng
Yulun Zhang
Zhe Chen
Yue Zhang
Daniel Harabor
Stephen Smith
Jiaoyang Li
417
1
0
03 Mar 2025