AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023
Lichang Chen
Shiyang Li
Jun Yan
Hai Wang
Kalpa Gunaratna
Vikas Yadav
Zheng Tang
Vijay Srinivasan
Wanrong Zhu
Heng-Chiao Huang
Hongxia Jin
    ALM
ArXiv (abs) · PDF · HTML · HuggingFace (23 upvotes)

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

50 / 174 papers shown
Title
See the Text: From Tokenization to Visual Reading
Ling Xing
Alex Jinpeng Wang
Rui Yan
Hongyu Qu
Zechao Li
Jinhui Tang
VLM
88
0
0
21 Oct 2025
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Cancheng Zhang
Xiangdong Zhang
Mingquan Feng
Jingzhi Wang
Junchi Yan
94
0
0
21 Oct 2025
Computational Budget Should Be Considered in Data Selection
Weilin Wan
Weizhong Zhang
Cheng Jin
115
0
0
19 Oct 2025
Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning
Heming Zou
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
137
1
0
19 Oct 2025
Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning
Ling Zhang
Xianliang Yang
Juwon Yu
Park Cheonyoung
Lei Song
Jiang Bian
52
0
0
16 Oct 2025
CoIDO: Efficient Data Selection for Visual Instruction Tuning via Coupled Importance-Diversity Optimization
Yichen Yan
Ming Zhong
Qi Zhu
Xiaoling Gu
Jinpeng Chen
Huan Li
81
0
0
11 Oct 2025
TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Chanjoo Jung
Jaehyung Kim
76
0
0
06 Oct 2025
Increasing LLM response trustworthiness using voting ensembles
Aparna Nair-Kanneganti
Trevor J. Chan
Shir Goldfinger
Emily Mackay
Brian Anthony
Alison M. Pouch
89
0
0
05 Oct 2025
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning
Ziyan Wang
Zheng Wang
Jie Fu
Xingwei Qu
Qi Cheng
Shengpu Tang
Minjia Zhang
Xiaoming Huo
LRM
180
0
0
05 Oct 2025
Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
Nilay Naharas
Dang Nguyen
Nesihan Bulut
M. Bateni
Vahab Mirrokni
Baharan Mirzasoleiman
80
0
0
01 Oct 2025
Large-Scale Constraint Generation - Can LLMs Parse Hundreds of Constraints?
Matteo Boffa
Jiaxuan You
92
0
0
28 Sep 2025
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
Min-Hsuan Yeh
Yixuan Li
119
1
0
28 Sep 2025
A method for improving multilingual quality and diversity of instruction fine-tuning datasets
Chunguang Zhao
Yilun Liu
Pufan Zeng
Yuanchang Luo
Shimin Tao
...
Chen Liu
Hongxia Ma
Li Zhang
Boxing Chen
Daimeng Wei
68
0
0
19 Sep 2025
Generating High-Quality Datasets for Code Editing via Open-Source Language Models
Zekai Zhang
Mingwei Liu
Z. Chen
Linxi Liang
Yuxuan Chen
Guangsheng Ou
Yanlin Wang
Dan Li
Xin Peng
Zibin Zheng
SyDa
134
0
0
19 Sep 2025
Teaching According to Talents! Instruction Tuning LLMs with Competence-Aware Curriculum Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Yangning Li
Tingwei Lu
Yinghui Li
Yankai Chen
Wei-Chieh Huang
Wenhao Jiang
Hui Wang
Hai-Tao Zheng
Philip S. Yu
170
0
0
17 Sep 2025
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang
Xin Gao
Qizhi Pei
Zhuoshi Pan
Mengzhang Cai
Jiang Wu
Conghui He
Lijun Wu
SyDa
229
0
0
29 Aug 2025
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation
Harold Haodong Chen
Haojian Huang
Qifeng Chen
Harry Yang
Ser-Nam Lim
DiffM VGen
81
8
0
14 Aug 2025
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
Ping Yu
Jack Lanchantin
Tianlu Wang
Weizhe Yuan
O. Yu. Golovneva
I. Kulikov
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
SyDa ReLM LRM
203
10
0
31 Jul 2025
Trust the Model: Compact VLMs as In-Context Judges for Image-Text Data Quality
Daulet Toibazar
Kesen Wang
Sherif Mohamed
Abdulaziz Al-Badawi
Abdulrahman Alfulayt
Pedro J. Moreno
VLM
151
0
0
27 Jul 2025
RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models
Yeongtak Oh
J. Mok
Juhyeon Shin
Sangha Park
J. Mok
Sungroh Yoon
VLM
210
1
0
23 Jun 2025
FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE
Khiem Le
Tuan V. Tran
Ting Hua
Nitesh Chawla
MoE
193
0
0
19 Jun 2025
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
Zixuan Huang
Yikun Ban
Lean Fu
Xiaojie Li
Zhongxiang Dai
Jianxin Li
Deqing Wang
203
2
0
08 Jun 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
Jiarui Jin
Yuwei Wu
Haoxuan Li
Xiaoting He
Weinan Zhang
Y. Yang
Yong Yu
Jun Wang
Mengyue Yang
227
2
0
06 Jun 2025
Understanding the Impact of Sampling Quality in Direct Preference Optimization
Kyung Rok Kim
Yumo Bai
Chonghuan Wang
Guanting Chen
168
0
0
03 Jun 2025
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
Yanjun Fu
Faisal Hamman
Sanghamitra Dutta
ALM
251
6
0
02 Jun 2025
Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning
Qihuang Zhong
Liang Ding
Fei Liao
Juhua Liu
Bo Du
Dacheng Tao
169
0
0
28 May 2025
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan
Vincent Cohen-Addad
Dan Alistarh
Vahab Mirrokni
TDI
277
3
0
25 May 2025
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Cehao Yang
Xueyuan Lin
Chengjin Xu
Xuhui Jiang
Xiaojun Wu
Honghao Liu
Hui Xiong
Jian Guo
LRM
219
1
0
22 May 2025
Not All Documents Are What You Need for Extracting Instruction Tuning Data
Chi Zhang
Huaping Zhong
Hongtao Li
Chengliang Chai
Jiawei Hong
...
Jiantao Qiu
Ye Yuan
Guoren Wang
Bin Wang
Lei Cao
SyDa
185
0
0
18 May 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
Liu Liu
...
Jianfeng Gao
Weizhu Chen
Shuaiqiang Wang
Simon Shaolei Du
Haoran Pan
OffRL ReLM LRM
648
145
0
29 Apr 2025
Data-efficient LLM Fine-tuning for Code Generation
Weijie Lv
X. Xia
Sheng-Jun Huang
ALM SyDa
142
2
0
17 Apr 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
Xinze Wang
Zhiyong Yang
Chao Feng
Hongjin Lu
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
Furong Huang
Lijuan Wang
OODD ReLM LRM VLM
518
62
0
10 Apr 2025
Adversarial Training of Reward Models
Alexander Bukharin
Haifeng Qian
Shengyang Sun
Adithya Renduchintala
Soumye Singhal
Liang Luo
Oleksii Kuchaiev
Olivier Delalleau
T. Zhao
AAML
373
4
0
08 Apr 2025
CONGRAD: Conflicting Gradient Filtering for Multilingual Preference Alignment
Jiangnan Li
Thuy-Trang Vu
Christian Herold
Amirhossein Tebbifakhr
Shahram Khadivi
Gholamreza Haffari
359
0
0
31 Mar 2025
MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Jinfa Huang
Jie Lou
Debing Zhang
Rongrong Ji
403
6
0
26 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Hui Yuan
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
238
2
0
04 Mar 2025
Large-Scale Data Selection for Instruction Tuning
Michal Guerquin
Muru Zhang
Faeze Brahman
Pang Wei Koh
Pradeep Dasigi
ALM
283
12
0
03 Mar 2025
Advancing MAPF towards the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)
Jingtian Yan
Zhifei Li
William Kang
Kevin Zheng
Yulun Zhang
Zhe Chen
Yue Zhang
Daniel Harabor
Stephen Smith
Jiaoyang Li
289
4
0
03 Mar 2025
MathClean: A Benchmark for Synthetic Mathematical Data Cleaning
Hao Liang
Meiyi Qiang
Yongbin Li
Zefeng He
Yongzhen Guo
Z. Zhu
Wentao Zhang
Tengjiao Wang
157
4
0
26 Feb 2025
Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning
Hongyi Cai
Jie Li
Mohammad Mahdinur Rahman
Wenzhen Dong
331
0
0
26 Feb 2025
MergeIT: From Selection to Merging for Efficient Instruction Tuning
Hongyi Cai
Yuqian Fu
Hongming Fu
Bo Zhao
MoMe
224
0
0
25 Feb 2025
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
Hantao Lou
Changye Li
Yalan Qin
Yaodong Yang
286
5
0
22 Feb 2025
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Yunxiao Zhang
Guanming Xiong
Haochen Li
Wen Zhao
LLMAG
180
2
0
18 Feb 2025
InsBank: Evolving Instruction Subset for Ongoing Alignment
Jiayi Shi
Yiwei Li
Shaoxiong Feng
Peiwen Yuan
Xiaobei Wang
...
Chuyi Tan
Boyuan Pan
Huan Ren
Yao Hu
Kan Li
ALM
274
0
0
17 Feb 2025
Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization
Yongcheng Zeng
Xinyu Cui
Xuanfa Jin
Guoqing Liu
...
Ning Yang
Jun Wang
Jianye Hao
Haifeng Zhang
Jun Wang
LLMAG LRM
294
1
0
08 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
462
19
0
06 Feb 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li
Zhilin Wang
Tingchen Fu
Ganqu Cui
Sen Yang
Yu Cheng
223
7
0
21 Jan 2025
Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces
Amirreza Payandeh
Daeun Song
Mohammad Nazeri
Jing Liang
Praneel Mukherjee
Amir Hossain Raj
Yangzhe Kong
Dinesh Manocha
Xuesu Xiao
LM&Ro LRM
403
12
0
17 Jan 2025
CDS: Knowledge Component-Driven Data Synthesis Guided by Cognitive Diagnosis Theory
Haokun Zhao
Jinyi Han
Jiaqing Liang
Yanghua Xiao
Xiaojun Meng
Jiansheng Wei
316
0
0
13 Jan 2025
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
S. Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
MLLM VLM
338
1
0
07 Jan 2025