AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023
Lichang Chen
Shiyang Li
Jun Yan
Hai Wang
Kalpa Gunaratna
Vikas Yadav
Zheng Tang
Vijay Srinivasan
Wanrong Zhu
Heng-Chiao Huang
Hongxia Jin
    ALM

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

50 / 189 papers shown
Advancing MAPF towards the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)
Jingtian Yan
Zhifei Li
William Kang
Kevin Zheng
Yulun Zhang
Zhe Chen
Yue Zhang
Daniel Harabor
Stephen Smith
Jiaoyang Li
426
1
0
03 Mar 2025
MathClean: A Benchmark for Synthetic Mathematical Data Cleaning
Hao Liang
Meiyi Qiang
Yongbin Li
Zefeng He
Yongzhen Guo
Z. Zhu
Wentao Zhang
Tengjiao Wang
233
4
0
26 Feb 2025
Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning
Hongyi Cai
Jie Li
Mohammad Mahdinur Rahman
Wenzhen Dong
412
0
0
26 Feb 2025
MergeIT: From Selection to Merging for Efficient Instruction Tuning
Hongyi Cai
Yuqian Fu
Hongming Fu
Bo Zhao
MoMe
333
0
0
25 Feb 2025
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
Hantao Lou
Changye Li
Yalan Qin
Yaodong Yang
364
6
0
22 Feb 2025
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Yunxiao Zhang
Guanming Xiong
Haochen Li
Wen Zhao
LLMAG
218
2
0
18 Feb 2025
InsBank: Evolving Instruction Subset for Ongoing Alignment
Jiayi Shi
Yiwei Li
Shaoxiong Feng
Peiwen Yuan
Xiaobei Wang
...
Chuyi Tan
Boyuan Pan
Huan Ren
Yao Hu
Kan Li
ALM
383
0
0
17 Feb 2025
Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization
Yongcheng Zeng
Xinyu Cui
Xuanfa Jin
Guoqing Liu
...
Ning Yang
Jun Wang
Jianye Hao
Haifeng Zhang
Jun Wang
LLMAG
LRM
397
1
0
08 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
575
22
0
06 Feb 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li
Zhilin Wang
Tingchen Fu
Ganqu Cui
Sen Yang
Yu Cheng
282
7
0
21 Jan 2025
Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces
Amirreza Payandeh
Daeun Song
Mohammad Nazeri
Jing Liang
Praneel Mukherjee
Amir Hossain Raj
Yangzhe Kong
Dinesh Manocha
Xuesu Xiao
LM&Ro
LRM
460
18
0
17 Jan 2025
CDS: Knowledge Component-Driven Data Synthesis Guided by Cognitive Diagnosis Theory
Haokun Zhao
Jinyi Han
Jiaqing Liang
Yanghua Xiao
Xiaojun Meng
Jiansheng Wei
456
0
0
13 Jan 2025
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
S. Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
MLLM
VLM
394
2
0
07 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Information Fusion (Inf. Fusion), 2024
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
459
82
0
31 Dec 2024
Boosting LLM via Learning from Data Iteratively and Selectively
Qi Jia
Siyu Ren
Ziheng Qin
Fuzhao Xue
Jinjie Ni
Yang You
149
1
0
23 Dec 2024
Synth-Align: Improving Trustworthiness in Vision-Language Model with Synthetic Preference Data Alignment
Robert Wijaya
Ngoc-Bao Nguyen
Ngai-Man Cheung
MLLM
SyDa
306
4
0
23 Dec 2024
Curriculum-style Data Augmentation for LLM-based Metaphor Detection
Kaidi Jia
Yanxia Wu
Rongsheng Li
234
2
0
04 Dec 2024
Learning from "Silly" Questions Improves Large Language Models, But Only Slightly
Tingyuan Zhu
Shudong Liu
Yidong Wang
Yang Li
Han Yu
T. Shinozaki
Jindong Wang
ALM
LRM
211
0
0
21 Nov 2024
Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2024
Hang Zhou
Yehui Tang
Haochen Qin
Yujie Yang
Renren Jin
Deyi Xiong
Kai Han
Yunhe Wang
305
14
0
21 Nov 2024
EVQAScore: A Fine-grained Metric for Video Question Answering Data Quality Evaluation
Hao Liang
Zirong Chen
Feiyu Xiong
Wentao Zhang
316
0
0
11 Nov 2024
PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment
Dongxu Liu
Bing Xu
Yinzhuo Chen
Bufan Xu
Wenpeng Lu
Muyun Yang
Tiejun Zhao
MoE
234
1
0
02 Nov 2024
Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation
Krzysztof Ociepa
Łukasz Flis
Krzysztof Wróbel
Adrian Gwoździej
Remigiusz Kinas
210
8
0
24 Oct 2024
Understanding Layer Significance in LLM Alignment
Guangyuan Shi
Zexin Lu
Xiaoyu Dong
Wenlong Zhang
Xuanyu Zhang
Yujie Feng
Xiao-Ming Wu
532
12
0
23 Oct 2024
Compute-Constrained Data Selection
International Conference on Learning Representations (ICLR), 2024
Junjie Oscar Yin
Alexander M. Rush
607
4
0
21 Oct 2024
IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection
Jielin Song
Siyu Liu
Bin Zhu
Yanghui Rao
163
4
0
17 Oct 2024
Anchored Alignment for Self-Explanations Enhancement
Luis Felipe Villa-Arenas
Ata Nizamoglu
Qianli Wang
Sebastian Möller
Vera Schmitt
256
1
0
17 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Ziqiang Liu
Shiwei Li
...
Yiming Lei
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
425
36
0
16 Oct 2024
Data Quality Control in Federated Instruction-tuning of Large Language Models
Yaxin Du
Guangyi Liu
Fengting Yuchi
W. Zhao
Jingjing Qu
Yanjie Wang
Siheng Chen
ALM
FedML
431
3
0
15 Oct 2024
Safety-Aware Fine-Tuning of Large Language Models
Hyeong Kyu Choi
Xuefeng Du
Yixuan Li
278
34
0
13 Oct 2024
Rethinking Data Selection at Scale: Random Selection is Almost All You Need
Tingyu Xia
Bowen Yu
K. Dang
An Yang
Yuan Wu
Yuan Tian
Yi-Ju Chang
Junyang Lin
ALM
235
13
0
12 Oct 2024
Language Imbalance Driven Rewarding for Multilingual Self-improving
International Conference on Learning Representations (ICLR), 2024
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
ALM
LRM
544
23
0
11 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
International Conference on Learning Representations (ICLR), 2024
Yougang Lyu
Lingyong Yan
Zihan Wang
D. Yin
Sudipta Singha Roy
Maarten de Rijke
Zhaochun Ren
590
15
0
10 Oct 2024
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
International Conference on Learning Representations (ICLR), 2024
Han Shen
Pin-Yu Chen
Payel Das
Tianyi Chen
ALM
289
51
0
09 Oct 2024
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
Xinyu Zhou
Simin Fan
Martin Jaggi
TDI
379
3
0
07 Oct 2024
Selection of LLM Fine-Tuning Data based on Orthogonal Rules
Xiaomin Li
Mingye Gao
Zhiwei Zhang
Chang Yue
Hong Hu
322
9
0
07 Oct 2024
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Yuxin Xiao
Shujian Zhang
Wenxuan Zhou
Marzyeh Ghassemi
Sanqiang Zhao
1.0K
1
0
07 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
510
6
0
02 Oct 2024
Data Proportion Detection for Optimized Data Management for Large Language Models
Hao Liang
Keshi Zhao
Yajie Yang
Bin Cui
Guosheng Dong
Wentao Zhang
170
0
0
26 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Polydoros Giannouris
Ning Wu
Jianhui Chang
Jia Li
272
7
0
20 Sep 2024
Your Weak LLM is Secretly a Strong Teacher for Alignment
International Conference on Learning Representations (ICLR), 2024
Leitian Tao
Yixuan Li
583
15
0
13 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
812
55
0
10 Sep 2024
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation
Ingo Ziegler
Abdullatif Köksal
Desmond Elliott
Hinrich Schütze
308
13
0
03 Sep 2024
Rethinking Backdoor Detection Evaluation for Language Models
Jun Yan
Wenjie Jacky Mo
Xiang Ren
Robin Jia
ELM
337
4
0
31 Aug 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang
Yulei Qin
Tong Wu
Zihan Xu
Gang Li
...
Yuchen Shi
Ke Li
Xing Sun
Jie Yang
Yun Gu
ALM
OffRL
MoE
356
1
0
28 Aug 2024
Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models
Shenglin Zhang
Pengtian Zhu
Minghua Ma
Jiagang Wang
Yongqian Sun
...
Jingyu Wang
Qianying Guo
Xiaolei Hua
Lin Zhu
Dan Pei
AI4TS
125
1
0
22 Aug 2024
CoDi: Conversational Distillation for Grounded Question Answering
Patrick Huber
Arash Einolghozati
Rylan Conway
Kanika Narang
Matt Smith
Waqar Nayyar
Adithya Sagar
Ahmed Aly
Akshat Shrivastava
172
1
0
20 Aug 2024
Towards Efficient Large Language Models for Scientific Text: A Review
H. To
Ming Liu
Guangyan Huang
187
3
0
20 Aug 2024
REInstruct: Building Instruction Data from Unlabeled Corpus
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shu Chen
Xinyan Guan
Yaojie Lu
Hongyu Lin
Xianpei Han
Le Sun
ALM
SyDa
185
5
0
20 Aug 2024
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs
Weijie Lv
Xuan Xia
Sheng-Jun Huang
ALM
218
11
0
05 Aug 2024
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
Hao Liang
Linzhuang Sun
Jingxuan Wei
Xijie Huang
Linkun Sun
Bihui Yu
Conghui He
Wentao Zhang
SyDa
278
7
0
31 Jul 2024