AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023

Vikas Yadav

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

50 / 189 papers shown

Advancing MAPF towards the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)

426

03 Mar 2025

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning

233

26 Feb 2025

Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning

Hongyi Cai

Jie Li

Mohammad Mahdinur Rahman

Wenzhen Dong

412

26 Feb 2025

MergeIT: From Selection to Merging for Efficient Instruction Tuning

333

25 Feb 2025

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

364

22 Feb 2025

EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness

International Joint Conference on Artificial Intelligence (IJCAI), 2025

218

18 Feb 2025

InsBank: Evolving Instruction Subset for Ongoing Alignment

...

383

17 Feb 2025

Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization

...

397

08 Feb 2025

The Best Instruction-Tuning Data are Those That Fit

575

06 Feb 2025

From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning

282

21 Jan 2025

Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces

460

17 Jan 2025

CDS: Knowledge Component-Driven Data Synthesis Guided by Cognitive Diagnosis Theory

456

13 Jan 2025

MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation

S. Joshi

Besmira Nushi

Vidhisha Balachandran

Varun Chandrasekaran

Vibhav Vineet

Neel Joshi

Baharan Mirzasoleiman

MLLM VLM

394

07 Jan 2025

A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine

Information Fusion (Inf. Fusion), 2024

459

31 Dec 2024

Boosting LLM via Learning from Data Iteratively and Selectively

149

23 Dec 2024

Synth-Align: Improving Trustworthiness in Vision-Language Model with Synthetic Preference Data Alignment

306

23 Dec 2024

Curriculum-style Data Augmentation for LLM-based Metaphor Detection

Kaidi Jia

Yanxia Wu

Rongsheng Li

234

04 Dec 2024

Learning from "Silly" Questions Improves Large Language Models, But Only Slightly

211

21 Nov 2024

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Neural Information Processing Systems (NeurIPS), 2024

305

21 Nov 2024

EVQAScore: A Fine-grained Metric for Video Question Answering Data Quality Evaluation

Hao Liang

Zirong Chen

Feiyu Xiong

Wentao Zhang

316

11 Nov 2024

PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment

234

02 Nov 2024

Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation

210

24 Oct 2024

Understanding Layer Significance in LLM Alignment

532

23 Oct 2024

Compute-Constrained Data Selection

International Conference on Learning Representations (ICLR), 2024

Junjie Oscar Yin

Alexander M. Rush

607

21 Oct 2024

IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

163

17 Oct 2024

Anchored Alignment for Self-Explanations Enhancement

Luis Felipe Villa-Arenas

256

17 Oct 2024

A Survey on Data Synthesis and Augmentation for Large Language Models

...

425

16 Oct 2024

Data Quality Control in Federated Instruction-tuning of Large Language Models

431

15 Oct 2024

Safety-Aware Fine-Tuning of Large Language Models

Hyeong Kyu Choi

Xuefeng Du

Yixuan Li

278

13 Oct 2024

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

235

12 Oct 2024

Language Imbalance Driven Rewarding for Multilingual Self-improving

International Conference on Learning Representations (ICLR), 2024

544

11 Oct 2024

MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

International Conference on Learning Representations (ICLR), 2024

590

10 Oct 2024

SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection

International Conference on Learning Representations (ICLR), 2024

Pin-Yu Chen

289

09 Oct 2024

HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation

379

07 Oct 2024

Selection of LLM Fine-Tuning Data based on Orthogonal Rules

322

07 Oct 2024

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

1.0K

07 Oct 2024

Integrative Decoding: Improve Factuality via Implicit Self-consistency

Yeyun Gong

...

Wenjie Li

Jian Jiao

Qi Chen

Peng Cheng

Wayne Xiong

HILM

510

02 Oct 2024

Data Proportion Detection for Optimized Data Management for Large Language Models

Hao Liang

Keshi Zhao

Yajie Yang

Bin Cui

Guosheng Dong

Wentao Zhang

170

26 Sep 2024

ControlMath: Controllable Data Generation Promotes Math Generalist Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Polydoros Giannouris

Ning Wu

Jianhui Chang

Jia Li

272

20 Sep 2024

Your Weak LLM is Secretly a Strong Teacher for Alignment

International Conference on Learning Representations (ICLR), 2024

Leitian Tao

Yixuan Li

583

13 Sep 2024

What is the Role of Small Models in the LLM Era: A Survey

Lihu Chen

Gaël Varoquaux

ALM

812

10 Sep 2024

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation

308

03 Sep 2024

Rethinking Backdoor Detection Evaluation for Language Models

337

31 Aug 2024

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Yuncheng Yang

Tong Wu

...

Ke Li

Xing Sun

Jie Yang

Yun Gu

ALM OffRL MoE

356

28 Aug 2024

Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models

...

125

22 Aug 2024

CoDi: Conversational Distillation for Grounded Question Answering

172

20 Aug 2024

Towards Efficient Large Language Models for Scientific Text: A Review

H. To

Ming Liu

Guangyan Huang

187

20 Aug 2024

REInstruct: Building Instruction Data from Unlabeled Corpus

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Xianpei Han

Le Sun

ALM SyDa

185

20 Aug 2024

CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs

Weijie Lv

Xuan Xia

Sheng-Jun Huang

ALM

218

05 Aug 2024

Synth-Empathy: Towards High-Quality Synthetic Empathy Data

Hao Liang

Linzhuang Sun

Jingxuan Wei

Xijie Huang

Linkun Sun

Bihui Yu

Conghui He

Wentao Zhang

SyDa

278

31 Jul 2024