AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023

Vikas Yadav

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

31 / 181 papers shown

SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

295

11 Jan 2024

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

400

322

25 Dec 2023

One-Shot Learning as Instruction Data Prospector for Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Yunshui Li

Binyuan Hui

Xiaobo Xia

Jiaxi Yang

Min Yang

...

Fei Huang

362

16 Dec 2023

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Bill Yuchen Lin

Abhilasha Ravichander

Yejin Choi

244

261

04 Dec 2023

SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise

A. Yadav

Arjun Singh

272

03 Dec 2023

The Philosopher's Stone: Trojaning Plugins of Large Language Models

Network and Distributed System Security Symposium (NDSS), 2023

Guoxing Chen

Yan Meng

Haojin Zhu

407

01 Dec 2023

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

Hailin Chen

ELM CLL AI4MH LRM ALM

361

28 Nov 2023

MoDS: Model-oriented Data Selection for Instruction Tuning

208

113

27 Nov 2023

Data Diversity Matters for Robust Instruction Tuning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Alexander Bukharin

Tuo Zhao

336

21 Nov 2023

Oasis: Data Curation and Assessment System for Pretraining of Large Language Models

International Joint Conference on Artificial Intelligence (IJCAI), 2023

Tong Zhou

Yubo Chen

Pengfei Cao

Kang Liu

Jun Zhao

Shengping Liu

245

21 Nov 2023

PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

335

15 Nov 2023

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Jiaming Shen

Zhen Qin

244

13 Nov 2023

Correction with Backtracking Reduces Hallucination in Summarization

236

24 Oct 2023

HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models

Computer Vision and Pattern Recognition (CVPR), 2023

Fuxiao Liu

...

Furong Huang

457

352

23 Oct 2023

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

233

18 Oct 2023

Evaluating Large Language Models at Evaluating Instruction Following

International Conference on Learning Representations (ICLR), 2023

411

264

11 Oct 2023

KwaiYiiMath: Technical Report

...

Fuzheng Zhang

299

11 Oct 2023

NEFTune: Noisy Embeddings Improve Instruction Finetuning

International Conference on Learning Representations (ICLR), 2023

...

287

108

09 Oct 2023

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

International Conference on Learning Representations (ICLR), 2023

Yang Liu

429

306

20 Sep 2023

Are Large Language Models Really Robust to Word-Level Perturbations?

...

Li Shen

305

20 Sep 2023

Cognitive Mirage: A Review of Hallucinations in Large Language Models

372

112

13 Sep 2023

Data-Juicer: A One-Stop Data Processing System for Large Language Models

...

Jingren Zhou

297

05 Sep 2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Computational Linguistics (CL), 2023

...

710

812

03 Sep 2023

InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4

Lichao Sun

296

23 Aug 2023

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

520

293

23 Aug 2023

Self-Alignment with Instruction Backtranslation

International Conference on Learning Representations (ICLR), 2023

Xian Li

Luke Zettlemoyer

Jason Weston

M. Lewis

SyDa

354

166

11 Aug 2023

A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment

International Conference on Language Resources and Evaluation (LREC), 2023

Fei Huang

247

10 Aug 2023

Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Vikas Yadav

Xiang Ren

355

153

31 Jul 2023

On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook

International Journal of Computer Vision (IJCV), 2023

Mingyuan Fan

Chengyu Wang

Cen Chen

Yang Liu

Jun Huang

HILM

309

31 Jul 2023

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

International Conference on Learning Representations (ICLR), 2023

722

857

14 Jun 2023

Learning Performance-Improving Code Edits

International Conference on Learning Representations (ICLR), 2023

Graham Neubig

Parthasarathy Ranganathan

Osbert Bastani

Amir Yazdanbakhsh

SyDa

328

126

15 Feb 2023