ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.08701
  4. Cited By
AlpaGasus: Training A Better Alpaca with Fewer Data
v1v2v3v4v5 (latest)

AlpaGasus: Training A Better Alpaca with Fewer Data

17 July 2023
Lichang Chen
Shiyang Li
Jun Yan
Hai Wang
Kalpa Gunaratna
Vikas Yadav
Zheng Tang
Vijay Srinivasan
Wanrong Zhu
Heng-Chiao Huang
Hongxia Jin
    ALM
ArXiv (abs)PDFHTMLHuggingFace (23 upvotes)

Papers citing "AlpaGasus: Training A Better Alpaca with Fewer Data"

31 / 181 papers shown
SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully
SH2: Self-Highlighted Hesitation Helps You Decode More TruthfullyConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jushi Kai
Hai Hu
Zhouhan Lin
HILM
295
15
0
11 Jan 2024
What Makes Good Data for Alignment? A Comprehensive Study of Automatic
  Data Selection in Instruction Tuning
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Wei Liu
Weihao Zeng
Keqing He
Yong Jiang
Junxian He
ALM
400
322
0
25 Dec 2023
One-Shot Learning as Instruction Data Prospector for Large Language
  Models
One-Shot Learning as Instruction Data Prospector for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunshui Li
Binyuan Hui
Xiaobo Xia
Jiaxi Yang
Min Yang
...
Ling-Hao Chen
Junhao Liu
Tongliang Liu
Fei Huang
Yongbin Li
362
46
0
16 Dec 2023
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context
  Learning
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin
Abhilasha Ravichander
Ximing Lu
Nouha Dziri
Melanie Sclar
Khyathi Chandu
Chandra Bhagavatula
Yejin Choi
244
261
0
04 Dec 2023
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
A. Yadav
Arjun Singh
272
3
0
03 Dec 2023
The Philosopher's Stone: Trojaning Plugins of Large Language Models
The Philosopher's Stone: Trojaning Plugins of Large Language ModelsNetwork and Distributed System Security Symposium (NDSS), 2023
Tian Dong
Minhui Xue
Guoxing Chen
Rayne Holland
Shaofeng Li
Yan Meng
Zhen Liu
Haojin Zhu
AAML
407
31
0
01 Dec 2023
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models
  Catching up?
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?
Hailin Chen
Fangkai Jiao
Xingxuan Li
Chengwei Qin
Mathieu Ravaut
Ruochen Zhao
Caiming Xiong
Shafiq Joty
ELMCLLAI4MHLRMALM
361
31
0
28 Nov 2023
MoDS: Model-oriented Data Selection for Instruction Tuning
MoDS: Model-oriented Data Selection for Instruction Tuning
Qianlong Du
Chengqing Zong
Jiajun Zhang
ALM
208
113
0
27 Nov 2023
Data Diversity Matters for Robust Instruction Tuning
Data Diversity Matters for Robust Instruction TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Alexander Bukharin
Tuo Zhao
336
68
0
21 Nov 2023
Oasis: Data Curation and Assessment System for Pretraining of Large
  Language Models
Oasis: Data Curation and Assessment System for Pretraining of Large Language ModelsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Tong Zhou
Yubo Chen
Pengfei Cao
Kang Liu
Jun Zhao
Shengping Liu
245
3
0
21 Nov 2023
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhihan Zhang
Dong-Ho Lee
Yuwei Fang
Wenhao Yu
Mengzhao Jia
Meng Jiang
Francesco Barbieri
ALM
335
43
0
15 Nov 2023
Explanation-aware Soft Ensemble Empowers Large Language Model In-context
  Learning
Explanation-aware Soft Ensemble Empowers Large Language Model In-context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yue Yu
Jiaming Shen
Tianqi Liu
Zhen Qin
Jing Nathan Yan
Jialu Liu
Chao Zhang
Michael Bendersky
244
9
0
13 Nov 2023
Correction with Backtracking Reduces Hallucination in Summarization
Correction with Backtracking Reduces Hallucination in Summarization
Zhenzhen Liu
Chao-gang Wan
Varsha Kishore
Jin Peng Zhou
Minmin Chen
Kilian Q. Weinberger
HILM
236
4
0
24 Oct 2023
HallusionBench: An Advanced Diagnostic Suite for Entangled Language
  Hallucination and Visual Illusion in Large Vision-Language Models
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Tianrui Guan
Fuxiao Liu
Xiyang Wu
Ruiqi Xian
Zongxia Li
...
Lichang Chen
Furong Huang
Yaser Yacoob
Dinesh Manocha
Wanrong Zhu
VLMMLLM
457
352
0
23 Oct 2023
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning
Ming Li
Lichang Chen
Jiuhai Chen
Shwai He
Heng-Chiao Huang
Jiuxiang Gu
Wanrong Zhu
233
28
0
18 Oct 2023
Evaluating Large Language Models at Evaluating Instruction Following
Evaluating Large Language Models at Evaluating Instruction FollowingInternational Conference on Learning Representations (ICLR), 2023
Zhiyuan Zeng
Jiatong Yu
Tianyu Gao
Yu Meng
Tanya Goyal
Danqi Chen
ELMALM
411
264
0
11 Oct 2023
KwaiYiiMath: Technical Report
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Chen Zhang
Kun Gai
LRMReLMRALM
299
3
0
11 Oct 2023
NEFTune: Noisy Embeddings Improve Instruction Finetuning
NEFTune: Noisy Embeddings Improve Instruction FinetuningInternational Conference on Learning Representations (ICLR), 2023
Neel Jain
Ping Yeh-Chiang
Yuxin Wen
John Kirchenbauer
Hong-Min Chu
...
Avi Schwarzschild
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
287
108
0
09 Oct 2023
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
OpenChat: Advancing Open-source Language Models with Mixed-Quality DataInternational Conference on Learning Representations (ICLR), 2023
Guan-Bo Wang
Sijie Cheng
Xianyuan Zhan
Xiangang Li
Sen Song
Yang Liu
ALM
429
306
0
20 Sep 2023
Are Large Language Models Really Robust to Word-Level Perturbations?
Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang
Guozheng Ma
Cong Yu
Ning Gui
Linrui Zhang
...
Sen Zhang
Li Shen
Xueqian Wang
Peilin Zhao
Dacheng Tao
KELM
305
31
0
20 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
372
112
0
13 Sep 2023
Data-Juicer: A One-Stop Data Processing System for Large Language Models
Data-Juicer: A One-Stop Data Processing System for Large Language Models
Daoyuan Chen
Yilun Huang
Zhijian Ma
Hesen Chen
Xuchen Pan
...
Zhaoyang Liu
Jinyang Gao
Yaliang Li
Bolin Ding
Jingren Zhou
SyDaVLM
297
59
0
05 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsComputational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
Shuming Shi
LRMRALMHILM
710
812
0
03 Sep 2023
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
Lai Wei
Zihao Jiang
Weiran Huang
Lichao Sun
VLMMLLM
296
74
0
23 Aug 2023
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data
  Selection for Instruction Tuning
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction TuningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Ming Li
Yong Zhang
Zhitao Li
Jiuhai Chen
Lichang Chen
Ning Cheng
Jianzong Wang
Wanrong Zhu
Jing Xiao
520
293
0
23 Aug 2023
Self-Alignment with Instruction Backtranslation
Self-Alignment with Instruction BacktranslationInternational Conference on Learning Representations (ICLR), 2023
Xian Li
Ping Yu
Chunting Zhou
Timo Schick
Omer Levy
Luke Zettlemoyer
Jason Weston
M. Lewis
SyDa
354
166
0
11 Aug 2023
A Preliminary Study of the Intrinsic Relationship between Complexity and
  Alignment
A Preliminary Study of the Intrinsic Relationship between Complexity and AlignmentInternational Conference on Language Resources and Evaluation (LREC), 2023
Ying Zhao
Yu Bowen
Binyuan Hui
Haiyang Yu
Fei Huang
Yongbin Li
Ningyu Zhang
247
34
0
10 Aug 2023
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt
  Injection
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt InjectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Jun Yan
Vikas Yadav
Shiyang Li
Lichang Chen
Zheng Tang
Hai Wang
Vijay Srinivasan
Xiang Ren
Hongxia Jin
SILM
355
153
0
31 Jul 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models:
  A Survey and Outlook
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and OutlookInternational Journal of Computer Vision (IJCV), 2023
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
309
13
0
31 Jul 2023
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
WizardCoder: Empowering Code Large Language Models with Evol-InstructInternational Conference on Learning Representations (ICLR), 2023
Ziyang Luo
Can Xu
Lu Wang
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELMSyDaALM
722
857
0
14 Jun 2023
Learning Performance-Improving Code Edits
Learning Performance-Improving Code EditsInternational Conference on Learning Representations (ICLR), 2023
Alex Shypula
Aman Madaan
Yiming Yang
Uri Alon
Jacob R. Gardner
Milad Hashemi
Graham Neubig
Parthasarathy Ranganathan
Osbert Bastani
Amir Yazdanbakhsh
SyDa
328
126
0
15 Feb 2023
Previous
1234