2307.08701
v5 (latest)
AlpaGasus: Training A Better Alpaca with Fewer Data
17 July 2023
Lichang Chen
Shiyang Li
Jun Yan
Hai Wang
Kalpa Gunaratna
Vikas Yadav
Zheng Tang
Vijay Srinivasan
Wanrong Zhu
Heng-Chiao Huang
Hongxia Jin
ALM
HuggingFace (23 upvotes)
Papers citing
"AlpaGasus: Training A Better Alpaca with Fewer Data"
50 / 174 papers shown
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Information Fusion (Inf. Fusion), 2024
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
427
79
0
31 Dec 2024
Boosting LLM via Learning from Data Iteratively and Selectively
Qi Jia
Siyu Ren
Ziheng Qin
Fuzhao Xue
Jinjie Ni
Yang You
129
1
0
23 Dec 2024
Synth-Align: Improving Trustworthiness in Vision-Language Model with Synthetic Preference Data Alignment
Robert Wijaya
Ngoc-Bao Nguyen
Ngai-Man Cheung
MLLM
SyDa
276
4
0
23 Dec 2024
Curriculum-style Data Augmentation for LLM-based Metaphor Detection
Kaidi Jia
Yanxia Wu
Rongsheng Li
222
2
0
04 Dec 2024
Learning from "Silly" Questions Improves Large Language Models, But Only Slightly
Tingyuan Zhu
Shudong Liu
Yidong Wang
Yang Li
Han Yu
T. Shinozaki
Jindong Wang
ALM
LRM
198
0
0
21 Nov 2024
Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning
Neural Information Processing Systems (NeurIPS), 2024
Hang Zhou
Yehui Tang
Haochen Qin
Yujie Yang
Renren Jin
Deyi Xiong
Kai Han
Yunhe Wang
287
12
0
21 Nov 2024
EVQAScore: A Fine-grained Metric for Video Question Answering Data Quality Evaluation
Hao Liang
Zirong Chen
Feiyu Xiong
Wentao Zhang
280
0
0
11 Nov 2024
PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment
Dongxu Liu
Bing Xu
Yinzhuo Chen
Bufan Xu
Wenpeng Lu
Muyun Yang
Tiejun Zhao
MoE
192
1
0
02 Nov 2024
Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation
Krzysztof Ociepa
Łukasz Flis
Krzysztof Wróbel
Adrian Gwoździej
Remigiusz Kinas
179
6
0
24 Oct 2024
Understanding Layer Significance in LLM Alignment
Guangyuan Shi
Zexin Lu
Xiaoyu Dong
Wenlong Zhang
Xuanyu Zhang
Yujie Feng
Xiao-Ming Wu
452
11
0
23 Oct 2024
Compute-Constrained Data Selection
International Conference on Learning Representations (ICLR), 2024
Junjie Oscar Yin
Alexander M. Rush
562
4
0
21 Oct 2024
IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection
Jielin Song
Siyu Liu
Bin Zhu
Yanghui Rao
118
3
0
17 Oct 2024
Anchored Alignment for Self-Explanations Enhancement
Luis Felipe Villa-Arenas
Ata Nizamoglu
Qianli Wang
Sebastian Möller
Vera Schmitt
222
1
0
17 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Ziqiang Liu
Shiwei Li
...
Yiming Lei
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
413
36
0
16 Oct 2024
Data Quality Control in Federated Instruction-tuning of Large Language Models
Yaxin Du
Guangyi Liu
Fengting Yuchi
W. Zhao
Jingjing Qu
Yanjie Wang
Siheng Chen
ALM
FedML
305
3
0
15 Oct 2024
Safety-Aware Fine-Tuning of Large Language Models
Hyeong Kyu Choi
Xuefeng Du
Yixuan Li
261
33
0
13 Oct 2024
Rethinking Data Selection at Scale: Random Selection is Almost All You Need
Tingyu Xia
Bowen Yu
K. Dang
An Yang
Yuan Wu
Yuan Tian
Yi-Ju Chang
Junyang Lin
ALM
203
13
0
12 Oct 2024
Language Imbalance Driven Rewarding for Multilingual Self-improving
International Conference on Learning Representations (ICLR), 2024
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
ALM
LRM
510
22
0
11 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
International Conference on Learning Representations (ICLR), 2024
Yougang Lyu
Lingyong Yan
Zihan Wang
D. Yin
Sudipta Singha Roy
Maarten de Rijke
Zhaochun Ren
534
15
0
10 Oct 2024
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
International Conference on Learning Representations (ICLR), 2024
Han Shen
Pin-Yu Chen
Payel Das
Tianyi Chen
ALM
269
47
0
09 Oct 2024
Selection of LLM Fine-Tuning Data based on Orthogonal Rules
Xiaomin Li
Mingye Gao
Zhiwei Zhang
Chang Yue
Hong Hu
286
9
0
07 Oct 2024
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Yuxin Xiao
Shujian Zhang
Wenxuan Zhou
Marzyeh Ghassemi
Sanqiang Zhao
964
1
0
07 Oct 2024
HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation
Xinyu Zhou
Simin Fan
Martin Jaggi
TDI
352
3
0
07 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
481
6
0
02 Oct 2024
Data Proportion Detection for Optimized Data Management for Large Language Models
Hao Liang
Keshi Zhao
Yajie Yang
Bin Cui
Guosheng Dong
Wentao Zhang
133
0
0
26 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Polydoros Giannouris
Ning Wu
Jianhui Chang
Jia Li
240
6
0
20 Sep 2024
Your Weak LLM is Secretly a Strong Teacher for Alignment
International Conference on Learning Representations (ICLR), 2024
Leitian Tao
Yixuan Li
529
15
0
13 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
744
54
0
10 Sep 2024
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation
Ingo Ziegler
Abdullatif Köksal
Desmond Elliott
Hinrich Schütze
244
12
0
03 Sep 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang
Yulei Qin
Tong Wu
Zihan Xu
Gang Li
...
Yuchen Shi
Ke Li
Xing Sun
Jie Yang
Yun Gu
ALM
OffRL
MoE
323
1
0
28 Aug 2024
Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models
Shenglin Zhang
Pengtian Zhu
Minghua Ma
Jiagang Wang
Yongqian Sun
...
Jingyu Wang
Qianying Guo
Xiaolei Hua
Lin Zhu
Dan Pei
AI4TS
104
1
0
22 Aug 2024
CoDi: Conversational Distillation for Grounded Question Answering
Patrick Huber
Arash Einolghozati
Rylan Conway
Kanika Narang
Matt Smith
Waqar Nayyar
Adithya Sagar
Ahmed Aly
Akshat Shrivastava
157
1
0
20 Aug 2024
Towards Efficient Large Language Models for Scientific Text: A Review
H. To
Ming Liu
Guangyan Huang
169
3
0
20 Aug 2024
REInstruct: Building Instruction Data from Unlabeled Corpus
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shu Chen
Xinyan Guan
Yaojie Lu
Hongyu Lin
Xianpei Han
Le Sun
ALM
SyDa
166
5
0
20 Aug 2024
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs
Weijie Lv
Xuan Xia
Sheng-Jun Huang
ALM
171
9
0
05 Aug 2024
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
Hao Liang
Linzhuang Sun
Jingxuan Wei
Xijie Huang
Linkun Sun
Bihui Yu
Conghui He
Wentao Zhang
SyDa
251
7
0
31 Jul 2024
Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey
Atsuyuki Miyai
Jingkang Yang
Jingyang Zhang
Yifei Ming
Sisir Dhakal
...
Yixuan Li
Hai "Helen" Li
Ziwei Liu
Toshihiko Yamasaki
Kiyoharu Aizawa
359
29
0
31 Jul 2024
SynthVLM: Towards High-Quality and Efficient Synthesis of Image-Caption Datasets for Vision-Language Models
Zheng Liu
Hao Liang
Xijie Huang
Wentao Xiong
Qinhan Yu
Linzhuang Sun
Chong Chen
Huang Leng
SyDa
447
1
0
30 Jul 2024
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Tianhao Wu
Weizhe Yuan
O. Yu. Golovneva
Jing Xu
Yuandong Tian
Jiantao Jiao
Jason Weston
Sainbayar Sukhbaatar
ALM
KELM
LRM
318
149
0
28 Jul 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
413
8
0
26 Jul 2024
Quality Assured: Rethinking Annotation Strategies in Imaging AI
Tim Radsch
Annika Reinke
V. Weru
M. Tizabi
Nicholas Heller
Hyunjin Park
Annette Kopp-Schneider
Lena Maier-Hein
214
6
0
24 Jul 2024
Entropy Law: The Story Behind Data Compression and LLM Performance
Mingjia Yin
Chuhan Wu
Yufei Wang
Hao Wang
Wei Guo
Yasheng Wang
Yong Liu
Ruiming Tang
Defu Lian
Enhong Chen
262
41
0
09 Jul 2024
LIONs: An Empirically Optimized Approach to Align Language Models
Xiao Yu
Qingyang Wu
Yu Li
Zhou Yu
ALM
237
6
0
09 Jul 2024
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Miao Zheng
H. Liang
Fan Yang
Haoze Sun
Tianpeng Li
...
Kun Fang
Weipeng Chen
Bin Cui
Wentao Zhang
Guosheng Dong
RALM
251
6
0
08 Jul 2024
KeyVideoLLM: Towards Large-scale Video Keyframe Selection
Hao Liang
Jiapeng Li
Tianyi Bai
Xijie Huang
Linzhuang Sun
Zhengren Wang
Conghui He
Bin Cui
Chong Chen
Wentao Zhang
VGen
296
27
0
03 Jul 2024
Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data
Linzhuang Sun
Hao Liang
Jingxuan Wei
Linkun Sun
Bihui Yu
Bin Cui
Wentao Zhang
166
2
0
02 Jul 2024
Curriculum Learning with Quality-Driven Data Selection
Biao Wu
Fang Meng
375
5
0
27 Jun 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
Italo Luis da Silva
Hanqi Yan
Lin Gui
Yulan He
CML
358
0
0
26 Jun 2024
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
Deng Cai
Huayang Li
Tingchen Fu
Siheng Li
Weiwen Xu
...
Leyang Cui
Yan Wang
Lemao Liu
Taro Watanabe
Shuming Shi
KELM
224
2
0
24 Jun 2024
M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
Rishabh Maheshwary
Vikas Yadav
Hoang Nguyen
Khyati Mahajan
Sathwik Tejaswi Madhusudhan
448
7
0
24 Jun 2024