2311.13246
Cited By
CoachLM: Automatic Instruction Revisions Improve the Data Quality in LLM Instruction Tuning
22 November 2023
Yilun Liu
Shimin Tao
Xiaofeng Zhao
Ming Zhu
Wenbing Ma
Junhao Zhu
Chang Su
Yutai Hou
Miao Zhang
Min Zhang
Hongxia Ma
Li Zhang
Hao-Yu Yang
Yanfei Jiang
Papers citing
"CoachLM: Automatic Instruction Revisions Improve the Data Quality in LLM Instruction Tuning"
12 / 12 papers shown
Title
MergeIT: From Selection to Merging for Efficient Instruction Tuning
Hongyi Cai
Yuqian Fu
Hongming Fu
Bo Zhao
MoMe
47
0
0
25 Feb 2025
Data Wrangling Task Automation Using Code-Generating Language Models
Ashlesha Akella
Krishnasuri Narayanam
SyDa
45
0
0
05 Feb 2025
LogLM: From Task-based to Instruction-based Automated Log Analysis
Yilun Liu
Yuhe Ji
Shimin Tao
Minggui He
Weibin Meng
Shenglin Zhang
Yongqian Sun
Yuming Xie
Boxing Chen
Hao Yang
42
2
0
10 Jan 2025
NILE: Internal Consistency Alignment in Large Language Models
Minda Hu
Qiyuan Zhang
Yufei Wang
Bowei He
Hongru Wang
Jingyan Zhou
Liangyou Li
Yasheng Wang
Chen-li Ma
Irwin King
81
0
0
21 Dec 2024
Federated Data-Efficient Instruction Tuning for Large Language Models
Zhen Qin
Zhaomin Wu
Bingsheng He
Shuiguang Deng
FedML
30
2
0
14 Oct 2024
What Do You Want? User-centric Prompt Generation for Text-to-image Synthesis via Multi-turn Guidance
Yilun Liu
Minggui He
Feiyu Yao
Yuhe Ji
Shimin Tao
...
Jian Gao
Li Zhang
Hao Yang
Boxing Chen
Osamu Yoshie
31
0
0
23 Aug 2024
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Yuxin Jiang
Bo Huang
Yufei Wang
Xingshan Zeng
Liangyou Li
Yasheng Wang
Xin Jiang
Lifeng Shang
Ruiming Tang
Wei Wang
40
5
0
14 Aug 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
Yuan Ge
Yilun Liu
Chi Hu
Weibin Meng
Shimin Tao
Xiaofeng Zhao
Hongxia Ma
Li Zhang
Hao Yang
Tong Xiao
ALM
14
24
0
28 Feb 2024
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
A. Kalyan
ELM
ReLM
LRM
198
1,089
0
20 Sep 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
A Systematic Evaluation of Large Language Models of Code
Frank F. Xu
Uri Alon
Graham Neubig
Vincent J. Hellendoorn
ELM
ALM
188
624
0
26 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022