2406.13542
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
19 June 2024
Guanting Dong
K. Lu
Chengpeng Li
Tingyu Xia
Bowen Yu
Chang Zhou
Jingren Zhou
SyDa
ALM
LRM
Papers citing
"Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models"
7 / 7 papers shown
Title
WebThinker: Empowering Large Reasoning Models with Deep Research Capability
X. Li
Jiajie Jin
Guanting Dong
Hongjin Qian
Yutao Zhu
Yongkang Wu
Ji-Rong Wen
Zhicheng Dou
LLMAG
LRM
82
1
0
30 Apr 2025
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Jiale Cheng
Xiao-Chang Liu
C. Wang
Xiaotao Gu
Y. Lu
Dan Zhang
Yuxiao Dong
J. Tang
Hongning Wang
Minlie Huang
LRM
117
3
0
16 Dec 2024
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning
Xiaochuan Li
Zichun Yu
Chenyan Xiong
SyDa
24
1
0
18 Oct 2024
Towards Scalable Automated Alignment of LLMs: A Survey
Boxi Cao
Keming Lu
Xinyu Lu
Jiawei Chen
Mengjie Ren
...
Ben He
Xianpei Han
Le Sun
Hongyu Lin
Bowen Yu
LM&MA
23
23
0
03 Jun 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
59
40
0
22 May 2024
Instruction Tuning with GPT-4
Baolin Peng
Chunyuan Li
Pengcheng He
Michel Galley
Jianfeng Gao
SyDa
ALM
LM&MA
154
576
0
06 Apr 2023
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
116
232
0
05 Jul 2022