Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17651
Cited By
v1
v2 (latest)
Self-Refine: Iterative Refinement with Self-Feedback
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"Self-Refine: Iterative Refinement with Self-Feedback"
50 / 1,674 papers shown
Autonomous LLM-driven research from data to human-verifiable research papers
Tal Ifargan
Lukas Hafner
Maor Kern
Ori Alcalay
Roy Kishony
300
43
0
24 Apr 2024
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
648
47
0
24 Apr 2024
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Xun Wu
Shaohan Huang
Furu Wei
208
16
0
23 Apr 2024
NExT: Teaching Large Language Models to Reason about Code Execution
Ansong Ni
Miltiadis Allamanis
Arman Cohan
Yinlin Deng
Kensen Shi
Charles Sutton
Pengcheng Yin
ReLM
LRM
270
62
0
23 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRM
AIMat
496
8
0
23 Apr 2024
A Survey on Self-Evolution of Large Language Models
Zhengwei Tao
Ting-En Lin
Xiancai Chen
Hangyu Li
Yuchuan Wu
Yongbin Li
Zhi Jin
Fei Huang
Dacheng Tao
Jingren Zhou
LRM
LM&Ro
302
46
0
22 Apr 2024
ISQA: Informative Factuality Feedback for Scientific Summarization
Zekai Li
Yanxia Qin
Qian Liu
Min-Yen Kan
HILM
242
2
0
20 Apr 2024
iTBLS: A Dataset of Interactive Conversations Over Tabular Information
Anirudh S. Sundar
Christopher Richardson
William Gay
Larry Heck
LMTD
355
3
0
19 Apr 2024
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Shreya Shankar
J.D. Zamfirescu-Pereira
Bjorn Hartmann
Aditya G. Parameswaran
Ian Arawjo
ALM
245
180
0
18 Apr 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian
Baolin Peng
Linfeng Song
Lifeng Jin
Dian Yu
Haitao Mi
Dong Yu
LRM
ReLM
261
124
0
18 Apr 2024
Large Language Models Can Solve Real-World Planning Rigorously with Formal Verification Tools
Yilun Hao
Yongchao Chen
Yang Zhang
Chuchu Fan
LRM
LLMAG
286
4
0
18 Apr 2024
Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study
Zooey Nguyen
Anthony Annunziata
Vinh Luong
Sang Dinh
Quynh Le
Anh Hai Ha
Chanh Le
Hong An Phan
Shruti Raghavan
Christopher Nguyen
LRM
157
8
0
17 Apr 2024
AgentKit: Flow Engineering with Graphs, not Coding
Yue Wu
Yewen Fan
So Yeon Min
Shrimai Prabhumoye
Alexander Shmakov
Yonatan Bisk
Ruslan Salakhutdinov
Yuanzhi Li
Tom Michael Mitchell
AI4CE
337
0
0
17 Apr 2024
Can Language Models Solve Olympiad Programming?
Quan Shi
Michael Tang
Karthik Narasimhan
Shunyu Yao
ELM
LRM
ReLM
334
51
0
16 Apr 2024
Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs
Ruoxi Cheng
Haoxuan Ma
Shuirong Cao
Jiaqi Li
Aihua Pei
Zhiqiang Wang
Pengliang Ji
Haoyu Wang
Jiaqi Huo
AI4CE
437
21
0
15 Apr 2024
LLM Evaluators Recognize and Favor Their Own Generations
Arjun Panickssery
Samuel R. Bowman
Shi Feng
411
348
0
15 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLM
LRM
314
13
0
14 Apr 2024
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li
Chenghao Yang
Allyson Ettinger
ReLM
LRM
LLMAG
156
15
0
14 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Luan Tuyen Chau
LLMAG
496
15
0
14 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
407
88
0
12 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie
Yuheng Huang
Zhehua Zhou
Yuheng Huang
Da Song
Lei Ma
OffRL
385
12
0
12 Apr 2024
Auctions with LLM Summaries
Kumar Avinava Dubey
Zhe Feng
Rahul Kidambi
Aranyak Mehta
Di Wang
170
22
0
11 Apr 2024
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
Dayeon Ki
Marine Carpuat
284
33
0
11 Apr 2024
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jinheon Baek
S. Jauhar
Silviu Cucerzan
Sung Ju Hwang
AI4CE
LLMAG
LM&Ro
379
104
0
11 Apr 2024
CodecLM: Aligning Language Models with Tailored Synthetic Data
Zifeng Wang
Chun-Liang Li
Vincent Perot
Long T. Le
Jin Miao
Zizhao Zhang
Chen-Yu Lee
Tomas Pfister
SyDa
ALM
194
34
0
08 Apr 2024
RoT: Enhancing Large Language Models with Reflection on Search Trees
Wenyang Hui
Kewei Tu
LRM
241
13
0
08 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
400
6
0
05 Apr 2024
SELF-[IN]CORRECT: LLMs Struggle with Refining Self-Generated Responses
AAAI Conference on Artificial Intelligence (AAAI), 2024
Dongwei Jiang
Jingyu Zhang
Orion Weller
Nathaniel Weir
Benjamin Van Durme
Daniel Khashabi
226
12
0
04 Apr 2024
Evaluating LLMs at Detecting Errors in LLM Responses
Ryo Kamoi
Sarkar Snigdha Sarathi Das
Renze Lou
Jihyun Janice Ahn
Yilun Zhao
...
Salika Dave
Shaobo Qin
Arman Cohan
Wenpeng Yin
Rui Zhang
217
46
0
04 Apr 2024
Personalized LLM Response Generation with Parameterized Memory Injection
Kai Zhang
Lizhi Qing
Yangyang Kang
348
22
0
04 Apr 2024
Empowering Biomedical Discovery with AI Agents
Cell (Cell), 2024
Shanghua Gao
Ada Fang
Yepeng Huang
Valentina Giunchiglia
Ayush Noori
Jonathan Richard Schwarz
Yasha Ektefaie
Jovana Kondic
Marinka Zitnik
LLMAG
AI4CE
267
208
0
03 Apr 2024
Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
Yoichi Ishibashi
Yoshimasa Nishimura
240
73
0
02 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
AI4CE
LLMAG
LM&Ro
LM&MA
680
107
0
02 Apr 2024
Large Language Models are Capable of Offering Cognitive Reappraisal, if Guided
Hongli Zhan
Allen Zheng
Yoon Kyung Lee
Jina Suh
Junyi Jessy Li
Desmond C. Ong
AI4MH
256
19
0
01 Apr 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Wei He
Shichun Liu
Jun Zhao
Yiwen Ding
Yi Lu
Zhiheng Xi
Tao Gui
Tao Gui
Xuanjing Huang
183
4
0
01 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
405
151
0
30 Mar 2024
Conceptual and Unbiased Reasoning in Language Models
Ben Zhou
Hongming Zhang
Sihao Chen
Dian Yu
Hongwei Wang
Baolin Peng
Dan Roth
Dong Yu
ReLM
LRM
ELM
262
19
0
30 Mar 2024
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning
Yongqi Tong
Dawei Li
Sizhe Wang
Yujia Wang
Fei Teng
Jingbo Shang
LRM
411
85
0
29 Mar 2024
Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Yuwen Tan
Zihan Zhang
Xiang Xiang
Ke Wang
Yuchuan Wu
Yongbin Li
LLMAG
LRM
176
9
0
29 Mar 2024
MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation
Yu Li
Shenyu Zhang
Rui Wu
Xiutian Huang
Yongrui Chen
Wenhao Xu
Guilin Qi
Dehai Min
LLMAG
172
17
0
28 Mar 2024
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Yuxuan Yao
Han Wu
Zhijiang Guo
Biyan Zhou
Jiahui Gao
Sichun Luo
Hanxu Hou
Mingwen Liu
Linqi Song
LLMAG
LRM
342
14
0
28 Mar 2024
CYCLE: Learning to Self-Refine the Code Generation
Yangruibo Ding
Marcus J. Min
Gail E. Kaiser
Baishakhi Ray
243
62
0
27 Mar 2024
IterAlign: Iterative Constitutional Alignment of Large Language Models
Xiusi Chen
Hongzhi Wen
Jiapeng Liu
Chen Luo
Qingyu Yin
Ruirui Li
Zheng Li
Wei Wang
AILaw
117
7
0
27 Mar 2024
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang
Yingpeng Du
Zhu Sun
Haoyan Chua
Kaidong Feng
Wenya Wang
Jie Zhang
LRM
KELM
230
8
0
25 Mar 2024
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Nicholas Lee
Thanakul Wattanawong
Sehoon Kim
K. Mangalam
Sheng Shen
Gopala Anumanchipalli
Michael W. Mahoney
Kurt Keutzer
A. Gholami
292
67
0
22 Mar 2024
A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning
ACM Multimedia (MM), 2024
Changmeng Zheng
Dayong Liang
Wengyu Zhang
Xiao Wei
Tat-Seng Chua
Qing Li
207
1
0
22 Mar 2024
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Kyungjae Lee
Dasol Hwang
Sunghyun Park
Youngsoo Jang
Moontae Lee
265
14
0
21 Mar 2024
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
Ahmad A Mahmood
Ashmal Vayani
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
LRM
419
11
0
21 Mar 2024
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
Huachuan Qiu
Shuai Zhang
Hongliang He
Anqi Li
Zhenzhong Lan
234
3
0
20 Mar 2024
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering
International Conference on Language Resources and Evaluation (LREC), 2024
Yuan Gao
Yiheng Zhu
Yuanbin Cao
Yinzhi Zhou
Zhen Wu
Yujie Chen
Shenglan Wu
Haoyuan Hu
Xinyu Dai
LRM
216
5
0
19 Mar 2024
Previous
1
2
3
...
24
25
26
...
32
33
34
Next