ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17651
  4. Cited By
Self-Refine: Iterative Refinement with Self-Feedback
v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
    ReLMLRMDiffM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,674 papers shown
Direct Alignment of Language Models via Quality-Aware Self-Refinement
Direct Alignment of Language Models via Quality-Aware Self-Refinement
Runsheng Yu
Yong Wang
Xiaoqi Jiao
Youzhi Zhang
James T. Kwok
203
7
0
31 May 2024
Improving Reward Models with Synthetic Critiques
Improving Reward Models with Synthetic Critiques
Zihuiwen Ye
Fraser Greenlee-Scott
Max Bartolo
Phil Blunsom
Jon Ander Campos
Matthias Gallé
ALMSyDaLRM
268
36
0
31 May 2024
Large Language Models Can Self-Improve At Web Agent Tasks
Large Language Models Can Self-Improve At Web Agent Tasks
Ajay Patel
M. Hofmarcher
Claudiu Leoveanu-Condrei
Marius-Constantin Dinu
Chris Callison-Burch
Sepp Hochreiter
LLMAG
304
44
0
30 May 2024
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in
  Code Generation
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Jingchang Chen
Hongxuan Tang
Zheng Chu
Qianglong Chen
Zekun Wang
Ming Liu
Bing Qin
256
15
0
30 May 2024
Grade Like a Human: Rethinking Automated Assessment with Large Language
  Models
Grade Like a Human: Rethinking Automated Assessment with Large Language Models
Wenjing Xie
Juxin Niu
Chun Jason Xue
Nan Guan
AI4Ed
220
14
0
30 May 2024
Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization
Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization
Yuchi Liu
Jaskirat Singh
Gaowen Liu
Ali Payani
Liang Zheng
LLMAG
285
15
0
30 May 2024
Preference Learning Algorithms Do Not Learn Preference Rankings
Preference Learning Algorithms Do Not Learn Preference Rankings
Angelica Chen
Sadhika Malladi
Lily H. Zhang
Xinyi Chen
Qiuyi Zhang
Rajesh Ranganath
Kyunghyun Cho
316
44
0
29 May 2024
A Theoretical Understanding of Self-Correction through In-context
  Alignment
A Theoretical Understanding of Self-Correction through In-context Alignment
Yifei Wang
Yuyang Wu
Zeming Wei
Stefanie Jegelka
Yisen Wang
LRM
269
51
0
28 May 2024
A Human-Like Reasoning Framework for Multi-Phases Planning Task with
  Large Language Models
A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models
Chengxing Xie
Difan Zou
LRMLLMAG
218
11
0
28 May 2024
TimeChara: Evaluating Point-in-Time Character Hallucination of
  Role-Playing Large Language Models
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
Jaewoo Ahn
Taehyun Lee
Junyoung Lim
Jin-Hwa Kim
Sangdoo Yun
Hwaran Lee
Gunhee Kim
LLMAGHILM
249
19
0
28 May 2024
Self-Guiding Exploration for Combinatorial Problems
Self-Guiding Exploration for Combinatorial Problems
Zangir Iklassov
Yali Du
Farkhad Akimov
Martin Takáč
LRM
116
15
0
28 May 2024
MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting
MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting
Hongda Sun
Hongzhan Lin
Haiyu Yan
Chen Zhu
Yang Song
Xin Gao
210
8
0
28 May 2024
Position: Foundation Agents as the Paradigm Shift for Decision Making
Position: Foundation Agents as the Paradigm Shift for Decision Making
Xiaoqian Liu
Xingzhou Lou
Jianbin Jiao
Junge Zhang
OffRLLLMAG
400
9
0
27 May 2024
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Aojun Zhou
Junting Pan
Jiaming Song
SyDa
410
12
0
27 May 2024
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff
Hao Tang
Keya Hu
Jin Peng Zhou
Sicheng Zhong
Wei-Long Zheng
Xujie Si
Kevin Ellis
197
43
0
26 May 2024
RLSF: Fine-tuning LLMs via Symbolic Feedback
RLSF: Fine-tuning LLMs via Symbolic Feedback
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
LRM
373
4
0
26 May 2024
Devil's Advocate: Anticipatory Reflection for LLM Agents
Devil's Advocate: Anticipatory Reflection for LLM Agents
Haoyu Wang
Tao Li
Zhiwei Deng
Dan Roth
Yang Li
LLMAG
520
9
0
25 May 2024
Evolutionary Large Language Model for Automated Feature Transformation
Evolutionary Large Language Model for Automated Feature Transformation
Nanxu Gong
Chandan K. Reddy
Wangyang Ying
Yanjie Fu
173
30
0
25 May 2024
Harnessing Large Language Models for Software Vulnerability Detection: A
  Comprehensive Benchmarking Study
Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study
Karl Tamberg
Hayretdin Bahsi
222
34
0
24 May 2024
Generating Code World Models with Large Language Models Guided by Monte
  Carlo Tree Search
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
Nicola Dainese
Matteo Merler
Minttu Alakuijala
Pekka Marttinen
LLMAG
263
20
0
24 May 2024
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial
  Framework Driven by Large Language Models
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yiming Chen
Chen Zhang
Danqing Luo
L. F. D’Haro
R. Tan
Haizhou Li
AAMLELM
224
3
0
23 May 2024
Reinforcing Language Agents via Policy Optimization with Action
  Decomposition
Reinforcing Language Agents via Policy Optimization with Action Decomposition
Muning Wen
Bo Liu
Weinan Zhang
Jun Wang
Ying Wen
247
12
0
23 May 2024
RaFe: Ranking Feedback Improves Query Rewriting for RAG
RaFe: Ranking Feedback Improves Query Rewriting for RAGConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Shengyu Mao
Yong Jiang
Boli Chen
Xiao Li
Peng Wang
Xinyu Wang
Pengjun Xie
Fei Huang
Huajun Chen
Ningyu Zhang
RALM
171
60
0
23 May 2024
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based
  Evaluation
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based EvaluationNeural Information Processing Systems (NeurIPS), 2024
Jingnan Zheng
Han Wang
An Zhang
Tai D. Nguyen
Jun Sun
Tat-Seng Chua
LLMAG
347
39
0
23 May 2024
Large Language Models Can Self-Correct with Minimal Effort
Large Language Models Can Self-Correct with Minimal EffortConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhenyu Wu
Qingkai Zeng
Zhihan Zhang
Zhaoxuan Tan
Chao Shen
Meng Jiang
KELMLRMReLM
276
3
0
23 May 2024
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous AgentsInternational Conference on Learning Representations (ICLR), 2024
Christopher Rawles
Sarah Clinckemaillie
Yifan Chang
Jonathan Waltz
Gabrielle Lau
...
Daniel Toyama
Robert Berry
Divya Tyamagundlu
Timothy Lillicrap
Oriana Riva
LLMAG
625
178
0
23 May 2024
Large Language Models Meet NLP: A Survey
Large Language Models Meet NLP: A Survey
Libo Qin
Qiguang Chen
Xiachong Feng
Yang Wu
Yongheng Zhang
Hai-Tao Zheng
Min Li
Wanxiang Che
Philip S. Yu
LRMALMLM&MAELM
455
119
0
21 May 2024
DOP: Diagnostic-Oriented Prompting for Large Language Models in
  Mathematical Correction
DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction
Hao Chen
Biaojie Zeng
Xin Lin
Liang He
Aimin Zhou
LRM
259
0
0
20 May 2024
The CAP Principle for LLM Serving: A Survey of Long-Context Large
  Language Model Serving
The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving
Pai Zeng
Zhenyu Ning
Jieru Zhao
Weihao Cui
Mengwei Xu
Liwei Guo
Xusheng Chen
Yizhou Shan
LLMAG
286
5
0
18 May 2024
Thinking Fair and Slow: On the Efficacy of Structured Prompts for
  Debiasing Language Models
Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Shaz Furniturewala
Surgan Jandial
Abhinav Java
Pragyan Banerjee
Simra Shahid
Sumita Bhatia
Kokil Jaidka
250
32
0
16 May 2024
Autonomous Workflow for Multimodal Fine-Grained Training Assistants
  Towards Mixed Reality
Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed RealityAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiahuan Pei
Irene Viola
Haochen Huang
Junxiao Wang
Moonisa Ahsan
...
Yao Sai
Di Wang
Zhumin Chen
Sudipta Singha Roy
Pablo César
LM&RoLLMAG
351
15
0
16 May 2024
METAREFLECTION: Learning Instructions for Language Agents using Past
  Reflections
METAREFLECTION: Learning Instructions for Language Agents using Past ReflectionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Priyanshu Gupta
Shashank Kirtania
Ananya Singha
Sumit Gulwani
Arjun Radhakrishna
Sherry Shi
Gustavo Soares
LLMAG
153
16
0
13 May 2024
MathDivide: Improved mathematical reasoning by large language models
MathDivide: Improved mathematical reasoning by large language models
S. Srivastava
Ashutosh Gandhi
LRMReLM
111
1
0
12 May 2024
AIOS Compiler: LLM as Interpreter for Natural Language Programming and
  Flow Programming of AI Agents
AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents
Shuyuan Xu
Zelong Li
Kai Mei
Zelong Li
190
10
0
11 May 2024
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-ThoughtInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Zhuoxuan Jiang
Haoyuan Peng
Shanshan Feng
Fan Li
Dongsheng Li
KELMLRM
444
28
0
09 May 2024
MIDGARD: Self-Consistency Using Minimum Description Length for
  Structured Commonsense Reasoning
MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Inderjeet Nair
Lu Wang
LRM
194
2
0
08 May 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
587
106
0
08 May 2024
Understanding the Capabilities and Limitations of Large Language Models
  for Cultural Commonsense
Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Siqi Shen
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Soujanya Poria
Amélie Reymond
AI4MHLRMELM
332
55
0
07 May 2024
Optimizing Language Model's Reasoning Abilities with Weak Supervision
Optimizing Language Model's Reasoning Abilities with Weak Supervision
Yongqi Tong
Sizhe Wang
Dawei Li
Yifan Wang
Simeng Han
Zi Lin
Chengsong Huang
Jiaxin Huang
Jingbo Shang
LRMReLM
243
13
0
07 May 2024
Fleet of Agents: Coordinated Problem Solving with Large Language Models
Fleet of Agents: Coordinated Problem Solving with Large Language Models
Akhil Arora
L. Klein
Nearchos Potamitis
Roland Aydin
Çağlar Gülçehre
Robert West
LLMAG
196
0
0
07 May 2024
Self-Improving Customer Review Response Generation Based on LLMs
Self-Improving Customer Review Response Generation Based on LLMs
Guy Azov
Tatiana Pelc
Adi Fledel Alon
Gila Kamhi
211
7
0
06 May 2024
Large Language Models Synergize with Automated Machine Learning
Large Language Models Synergize with Automated Machine Learning
Jinglue Xu
Jialong Li
Zhen Liu
Nagar Anthel Venkatesh Suryanarayanan
Guoyuan Zhou
Jia Guo
Hitoshi Iba
Kenji Tei
210
8
0
06 May 2024
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Matthew Renze
Erhan Guven
LRMLLMAG
340
70
0
05 May 2024
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large
  Language Model
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language ModelEuropean Conference on Computer Vision (ECCV), 2024
Yulin Luo
Ruichuan An
Bocheng Zou
Yiming Tang
Jiaming Liu
Shanghang Zhang
319
43
0
03 May 2024
General Purpose Verification for Chain of Thought Prompting
General Purpose Verification for Chain of Thought Prompting
Robert Vacareanu
Anurag Pratik
Evangelia Spiliopoulou
Zheng Qi
Giovanni Paolini
Neha Ann John
Jie Ma
Yassine Benajiba
Miguel Ballesteros
LRM
183
16
0
30 Apr 2024
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Parshin Shojaee
Kazem Meidani
Shashank Gupta
A. Farimani
Chandan K. Reddy
581
54
0
29 Apr 2024
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for
  Complex Problem Solving
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
Pei Chen
Boran Han
Shuai Zhang
LRMLLMAG
207
20
0
26 Apr 2024
LLMs for Generating and Evaluating Counterfactuals: A Comprehensive
  Study
LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study
Van Bach Nguyen
Paul Youssef
Jorg Schlotterer
Christin Seifert
269
29
0
26 Apr 2024
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Moontae Lee
Honglak Lee
Lu Wang
LRMKELMReLM
325
72
0
26 Apr 2024
Benchmarking Mobile Device Control Agents across Diverse Configurations
Benchmarking Mobile Device Control Agents across Diverse Configurations
Juyong Lee
Taywon Min
Minyong An
Dongyoon Hahm
Kimin Lee
Changyeon Kim
Kimin Lee
360
29
0
25 Apr 2024
Previous
123...232425...323334
Next