ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17651
  4. Cited By
Self-Refine: Iterative Refinement with Self-Feedback
v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
    ReLMLRMDiffM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,156 papers shown
Title
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
542
101
0
08 May 2024
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Matthew Renze
Erhan Guven
LRMLLMAG
280
69
0
05 May 2024
General Purpose Verification for Chain of Thought Prompting
General Purpose Verification for Chain of Thought Prompting
Robert Vacareanu
Anurag Pratik
Evangelia Spiliopoulou
Zheng Qi
Giovanni Paolini
Neha Ann John
Jie Ma
Yassine Benajiba
Miguel Ballesteros
LRM
139
15
0
30 Apr 2024
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Parshin Shojaee
Kazem Meidani
Shashank Gupta
A. Farimani
Chandan K. Reddy
470
52
0
29 Apr 2024
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Moontae Lee
Honglak Lee
Lu Wang
LRMKELMReLM
279
71
0
26 Apr 2024
Benchmarking Mobile Device Control Agents across Diverse Configurations
Benchmarking Mobile Device Control Agents across Diverse Configurations
Juyong Lee
Taywon Min
Minyong An
Dongyoon Hahm
Kimin Lee
Changyeon Kim
Kimin Lee
292
29
0
25 Apr 2024
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
553
46
0
24 Apr 2024
NExT: Teaching Large Language Models to Reason about Code Execution
NExT: Teaching Large Language Models to Reason about Code Execution
Ansong Ni
Miltiadis Allamanis
Arman Cohan
Yinlin Deng
Kensen Shi
Charles Sutton
Pengcheng Yin
ReLMLRM
239
60
0
23 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRMAIMat
449
6
0
23 Apr 2024
iTBLS: A Dataset of Interactive Conversations Over Tabular Information
iTBLS: A Dataset of Interactive Conversations Over Tabular Information
Anirudh S. Sundar
Christopher Richardson
William Gay
Larry Heck
LMTD
317
3
0
19 Apr 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and
  Criticizing
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian
Baolin Peng
Linfeng Song
Lifeng Jin
Dian Yu
Haitao Mi
Dong Yu
LRMReLM
219
123
0
18 Apr 2024
Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning:
  A Comparative Study
Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study
Zooey Nguyen
Anthony Annunziata
Vinh Luong
Sang Dinh
Quynh Le
Anh Hai Ha
Chanh Le
Hong An Phan
Shruti Raghavan
Christopher Nguyen
LRM
141
7
0
17 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive
  Thinking
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLMLRM
290
12
0
14 Apr 2024
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in
  Large Language Models
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li
Chenghao Yang
Allyson Ettinger
ReLMLRMLLMAG
141
15
0
14 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent
  Deliberation
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Luan Tuyen Chau
LLMAG
456
14
0
14 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path
  Forward
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie
Yuheng Huang
Zhehua Zhou
Yuheng Huang
Da Song
Lei Ma
OffRL
353
12
0
12 Apr 2024
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jinheon Baek
S. Jauhar
Silviu Cucerzan
Sung Ju Hwang
AI4CELLMAGLM&Ro
322
101
0
11 Apr 2024
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
Dayeon Ki
Marine Carpuat
213
32
0
11 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from
  Interleaved Multimodal Inputs
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen3DV
368
6
0
05 Apr 2024
Personalized LLM Response Generation with Parameterized Memory Injection
Personalized LLM Response Generation with Parameterized Memory Injection
Kai Zhang
Lizhi Qing
Yangyang Kang
305
20
0
04 Apr 2024
A Survey on Large Language Model-Based Game Agents
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
AI4CELLMAGLM&RoLM&MA
640
106
0
02 Apr 2024
Large Language Models are Capable of Offering Cognitive Reappraisal, if
  Guided
Large Language Models are Capable of Offering Cognitive Reappraisal, if Guided
Hongli Zhan
Allen Zheng
Yoon Kyung Lee
Jina Suh
Junyi Jessy Li
Desmond C. Ong
AI4MH
213
18
0
01 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAGKELMOffRLLM&Ro
352
146
0
30 Mar 2024
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to
  Boost for Reasoning
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning
Yongqi Tong
Dawei Li
Sizhe Wang
Yujia Wang
Fei Teng
Jingbo Shang
LRM
352
79
0
29 Mar 2024
Enhancing the General Agent Capabilities of Low-Parameter LLMs through
  Tuning and Multi-Branch Reasoning
Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Yuwen Tan
Zihan Zhang
Xiang Xiang
Ke Wang
Yuchuan Wu
Yongbin Li
LLMAGLRM
143
8
0
29 Mar 2024
MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended
  Text Evaluation
MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation
Yu Li
Shenyu Zhang
Rui Wu
Xiutian Huang
Yongrui Chen
Wenhao Xu
Guilin Qi
Dehai Min
LLMAG
145
16
0
28 Mar 2024
Re2LLM: Reflective Reinforcement Large Language Model for Session-based
  Recommendation
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang
Yingpeng Du
Zhu Sun
Haoyan Chua
Kaidong Feng
Wenya Wang
Jie Zhang
LRMKELM
202
8
0
25 Mar 2024
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
Ahmad A Mahmood
Ashmal Vayani
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
LRM
394
11
0
21 Mar 2024
Facilitating Pornographic Text Detection for Open-Domain Dialogue
  Systems via Knowledge Distillation of Large Language Models
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
Huachuan Qiu
Shuai Zhang
Hongliang He
Anqi Li
Zhenzhong Lan
214
2
0
20 Mar 2024
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open
  Domain Multi-Hop Question Answering
Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question AnsweringInternational Conference on Language Resources and Evaluation (LREC), 2024
Yuan Gao
Yiheng Zhu
Yuanbin Cao
Yinzhi Zhou
Zhen Wu
Yujie Chen
Shenglan Wu
Haoyuan Hu
Xinyu Dai
LRM
179
5
0
19 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
Erfan Shayegani
PILM
385
46
0
19 Mar 2024
SMART: Submodular Data Mixture Strategy for Instruction Tuning
SMART: Submodular Data Mixture Strategy for Instruction TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
190
11
0
13 Mar 2024
Large Language Models are Contrastive Reasoners
Large Language Models are Contrastive Reasoners
Liang Yao
ReLMELMLRM
298
8
0
13 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large
  Language Models for Code
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for CodeInternational Conference on Learning Representations (ICLR), 2024
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
421
894
0
12 Mar 2024
The pitfalls of next-token prediction
The pitfalls of next-token predictionInternational Conference on Machine Learning (ICML), 2024
Gregor Bachmann
Vaishnavh Nagarajan
409
130
0
11 Mar 2024
Exploring LLM-based Agents for Root Cause Analysis
Exploring LLM-based Agents for Root Cause Analysis
Devjeet Roy
Xuchao Zhang
Rashi Bhave
Chetan Bansal
P. Las-Casas
Rodrigo Fonseca
Saravan Rajmohan
208
66
0
07 Mar 2024
Evaluating and Optimizing Educational Content with Large Language Model
  Judgments
Evaluating and Optimizing Educational Content with Large Language Model Judgments
Joy He-Yueya
Noah D. Goodman
Emma Brunskill
AI4Ed
225
12
0
05 Mar 2024
Socratic Reasoning Improves Positive Text Rewriting
Socratic Reasoning Improves Positive Text Rewriting
Anmol Goel
Nico Daheim
Iryna Gurevych
Iryna Gurevych
LRM
303
6
0
05 Mar 2024
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of
  LLMs as Mathematical Problem Solvers
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
LRM
303
104
0
29 Feb 2024
ToolNet: Connecting Large Language Models with Massive Tools via Tool
  Graph
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph
Xukun Liu
Zhiyuan Peng
Xiaoyuan Yi
Xing Xie
Lirong Xiang
Yuchen Liu
Dongkuan Xu
CLLLLMAG
151
42
0
29 Feb 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems
  in Commonsense Reasoning
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Daojian Zeng
Kang Liu
Jun Zhao
LRM
246
17
0
28 Feb 2024
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the
  Key?
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
Qineng Wang
Zihao Wang
Ying Su
Hanghang Tong
Yangqiu Song
LLMAGLRM
322
131
0
28 Feb 2024
MEGAnno+: A Human-LLM Collaborative Annotation System
MEGAnno+: A Human-LLM Collaborative Annotation System
H. Kim
Kushan Mitra
Rafael Li Chen
Sajjadur Rahman
Dan Zhang
269
49
0
28 Feb 2024
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical
  Reasoning
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Debrup Das
Debopriyo Banerjee
Somak Aditya
Ashish Kulkarni
ReLMLRM
258
33
0
27 Feb 2024
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Ante Wang
Linfeng Song
Baolin Peng
Ye Tian
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
HILMLRM
128
9
0
23 Feb 2024
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich
  Reasoning
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning
Hanqi Yan
Qinglin Zhu
Xinyu Wang
Lin Gui
Yulan He
LRMLLMAG
154
11
0
22 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
370
78
0
22 Feb 2024
COPR: Continual Human Preference Learning via Optimal Policy
  Regularization
COPR: Continual Human Preference Learning via Optimal Policy Regularization
Han Zhang
Lin Gui
Yu Lei
Yuanzhao Zhai
Yehong Zhang
...
Hui Wang
Yue Yu
Kam-Fai Wong
Bin Liang
Ruifeng Xu
CLL
204
6
0
22 Feb 2024
Making Reasoning Matter: Measuring and Improving Faithfulness of
  Chain-of-Thought Reasoning
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul
Robert West
Antoine Bosselut
Boi Faltings
ReLMLRM
354
75
0
21 Feb 2024
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Zhaorui Yang
Tianyu Pang
Hao Feng
Han Wang
Wei Chen
Minfeng Zhu
Qian Liu
ALM
266
76
0
21 Feb 2024
Previous
123...192021222324
Next