ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17651
  4. Cited By
Self-Refine: Iterative Refinement with Self-Feedback
v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
    ReLMLRMDiffM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,563 papers shown
Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
Aleesha Khurram
Amir Moeini
Shangtong Zhang
Rohan Chandra
76
0
0
16 Nov 2025
Genomic Next-Token Predictors are In-Context Learners
Genomic Next-Token Predictors are In-Context Learners
Nathan Breslow
Aayush Mishra
Mahler Revsine
Michael C. Schatz
Anqi Liu
Daniel Khashabi
219
0
0
16 Nov 2025
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
Raavi Gupta
Pranav Hari Panicker
S. Bhatia
Ganesh Ramakrishnan
HILM
136
2
0
15 Nov 2025
LOCA-R: Near-Perfect Performance on the Chinese Physics Olympiad 2025
LOCA-R: Near-Perfect Performance on the Chinese Physics Olympiad 2025
Dong-Shan Jian
Xiang Li
Chen-Xu Yan
Hui-Wen Zheng
Zhi-Zhang Bian
...
Bing-Rui Gong
Ren-Xi He
Jing Zhang
Ce Meng
Yan Ma
LRMELM
257
0
0
13 Nov 2025
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Yunzhe Xu
Zhuosheng Zhang
Zhe Liu
169
0
0
13 Nov 2025
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal CritiqueConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Gailun Zeng
Ziyang Luo
Hongzhan Lin
Yuchen Tian
Kaixin Li
Ziyang Gong
Jianxiong Guo
Jing Ma
121
1
0
12 Nov 2025
Chain of Summaries: Summarization Through Iterative Questioning
William Brach
Lukas Galke Poech
HILM
220
0
0
12 Nov 2025
Feedback Descent: Open-Ended Text Optimization via Pairwise Comparison
Feedback Descent: Open-Ended Text Optimization via Pairwise Comparison
Yoonho Lee
Joseph Boen
Chelsea Finn
163
1
0
11 Nov 2025
Investigating CoT Monitorability in Large Reasoning Models
Investigating CoT Monitorability in Large Reasoning Models
Shu Yang
Junchao Wu
Xilin Gou
X. Wu
Yang Li
Ninhao Liu
Di Wang
LRM
204
0
0
11 Nov 2025
Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?
Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?
Shiyan Zheng
Herun Wan
Minnan Luo
Junhang Huang
AAML
426
0
0
11 Nov 2025
General Intelligence-based Fragmentation (GIF): A framework for peak-labeled spectra simulation
General Intelligence-based Fragmentation (GIF): A framework for peak-labeled spectra simulation
Margaret R. Martin
Soha Hassoun
72
0
0
11 Nov 2025
Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging
Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging
Po-Chung Hsieh
Chin-Po Chen
Jeng-Lin Li
Ming-Ching Chang
LRM
108
0
0
11 Nov 2025
Adaptive Multi-Agent Response Refinement in Conversational Systems
Adaptive Multi-Agent Response Refinement in Conversational Systems
Soyeong Jeong
Aparna Elangovan
Emine Yilmaz
Oleg Rokhlenko
LLMAG
127
1
0
11 Nov 2025
Beyond Detection: Exploring Evidence-based Multi-Agent Debate for Misinformation Intervention and Persuasion
Beyond Detection: Exploring Evidence-based Multi-Agent Debate for Misinformation Intervention and Persuasion
Chen Han
Yijia Ma
Jin Tan
Wenzhen Zheng
Xijin Tang
200
0
0
10 Nov 2025
Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation
Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation
Keunhyeung Park
Seunguk Yu
Youngbin Kim
114
0
0
10 Nov 2025
S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning
S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous ReasoningMachine-mediated learning (ML), 2025
Jiangwen Dong
Zehui Lin
Wanyu Lin
Mingjin Zhang
LLMAGLRM
158
0
0
10 Nov 2025
FLEX: Continuous Agent Evolution via Forward Learning from Experience
FLEX: Continuous Agent Evolution via Forward Learning from Experience
Zhicheng Cai
Xinyuan Guo
Yu Pei
Jiangtao Feng
Jiangjie Chen
Ya Zhang
Wei-Ying Ma
Mingxuan Wang
Hao Zhou
Hao Zhou
CLLLLMAGLRM
278
3
0
09 Nov 2025
ScRPO: From Errors to Insights
ScRPO: From Errors to Insights
Lianrui Li
Dakuan Lu
Jiawei Shao
Chi Zhang
LRM
155
0
0
08 Nov 2025
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs
Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2025
Wei Yang
Jiacheng Pang
Shixuan Li
P. Bogdan
Stephen Tu
Jesse Thomason
LLMAG
396
1
0
08 Nov 2025
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Hiroaki Hayashi
Bo Pang
Wenting Zhao
Ye Liu
Akash Gokul
Srijan Bansal
Caiming Xiong
Semih Yavuz
Yingbo Zhou
LLMAGLM&RoLRM
304
0
0
08 Nov 2025
Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models
Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models
Teqi Hao
Xioayu Tan
Shaojie Shi
Yinghui Xu
Xihe Qiu
208
0
0
07 Nov 2025
Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning
Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning
Nick Oh
Fernand Gobet
LRM
184
1
0
06 Nov 2025
Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
Xinying Qian
Ying Zhang
Yu Zhao
Baohang Zhou
Xuhui Sui
Xiaojie Yuan
RALM
272
0
0
06 Nov 2025
Secure Code Generation at Scale with Reflexion
Secure Code Generation at Scale with Reflexion
Arup Datta
Ahmed Aljohani
Hyunsook Do
ELM
121
0
0
05 Nov 2025
The Sequential Edge: Inverse-Entropy Voting Beats Parallel Self-Consistency at Matched Compute
The Sequential Edge: Inverse-Entropy Voting Beats Parallel Self-Consistency at Matched Compute
Aman Sharma
Paras Chopra
BDLLRM
222
0
0
04 Nov 2025
ReAcTree: Hierarchical LLM Agent Trees with Control Flow for Long-Horizon Task Planning
ReAcTree: Hierarchical LLM Agent Trees with Control Flow for Long-Horizon Task Planning
Jae-Woo Choi
Hyungmin Kim
Hyobin Ong
Minsu Jang
Dohyung Kim
Jaehong Kim
Youngwoo Yoon
160
0
0
04 Nov 2025
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
Claudia Herambourg
Dawid Siuda
Julia Kopczyńska
Joao R. L. Santos
Wojciech Sas
Joanna Śmietańska-Nowak
ELMALMLRM
397
0
0
04 Nov 2025
Analyzing the Power of Chain of Thought through Memorization Capabilities
Analyzing the Power of Chain of Thought through Memorization Capabilities
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
LRMELM
208
0
0
03 Nov 2025
Context-Guided Decompilation: A Step Towards Re-executability
Context-Guided Decompilation: A Step Towards Re-executability
Xiaohan Wang
Yuxin Hu
Kevin Leach
103
0
0
03 Nov 2025
Knowledge Elicitation with Large Language Models for Interpretable Cancer Stage Identification from Pathology Reports
Knowledge Elicitation with Large Language Models for Interpretable Cancer Stage Identification from Pathology Reports
Yeawon Lee
Christopher C. Yang
Chia-Hsuan Chang
Grace Lu-Yao
55
0
0
02 Nov 2025
How Focused Are LLMs? A Quantitative Study via Repetitive Deterministic Prediction Tasks
How Focused Are LLMs? A Quantitative Study via Repetitive Deterministic Prediction Tasks
W. Hou
Leon Zhou
Hong-Ye Hu
Yi-Zhuang You
Yi-Zhuang You
Xiao-Liang Qi
LRM
367
0
0
02 Nov 2025
Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation
Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation
Song Wang
Zihan Chen
Peng Wang
Zhepei Wei
Zhen Tan
Yu Meng
Cong Shen
Jundong Li
169
1
0
01 Nov 2025
Diverse Human Value Alignment for Large Language Models via Ethical Reasoning
Diverse Human Value Alignment for Large Language Models via Ethical Reasoning
Jiahao Wang
Songkai Xue
Jinghui Li
X. Wang
112
0
0
01 Nov 2025
Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective
Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective
Zhuoyi Yang
Xu Guo
Tong Zhang
Huijuan Xu
Boyang Albert Li
LRM
149
0
0
01 Nov 2025
Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base
Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base
Yu Li
Yuan Huang
Tao Wang
Caiyu Fan
X-D Cai
...
X. Li
Weinan E
Linfeng Zhang
Zhiyuan Yao
Kun Chen
LRM
209
1
0
30 Oct 2025
CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions
CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions
Lingyue Fu
Xin Ding
Yaoming Zhu
Shao Zhang
Lin Qiu
...
W. Zhang
Xuezhi Cao
Xunliang Cai
Jiaxin Ding
Yong Yu
LLMAGELM
203
0
0
30 Oct 2025
QCoder Benchmark: Bridging Language Generation and Quantum Hardware through Simulator-Based Feedback
QCoder Benchmark: Bridging Language Generation and Quantum Hardware through Simulator-Based Feedback
Taku Mikuriya
Tatsuya Ishigaki
Masayuki Kawarada
Shunya Minami
Tadashi Kadowaki
...
Shunya Takata
Takumi Kato
Tamotsu Basseda
Reo Yamada
Hiroya Takamura
ALMELM
254
1
0
30 Oct 2025
RCScore: Quantifying Response Consistency in Large Language Models
RCScore: Quantifying Response Consistency in Large Language Models
Dongjun Jang
Youngchae Ahn
Hyopil Shin
132
0
0
30 Oct 2025
InfoFlow: Reinforcing Search Agent Via Reward Density Optimization
InfoFlow: Reinforcing Search Agent Via Reward Density Optimization
Kun Luo
Hongjin Qian
Zheng Liu
Ziyi Xia
Shitao Xiao
Siqi Bao
Jun Zhao
Kang Liu
113
0
0
30 Oct 2025
LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation
LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation
Xiangqing Zheng
Chengyue Wu
Kehai Chen
Min Zhang
DiffMVGen
205
0
0
30 Oct 2025
RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline
RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline
André V. Duarte
Xuying Li
Bin Zeng
Arlindo L. Oliveira
Lei Li
Zhuo Li
127
0
0
29 Oct 2025
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data
Kun ouyang
Haoyu Wang
Dong Fang
LLMAGAI4CE
187
0
0
29 Oct 2025
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
Fali Wang
Jihai Chen
Shuhua Yang
Runxue Bao
Tianxiang Zhao
Zhiwei Zhang
Xianfeng Tang
Hui Liu
Qi He
Suhang Wang
116
0
0
29 Oct 2025
A Survey on Efficient Large Language Model Training: From Data-centric Perspectives
A Survey on Efficient Large Language Model Training: From Data-centric PerspectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Junyu Luo
Bohan Wu
Xiao Luo
Zhiping Xiao
Yiqiao Jin
...
Nan Yin
Yifan Wang
Jingyang Yuan
Wei Ju
Ming Zhang
142
4
0
29 Oct 2025
MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
MGA: Memory-Driven GUI Agent for Observation-Centric Interaction
Weihua Cheng
Ersheng Ni
Wenlong Wang
Yifei Sun
Junming Liu
Wangyu Shen
Yirong Chen
Ding Wang
Botian Shi
LLMAGLM&Ro
281
0
0
28 Oct 2025
StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems
StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems
Qi Lin
Zhenyu Zhang
Viraj Thakkar
Zhenjie Sun
Mai Zheng
Zhichao Cao
69
1
0
28 Oct 2025
FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning
FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning
Reza Saadati Fard
Emmanuel O. Agu
Palawat Busaranuvong
Deepak Kumar
Shefalika Gautam
B. Tulu
Diane Strong
Lorraine Loretz
93
0
0
28 Oct 2025
VDSAgents: A PCS-Guided Multi-Agent System for Veridical Data Science Automation
VDSAgents: A PCS-Guided Multi-Agent System for Veridical Data Science Automation
Yunxuan Jiang
Silan Hu
Xiaoning Wang
Yuanyuan Zhang
Xiangyu Chang
128
0
0
28 Oct 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
...
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRLLRM
170
0
0
28 Oct 2025
Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading
Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading
Minkyung Kim
J. Kim
Woongcheol Yang
Sangdon Park
Sohee Bae
89
0
0
28 Oct 2025
Previous
12345...303132
Next