ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17651
  4. Cited By
Self-Refine: Iterative Refinement with Self-Feedback
v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
    ReLMLRMDiffM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,172 papers shown
Title
Learning to Orchestrate Agents in Natural Language with the Conductor
Learning to Orchestrate Agents in Natural Language with the Conductor
Stefan Nielsen
Edoardo Cetin
Peter Schwendeman
Qi Sun
Jinglue Xu
Yujin Tang
LLMAG
56
0
0
04 Dec 2025
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
Yue Yu
Qiwei Di
Quanquan Gu
Dongruo Zhou
BDL
141
0
0
04 Dec 2025
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
NaHyeon Park
Namin An
Kunhee Kim
Soyeon Yoon
Jiahao Huo
Hyunjung Shim
VLM
56
0
0
04 Dec 2025
Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Gianni Molinari
Fabio Ciravegna
32
0
0
03 Dec 2025
PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks
Yuki Orimo
Iori Kurata
Hodaka Mori
Ryuhei Okuno
Ryohto Sawada
Daisuke Okanohara
120
1
0
03 Dec 2025
Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment
Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment
Huy Nghiem
Swetasudha Panda
Devashish Khatwani
Huy Nguyen
Krishnaram Kenthapadi
Hal Daumé III
LM&MA
76
0
0
03 Dec 2025
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning
Salman Rahman
Sruthi Gorantla
Arpit Gupta
Swastik Roy
Nanyun Peng
Yang Liu
OffRLLRM
134
0
0
02 Dec 2025
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAGMoE
108
0
0
02 Dec 2025
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
Yuanhe Zhang
Weiliu Wang
Zhenhong Zhou
Kun Wang
Jie Zhang
Li Sun
Yang Liu
Sen Su
116
1
0
02 Dec 2025
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers
Jack Lu
Ryan Teehan
Jinran Jin
Mengye Ren
LRM
136
0
0
02 Dec 2025
Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents
Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents
Zijie Lin
Qilin Cai
Liang Shen
Mingjun Xiao
48
0
0
02 Dec 2025
COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis
COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis
Tsz-To Wong
Ching-Chun Huang
Hong-Han Shuai
AI4TS
328
0
0
01 Dec 2025
DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks
DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks
Hyunjun Kim
Sooyoung Ryu
56
0
0
01 Dec 2025
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning
Yang Li
Z. He
Y. Huang
Zhuhanling Xiao
Chao Yu
Meng Fang
Kun Shao
Jun Wang
LRMVLM
137
0
0
28 Nov 2025
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
Aayush Garg
Zanis Ali Khan
Renzo Degiovanni
Qiang Tang
AAML
116
0
0
28 Nov 2025
Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent
Jianzhe Lin
Zeyu Pan
Yun Zhu
Ruiqi Song
Jining Yang
LRM
96
0
0
28 Nov 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
145
0
0
28 Nov 2025
ThetaEvolve: Test-time Learning on Open Problems
ThetaEvolve: Test-time Learning on Open Problems
Y. Wang
Shao-Rong Su
Zhiyuan Zeng
Eva Xu
Liliang Ren
...
Pengcheng He
Weizhu Chen
Shuohang Wang
S. Du
Yelong Shen
180
0
0
28 Nov 2025
Real-Time Procedural Learning From Experience for AI Agents
Real-Time Procedural Learning From Experience for AI Agents
Dasheng Bi
Yubin Hu
Mohammed N. Nasir
44
0
0
27 Nov 2025
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA
Ahmad Mohammadshirazi
Pinaki Prasad Guha Neogi
Dheeraj Kulshrestha
R. Ramnath
VGen
104
0
0
27 Nov 2025
TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning
TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning
Qingtao Yu
Changlin Song
Minghao Sun
Zhengyang Yu
Vinay Kumar Verma
Soumya Roy
Sumit Negi
Hongdong Li
Dylan Campbell
76
0
0
27 Nov 2025
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
Young-Jun Lee
Seungone Kim
Byung-Kwan Lee
Minkyeong Moon
Yechan Hwang
Jong Myoung Kim
Graham Neubig
Sean Welleck
Ho-Jin Choi
ReLMLRM
176
2
0
27 Nov 2025
Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information
Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information
Lukas Struppek
Dominik Hintersdorf
Hannah Struppek
Daniel Neider
Kristian Kersting
LRM
84
0
0
27 Nov 2025
On the Limits of Innate Planning in Large Language Models
On the Limits of Innate Planning in Large Language Models
Charles Schepanowski
Charles Ling
LLMAGLRMELM
421
0
0
26 Nov 2025
BAMAS: Structuring Budget-Aware Multi-Agent Systems
BAMAS: Structuring Budget-Aware Multi-Agent Systems
Liming Yang
Junyu Luo
Xuanzhe Liu
Yiling Lou
Zhenpeng Chen
LLMAG
303
0
0
26 Nov 2025
EWE: An Agentic Framework for Extreme Weather Analysis
EWE: An Agentic Framework for Extreme Weather Analysis
Zhe Jiang
Jiong Wang
Xiaoyu Yue
Zijie Guo
Wenlong Zhang
Fenghua Ling
Wanli Ouyang
L. Bai
136
1
0
26 Nov 2025
A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization
A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization
Ke Chen
Yifeng Wang
Hassan Almosapeeh
Haohan Wang
148
0
0
25 Nov 2025
More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering
More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering
Duc Anh Vu
T. Nguyen
Cong-Duy Nguyen
Viet-Anh Nguyen
Anh Tuan Luu
FaMLLRM
318
0
0
25 Nov 2025
Nonparametric Instrumental Variable Regression with Observed Covariates
Nonparametric Instrumental Variable Regression with Observed Covariates
Zikai Shen
Zonghao Chen
Dimitri Meunier
Ingo Steinwart
Arthur Gretton
Zhu Li
76
0
0
24 Nov 2025
Majority of the Bests: Improving Best-of-N via Bootstrapping
Majority of the Bests: Improving Best-of-N via Bootstrapping
Amin Rakhsha
Kanika Madan
Tianyu Zhang
Amir-massoud Farahmand
Amir Khasahmadi
124
0
0
23 Nov 2025
$A^2Flow:$ Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators
A2Flow:A^2Flow:A2Flow: Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators
Mingming Zhao
Xiaokang Wei
Yuanqi Shao
Kaiwen Zhou
Lin Yang
Siwei Rao
Junhui Zhan
Zhitang Chen
82
0
0
23 Nov 2025
SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization
SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization
Jianghao Wu
Yasmeen George
Jin Ye
Y. Wu
Daniel F. Schmidt
Jianfei Cai
LRM
76
0
0
22 Nov 2025
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures
Yunsheng Bai
Haoxing Ren
92
0
0
21 Nov 2025
Budget-Aware Tool-Use Enables Effective Agent Scaling
Budget-Aware Tool-Use Enables Effective Agent Scaling
Tengxiao Liu
Zifeng Wang
Jin Miao
I-Hung Hsu
Jun Yan
...
Samira Daruki
Yi Liang
William Y. Wang
Tomas Pfister
Chen-Yu Lee
216
0
0
21 Nov 2025
MultiGA: Leveraging Multi-Source Seeding in Genetic Algorithms
MultiGA: Leveraging Multi-Source Seeding in Genetic Algorithms
Isabelle Diana May-Xin Ng
Tharindu Cyril Weerasooriya
Haitao Zhu
Wei Wei
0
0
0
21 Nov 2025
PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization
Huseein Jawad
Nicolas Brunel
AAML
140
0
0
20 Nov 2025
InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution
Kefan Li
Mengfei Wang
Hengzhi Zhang
Zhichao Li
Yuan Yuan
Mu Li
X. Gao
Hailong Sun
Chunming Hu
Weifeng Lv
120
0
0
20 Nov 2025
SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning
Wei Xia
Zhi-Hong Deng
ALM
249
0
0
20 Nov 2025
AutoBackdoor: Automating Backdoor Attacks via LLM Agents
AutoBackdoor: Automating Backdoor Attacks via LLM Agents
Y. Li
Z. Li
Wei Zhao
Nay Myat Min
Hanxun Huang
Xingjun Ma
Jun Sun
AAMLLLMAGSILM
358
0
0
20 Nov 2025
From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs
From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs
Xiaoxuan Wang
Bo Liu
Song Jiang
Jingzhou Liu
Jingyuan Qi
Xia Chen
Baosheng He
LRM
164
0
0
19 Nov 2025
Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions
Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions
Shan Shan
76
0
0
19 Nov 2025
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
Chao Yu
Qixin Tan
Jiaxuan Gao
Shi Yu
Hong Lu
Xinting Yang
Zelai Xu
Yu Wang
Yi Wu
Eugene Vinitsky
LRM
124
0
0
18 Nov 2025
SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification
SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification
Xiangyu Li
Zhaomiao Guo
123
0
0
18 Nov 2025
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Sadegh Mahdavi
Branislav Kisacanin
Shubham Toshniwal
Wei Du
Ivan Moshkov
George Armstrong
Renjie Liao
Christos Thrampoulidis
Igor Gitman
ALMLRM
261
2
0
17 Nov 2025
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving
Enhui Ma
Lijun Zhou
Tao Tang
Jiahuan Zhang
Junpeng Jiang
...
Xianpeng Lang
Haiyang Sun
Xia Zhou
Di Lin
Kaicheng Yu
225
0
0
17 Nov 2025
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models
Wenxin Zhu
Andong Chen
Yuchen Song
Kehai Chen
Conghui Zhu
Ziyan Chen
Tiejun Zhao
LRM
434
0
0
17 Nov 2025
Cost-Driven Synthesis of Sound Abstract Interpreters
Cost-Driven Synthesis of Sound Abstract Interpreters
Qiuhan Gu
Avaljot Singh
Gagandeep Singh
76
0
0
17 Nov 2025
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding
Jiaze Li
Hao Yin
Wenhui Tan
Jingyang Chen
Boshen Xu
Yuxun Qu
Yijing Chen
Jianzhong Ju
Zhenbo Luo
Jian Luan
LRMVLM
230
1
0
17 Nov 2025
Dynamic Template Selection for Output Token Generation Optimization: MLP-Based and Transformer Approaches
Dynamic Template Selection for Output Token Generation Optimization: MLP-Based and Transformer Approaches
Bharadwaj Yadavalli
187
0
0
17 Nov 2025
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
Harold Haodong Chen
Disen Lan
Wen-Jie Shu
Qingyang Liu
Zihan Wang
...
Hongfei Zhang
Zixin Zhang
Rongjin Guo
Yu Cheng
Ying-Cong Chen
VGenLRM
303
2
0
17 Nov 2025
1234...222324
Next