ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.17651
  4. Cited By
Self-Refine: Iterative Refinement with Self-Feedback
v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
    ReLMLRMDiffM
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,155 papers shown
Title
LiRA: A Multi-Agent Framework for Reliable and Readable Literature Review Generation
LiRA: A Multi-Agent Framework for Reliable and Readable Literature Review Generation
Gregory Hok Tjoan Go
Khang Ly
Anders Søgaard
Amin Tabatabaei
Maarten de Rijke
Xinyi Chen
105
0
0
01 Oct 2025
Test-Time Search in Neural Graph Coarsening Procedures for the Capacitated Vehicle Routing Problem
Test-Time Search in Neural Graph Coarsening Procedures for the Capacitated Vehicle Routing Problem
Yoonju Sim
Hyeonah Kim
Changhyun Kwon
72
0
0
01 Oct 2025
Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions
Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions
Junbeom Kim
Kyuyoung Kim
Jihoon Tack
Dongha Lim
Jinwoo Shin
MUKELM
137
1
0
30 Sep 2025
ACT: Agentic Classification Tree
ACT: Agentic Classification Tree
Vincent Grari
Tim Arni
Thibault Laugel
Sylvain Lamprier
James Zou
Marcin Detyniecki
149
0
0
30 Sep 2025
DyFlow: Dynamic Workflow Framework for Agentic Reasoning
DyFlow: Dynamic Workflow Framework for Agentic Reasoning
Yanbo Wang
Z. Xu
Yue Huang
Xiangqi Wang
Zirui Song
...
Xiangru Tang
Yue Zhao
Arman Cohan
Xiangliang Zhang
Xiuying Chen
LRMAI4CE
121
0
0
30 Sep 2025
Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models
Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models
S. Venkatraman
Vineet Jain
Sarthak Mittal
Vedant Shah
J. Obando-Ceron
...
B. Kailkhura
Guillaume Lajoie
Glen Berseth
Nikolay Malkin
Moksh Jain
ReLMAIFinLRM
188
1
0
30 Sep 2025
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Hanwen Du
Yuxin Dong
Xia Ning
LRMAI4CE
146
1
0
30 Sep 2025
TUMIX: Multi-Agent Test-Time Scaling with Tool-Use Mixture
TUMIX: Multi-Agent Test-Time Scaling with Tool-Use Mixture
Yongchao Chen
Jiefeng Chen
Rui Meng
Ji Yin
Na Li
Chuchu Fan
Chi Wang
Tomas Pfister
Jinsung Yoon
LLMAG
117
2
0
30 Sep 2025
Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling
Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling
P. Wang
B. Sun
Xuemei Dong
Yaxun Dai
Hongwei Yuan
Mengdie Chu
Yingqi Gao
Xiang Qi
Peng Zhang
Ying Yan
LMTD
283
0
0
29 Sep 2025
Learning to Ponder: Adaptive Reasoning in Latent Space
Learning to Ponder: Adaptive Reasoning in Latent Space
Yixin He
Lumingyuan Tang
LRM
82
1
0
29 Sep 2025
ContextPRM: Leveraging Contextual Coherence for multi-domain Test-Time Scaling
ContextPRM: Leveraging Contextual Coherence for multi-domain Test-Time Scaling
Haotian Zhang
Liu Liu
B. Yu
Jiayan Qiu
Likang Xiao
Yanwei Ren
Quan Chen
Xianglong Liu
LRM
84
0
0
29 Sep 2025
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Siru Ouyang
Jun Yan
I-Hung Hsu
Yanfei Chen
Ke Jiang
...
Mahsan Rofouei
Hangfei Lin
Jiawei Han
Chen-Yu Lee
Tomas Pfister
LLMAGCLLLRM
120
8
0
29 Sep 2025
PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
F. Yu
Junchi Yao
Ziyi Wang
Haiyuan Wan
Y. Huang
...
Ning Ding
Ganqu Cui
Wenlong Zhang
Wanli Ouyang
Peng Ye
LRMAI4CE
84
2
0
29 Sep 2025
SecInfer: Preventing Prompt Injection via Inference-time Scaling
SecInfer: Preventing Prompt Injection via Inference-time Scaling
Yupei Liu
Yanting Wang
Yuqi Jia
Jinyuan Jia
Neil Zhenqiang Gong
LRMSILMAAML
397
3
0
29 Sep 2025
Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Chenhui Xu
F. Yu
Michael J. Bianco
Jacob Kovarskiy
Raphael Tang
...
Rupanjali Kukal
Mikael Figueroa
Rishi Madhok
Nikolaos Karianakis
Jinjun Xiong
ObjDReLMLRM
103
0
0
29 Sep 2025
Large-Scale Constraint Generation - Can LLMs Parse Hundreds of Constraints?
Large-Scale Constraint Generation - Can LLMs Parse Hundreds of Constraints?
Matteo Boffa
Jiaxuan You
140
0
0
28 Sep 2025
How LLMs Learn to Reason: A Complex Network Perspective
How LLMs Learn to Reason: A Complex Network Perspective
Sihan Hu
X-D Cai
Yuan Huang
Zhiyuan Yao
Linfeng Zhang
Pan Zhang
Youjin Deng
Kun Chen
LRM
145
1
0
28 Sep 2025
GUI-PRA: Process Reward Agent for GUI Tasks
GUI-PRA: Process Reward Agent for GUI Tasks
Tao Xiong
Xavier Hu
Yurun Chen
Yuhang Liu
Changqiao Wu
Pengzhi Gao
Wei Liu
Jian Luan
Shengyu Zhang
LLMAG
205
0
0
27 Sep 2025
Self-Consistency as a Free Lunch: Reducing Hallucinations in Vision-Language Models via Self-Reflection
Self-Consistency as a Free Lunch: Reducing Hallucinations in Vision-Language Models via Self-Reflection
Mingfei Han
Haihong Hao
Jinxing Zhou
Zhihui Li
Yuhui Zheng
XueQing Deng
Linjie Yang
Xiaojun Chang
HILMVLM
108
0
0
27 Sep 2025
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
LAGEA: Language Guided Embodied Agents for Robotic Manipulation
Abdul Monaf Chowdhury
Akm Moshiur Rahman Mazumder
Rabeya Akter
S. Arib
LM&Ro
96
0
0
27 Sep 2025
Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error
Teaching Transformers to Solve Combinatorial Problems through Efficient Trial & Error
Panagiotis Giannoulis
Yorgos Pantis
Christos Tzamos
112
0
0
26 Sep 2025
HEART: Emotionally-driven test-time scaling of Language Models
HEART: Emotionally-driven test-time scaling of Language Models
Gabriela Pinto
Palash Goyal
Yiwen Song
Souradip Chakraborty
Zifeng Wang
Tomas Pfister
Hamid Palangi
ReLMLRM
136
0
0
26 Sep 2025
Mixture-of-Visual-Thoughts: Exploring Context-Adaptive Reasoning Mode Selection for General Visual Reasoning
Mixture-of-Visual-Thoughts: Exploring Context-Adaptive Reasoning Mode Selection for General Visual Reasoning
Zejun Li
Yingxiu Zhao
Jiwen Zhang
Siyuan Wang
Yang Yao
Runzhou Zhao
Jun Song
Bo Zheng
Zhongyu Wei
LRM
106
0
0
26 Sep 2025
PRIME: Planning and Retrieval-Integrated Memory for Enhanced Reasoning
PRIME: Planning and Retrieval-Integrated Memory for Enhanced ReasoningRemote Sensing (RS), 2025
Hieu Tran
Zonghai Yao
Nguyen Luong Tran
Zhichao Yang
Feiyun Ouyang
Shuo Han
Razieh Rahimi
Hong-ye Yu
LLMAGLRM
205
0
0
26 Sep 2025
A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Z. Wang
Boye Niu
Ruoyao Xiao
Linghui Meng
Jing Liu
Zhi Zheng
Tong Xu
H. Wu
Haifeng Wang
Enhong Chen
LRM
92
1
0
26 Sep 2025
Think Right, Not More: Test-Time Scaling for Numerical Claim Verification
Think Right, Not More: Test-Time Scaling for Numerical Claim Verification
Primakov Chungkham
Venktesh V
Vinay Setty
Avishek Anand
LRM
105
0
0
26 Sep 2025
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
Chi Ruan
Dongfu Jiang
Yubo Wang
Wenhu Chen
OffRLALMLRM
89
0
0
26 Sep 2025
InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios
InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios
Chenglin Yu
Yang Yu
Songmiao Wang
Y. Wang
Y. Yang
Jinjia Li
Ming Li
Hongxia Yang
LLMAG
169
0
0
26 Sep 2025
Few-Shot and Training-Free Review Generation via Conversational Prompting
Few-Shot and Training-Free Review Generation via Conversational Prompting
Genki Kusano
VLM
100
0
0
25 Sep 2025
A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA
A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA
Kaiyang Wan
Lang Gao
Honglin Mu
Preslav Nakov
Yuxia Wang
Xiuying Chen
LRM
88
1
0
25 Sep 2025
Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval
Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval
Vivek Bhavsar
Joseph Ereifej
Aravanan Gurusami
RALM
84
0
0
25 Sep 2025
PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases
PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases
Sri Vatsa Vuddanti
Aarav Shah
Satwik Kumar Chittiprolu
Tony Song
Sunishchal Dev
Kevin Zhu
Maheep Chaudhary
KELM
98
0
0
25 Sep 2025
Correct Reasoning Paths Visit Shared Decision Pivots
Correct Reasoning Paths Visit Shared Decision Pivots
Dongkyu Cho
Amy B.Z. Zhang
Bilel Fehri
Sheng Wang
Rumi Chunara
R. Song
Hengrui Cai
LRM
196
0
0
25 Sep 2025
Video models are zero-shot learners and reasoners
Video models are zero-shot learners and reasoners
Thaddäus Wiedemer
Yuxuan Li
Paul Vicol
Shixiang Shane Gu
Nick Matarese
Kevin Swersky
Been Kim
P. Jaini
Robert Geirhos
VLMLRM
224
47
0
24 Sep 2025
SIM-CoT: Supervised Implicit Chain-of-Thought
SIM-CoT: Supervised Implicit Chain-of-Thought
Xilin Wei
Xiaoran Liu
Yuhang Zang
Xiaoyi Dong
Yuhang Cao
Jiaqi Wang
Xipeng Qiu
Dahua Lin
LRM
190
3
0
24 Sep 2025
Calibrated Reasoning: An Explanatory Verifier for Dynamic and Efficient Problem-Solving
Calibrated Reasoning: An Explanatory Verifier for Dynamic and Efficient Problem-Solving
Anisha Garg
Engin Tekin
Yash More
David Bick
Nishit Neema
Ganesh Venkatesh
LRM
88
1
0
24 Sep 2025
Automated Multi-Agent Workflows for RTL Design
Automated Multi-Agent Workflows for RTL Design
Amulya Bhattaram
Janani Ramamoorthy
Ranit Gupta
Diana Marculescu
Dimitrios Stamoulis
132
1
0
24 Sep 2025
LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
You-Le Fang
Dong-Shan Jian
Xiang Li
Ce Meng
Ling-Shi Meng
Chen-Xu Yan
Zhi-Zhang Bian
Yan Ma
LRM
152
1
0
24 Sep 2025
CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs
CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs
Jin Young Kim
Ji Won Yoon
ReLMLRM
104
0
0
23 Sep 2025
Investigating Test-Time Scaling with Reranking for Machine Translation
Investigating Test-Time Scaling with Reranking for Machine Translation
Shaomu Tan
Ryosuke Mitani
Ritvik Choudhary
Toshiyuki Sekiya
LRM
68
2
0
23 Sep 2025
Reflect before Act: Proactive Error Correction in Language Models
Reflect before Act: Proactive Error Correction in Language Models
Qiuhai Zeng
Sarvesh Rajkumar
Di Wang
Narendra Gyanchandani
Wenbo Yan
KELMLLMAG
82
0
0
23 Sep 2025
Agentic AutoSurvey: Let LLMs Survey LLMs
Agentic AutoSurvey: Let LLMs Survey LLMs
Yixin Liu
Yonghui Wu
Denghui Zhang
Lichao Sun
AI4CE
128
1
0
23 Sep 2025
Autonomous Data Agents: A New Opportunity for Smart Data
Autonomous Data Agents: A New Opportunity for Smart Data
Yanjie Fu
Dongjie Wang
Wangyang Ying
Xiangliang Zhang
Xiangliang Zhang
Huan Liu
Jian Pei
144
3
0
23 Sep 2025
Failure Makes the Agent Stronger: Enhancing Accuracy through Structured Reflection for Reliable Tool Interactions
Failure Makes the Agent Stronger: Enhancing Accuracy through Structured Reflection for Reliable Tool Interactions
Junhao Su
Yuanliang Wan
Junwei Yang
Hengyu Shi
Tianyang Han
Junfeng Luo
Yurui Qiu
LLMAG
178
2
0
23 Sep 2025
GnnXemplar: Exemplars to Explanations -- Natural Language Rules for Global GNN Interpretability
GnnXemplar: Exemplars to Explanations -- Natural Language Rules for Global GNN Interpretability
Burouj Armgaan
Eshan Jain
Harsh Pandey
Mahesh Chandran
Jignesh M. Patel
LLMAG
281
0
0
22 Sep 2025
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
Sunhao Dai
Jiakai Tang
Jiahua Wu
Kun Wang
Yuxuan Zhu
...
Anxiang Zeng
Wenjie Wang
Xu Chen
Jun Xu
See-Kiong Ng
OffRLAI4TSLRM
124
3
0
22 Sep 2025
Multimodal Prompt Decoupling Attack on the Safety Filters in Text-to-Image Models
Multimodal Prompt Decoupling Attack on the Safety Filters in Text-to-Image Models
Xingkai Peng
Jun Jiang
Meng Tong
Shuai Li
Weiming Zhang
Nenghai Yu
Kejiang Chen
112
0
0
21 Sep 2025
From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations
From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations
Benlu Wang
Iris Xia
Yifan Zhang
Junda Wang
Feiyun Ouyang
Shuo Han
Arman Cohan
Hong-ye Yu
Zonghai Yao
ELM
120
2
0
20 Sep 2025
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
Nannan Huang
Haytham M. Fayek
Xiuzhen Zhang
96
0
0
19 Sep 2025
Generalizability of Large Language Model-Based Agents: A Comprehensive Survey
Generalizability of Large Language Model-Based Agents: A Comprehensive Survey
Minxing Zhang
Yi Yang
Roy Xie
Bhuwan Dhingra
Shuyan Zhou
Jian Pei
LLMAGLM&RoAI4CE
178
1
0
19 Sep 2025
Previous
123456...222324
Next