Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2303.17651
Cited By
v1
v2 (latest)
Self-Refine: Iterative Refinement with Self-Feedback
Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"Self-Refine: Iterative Refinement with Self-Feedback"
50 / 1,563 papers shown
Towards Self-Refinement of Vision-Language Models with Triangular Consistency
Yunlong Deng
Guangyi Chen
Tianpei Gu
Lingjing Kong
Yan Li
Zeyu Tang
Kun Zhang
168
1
0
12 Oct 2025
PrediQL: Automated Testing of GraphQL APIs with LLMs
Shaolun Liu
Sina Marefat
Omar Tsai
Yu Chen
Zecheng Deng
Jia Wang
Mohammad A. Tayebi
109
0
0
12 Oct 2025
MedAgentAudit: Diagnosing and Quantifying Collaborative Failure Modes in Medical Multi-Agent Systems
Lei Gu
Yinghao Zhu
Haoran Sang
Zixiang Wang
Dehao Sui
Wen Tang
Ewen M. Harrison
Junyi Gao
Lequan Yu
Liantao Ma
110
1
0
11 Oct 2025
Answer-Consistent Chain-of-thought Reinforcement Learning For Multi-modal Large Langauge Models
Minbin Huang
Runhui Huang
Chuanyang Zheng
Jingyao Li
Guoxuan Chen
Han Shi
Hong Cheng
KELM
LRM
122
0
0
11 Oct 2025
Failure-Driven Workflow Refinement
Jusheng Zhang
Kaitong Cai
Qinglin Zeng
Ningyuan Liu
Stephen Fan
Ziliang Chen
Keze Wang
104
11
0
11 Oct 2025
MatryoshkaThinking: Recursive Test-Time Scaling Enables Efficient Reasoning
Hongwei Chen
Yishu Lei
Dan Zhang
Bo Ke
Danxiang Zhu
...
Shikun Feng
Jingzhou He
Yu Sun
Hua Wu
Haifeng Wang
ReLM
LRM
132
0
0
11 Oct 2025
Mitigating Hallucination in Multimodal Reasoning via Functional Attention Control
H. Lu
Bolun Chu
Weiye Fu
Guoshun Nan
Junning Liu
Minghui Pan
Qiankun Li
Yi Yu
Hua Wang
Kun Wang
LRM
129
0
0
11 Oct 2025
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo
Habtom Kahsay Gidey
Alexander Lenz
Alois Knoll
LLMAG
LM&Ro
204
2
0
10 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning
Sicong Huang
Qianqi Yan
Shengze Wang
Ian Lane
HILM
161
0
0
10 Oct 2025
MEC
3
^3
3
O: Multi-Expert Consensus for Code Time Complexity Prediction
Joonghyuk Hahn
Soohan Lim
Yo-Sub Han
104
0
0
10 Oct 2025
Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise
Keno Harada
Lui Yoshida
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
106
0
0
10 Oct 2025
Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
Lianhao Zhou
Hongyi Ling
Cong Fu
Yepeng Huang
Michael Sun
...
X. Qian
Heng Ji
Wei Wang
Marinka Zitnik
Shuiwang Ji
LLMAG
LM&Ro
AI4CE
179
3
0
10 Oct 2025
PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting
Houqiang Zhong
Zhenglong Wu
Sihua Fu
Zihan Zheng
Xin Jin
X. Zhang
Li Song
Q. Hu
3DGS
113
5
0
09 Oct 2025
Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains
Yilun Zhang
Dexing Kong
LRM
52
0
0
09 Oct 2025
Agent Learning via Early Experience
Kai Zhang
Xiangchao Chen
Bo Liu
Tianci Xue
Zeyi Liao
...
J. Zhu
Huan Sun
Jason Weston
Eric Fosler-Lussier
Y. Wu
OffRL
195
6
0
09 Oct 2025
Training-Free Group Relative Policy Optimization
Yuzheng Cai
Siqi Cai
Yuchen Shi
Zihan Xu
Lichao Chen
...
Zongyi Li
Haojia Lin
Yong Mao
Ke Li
Xing Sun
OffRL
227
4
0
09 Oct 2025
MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding
Siddeshwar Raghavan
Tanwi Mallick
AI4CE
133
0
0
09 Oct 2025
ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation
Haitao Jia
Ming He
Zimo Yin
Likang Wu
Jianping Fan
Jitao Sang
112
0
0
09 Oct 2025
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
Yunzhe Xu
Yiyuan Pan
Zhe Liu
LM&Ro
85
0
0
09 Oct 2025
COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context
Guangya Wan
Mingyang Ling
Xiaoqi Ren
Rujun Han
Sheng Li
Zizhao Zhang
LLMAG
LRM
101
1
0
09 Oct 2025
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
Jingyuan Wang
Yankai Chen
Zhonghang Li
Chao Huang
LRM
96
0
0
09 Oct 2025
Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation
Shiyuan Yin
Chenjia Bai
Z. Zhang
Junwei Jin
Xinxin Zhang
Chi Zhang
Xuelong Li
115
0
0
09 Oct 2025
BG-FlipIn: A Bayesian game framework for FlipIt-insider models in advanced persistent threats
Yang Jiao
Guanpu Chen
Yiguang Hong
AAML
98
0
0
08 Oct 2025
Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models
Jonggeun Lee
Woojung Song
Jongwook Han
Haesung Pyun
Yohan Jo
CLL
211
0
0
08 Oct 2025
Inspection Planning Primitives with Implicit Models
Jingyang You
Hanna Kurniawati
Lashika Medagoda
108
2
0
08 Oct 2025
AgentAsk: Multi-Agent Systems Need to Ask
Bohan Lin
Kuo Yang
Yingchuan Lai
Yudong Zhang
Chen Zhang
G. Zhang
Xinlei Yu
Miao Yu
Xu Wang
Yang-Feng Wang
114
0
0
08 Oct 2025
MAPRO: Recasting Multi-Agent Prompt Optimization as Maximum a Posteriori Inference
Zheyuan Zhang
Lin Ge
Hongjiang Li
Weicheng Zhu
Chuxu Zhang
Yanfang Ye
LLMAG
128
1
0
08 Oct 2025
ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
Bohan Yao
Shiva Krishna Reddy Malay
Vikas Yadav
LM&Ro
LRM
152
0
0
07 Oct 2025
RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases
Lang Qin
Zijian Gan
Xu Cao
Pengcheng Jiang
Yankai Jiang
Jiawei Han
Kaishun Wu
Jintai Chen
LRM
164
0
0
07 Oct 2025
Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding
Haomiao Chen
K. Jamison
M. Sabuncu
Amy Kuceyeski
140
1
0
07 Oct 2025
Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails
Siwei Han
Jiaqi Liu
Yaofeng Su
Wenbo Duan
Xinyuan Liu
Cihang Xie
Mohit Bansal
Mingyu Ding
Linjun Zhang
Huaxiu Yao
133
1
0
06 Oct 2025
AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems
Shambhavi Mishra
Gaurav Sahu
M. Pedersoli
Laurent Charlin
Jose Dolz
Christopher Pal
LRM
96
0
0
06 Oct 2025
Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading
Zifan Song
Kaitao Song
Guosheng Hu
Ding Qi
Junyao Gao
Xiaohua Wang
Dongsheng Li
Cairong Zhao
AIFin
128
1
0
06 Oct 2025
Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization
Mohammad Mahdi Samiei Paqaleh
Arash Marioriyad
Arman Tahmasebi-Zadeh
Mohamadreza Fereydooni
Mahdi Ghaznavai
Mahdieh Soleymani Baghshah
120
0
0
06 Oct 2025
Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari
Hamzah Luqman
HILM
LRM
457
1
0
05 Oct 2025
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
Zhanke Zhou
Chentao Cao
Xiao Feng
Xuan Li
Zongze Li
...
Brando Miranda
Tongliang Liu
Sanmi Koyejo
Masashi Sugiyama
Bo Han
ReLM
LRM
112
0
0
05 Oct 2025
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Hadi Nekoei
Aman Jaiswal
Patrice Béchard
Oleh Shliazhko
Orlando Marquez Ayala
Mathieu Reymond
Massimo Caccia
Alexandre Drouin
Sarath Chandar
Alexandre Lacoste
KELM
124
0
0
05 Oct 2025
SPOGW: a Score-based Preference Optimization method via Group-Wise comparison for workflows
Yitong Cui
Liu Liu
B. Yu
Jiayan Qiu
Xikai Zhang
Likang Xiao
Y. Liu
Quan Chen
153
0
0
05 Oct 2025
Searching Meta Reasoning Skeleton to Guide LLM Reasoning
Ziying Zhang
Yaqing Wang
Quanming Yao
LRM
191
1
0
05 Oct 2025
Utility-Learning Tension in Self-Modifying Agents
Charles L. Wang
Keir Dorchen
Peter Jin
127
0
0
05 Oct 2025
A global log for medical AI
Ayush Noori
Adam Rodman
Alan Karthikesalingam
Bilal A. Mateen
Christopher A. Longhurst
...
Noa Dagan
David Clifton
Ran D. Balicer
I. Kohane
Marinka Zitnik
171
0
0
05 Oct 2025
LLM Chemistry Estimation for Multi-LLM Recommendation
H. Sánchez
Briland Hitaj
121
1
0
04 Oct 2025
Adversarial Agent Collaboration for C to Rust Translation
Tianyu Li
Ruishi Li
Bo Wang
Brandon Paulsen
Umang Mathur
Prateek Saxena
152
2
0
04 Oct 2025
Self-Anchor: Large Language Model Reasoning via Step-by-step Attention Alignment
Hongxiang Zhang
Yuan Tian
Tianyi Zhang
LRM
94
1
0
03 Oct 2025
AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models
Bo Ma
Hang Li
ZeHua Hu
XiaoFan Gui
LuYao Liu
Simon Liu
LLMAG
LM&Ro
AI4CE
160
0
0
03 Oct 2025
Lang-PINN: From Language to Physics-Informed Neural Networks via a Multi-Agent Framework
Xin He
Liangliang You
Hongduan Tian
Bo Han
Ivor Tsang
Yew-Soon Ong
PINN
AI4CE
214
1
0
03 Oct 2025
Self-Reflective Generation at Test Time
Jian Mu
Qixin Zhang
Zhiyong Wang
Menglin Yang
Shuang Qiu
Chengwei Qin
Zhongxiang Dai
Yao Shu
LRM
139
1
0
03 Oct 2025
Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation
Faruk Alpay
Hamdi Alakkad
58
0
0
03 Oct 2025
CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
Zhenwen Liang
Ruosen Li
Yujun Zhou
Linfeng Song
Dian Yu
Xinya Du
Haitao Mi
Dong Yu
121
1
0
02 Oct 2025
Towards Interpretable and Inference-Optimal COT Reasoning with Sparse Autoencoder-Guided Generation
Daniel Zhao
Abhilash Shankarampeta
Lanxiang Hu
Tajana Rosing
Hao Zhang
LRM
108
0
0
02 Oct 2025
Previous
1
2
3
4
5
...
30
31
32
Next