Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2303.17651
Cited By

Self-Refine: Iterative Refinement with Self-Feedback

v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Skyler Hallinan

Sarah Wiegreffe

Shrimai Prabhumoye

Bodhisattwa Prasad Majumder

Katherine Hermann

Amir Yazdanbakhsh

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,676 papers shown

Learning to Orchestrate Agents in Natural Language with the Conductor

Learning to Orchestrate Agents in Natural Language with the Conductor

Peter Schwendeman

108

1

0

04 Dec 2025

Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models

Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models

109

0

0

04 Dec 2025

On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference

On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference

184

0

0

04 Dec 2025

Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment

Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment

Swetasudha Panda

Devashish Khatwani

Krishnaram Kenthapadi

133

0

0

03 Dec 2025

PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks

PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks

Daisuke Okanohara

162

1

0

03 Dec 2025

Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks

Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks

Gianni Molinari

Fabio Ciravegna

45

0

0

03 Dec 2025

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Sruthi Gorantla

156

0

0

02 Dec 2025

WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate

WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate

Kuan-Chuan Peng

120

0

0

02 Dec 2025

When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers

When Does Verification Pay Off? A Closer Look at LLMs as Solution Verifiers

155

0

0

02 Dec 2025

LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems

LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems

128

1

0

02 Dec 2025

Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents

Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents

62

0

0

02 Dec 2025

DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks

DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks

68

0

0

01 Dec 2025

COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis

COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis

Ching-Chun Huang

370

0

0

01 Dec 2025

Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent

136

0

0

28 Nov 2025

Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models

Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models

193

0

0

28 Nov 2025

Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities

Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities

Renzo Degiovanni

135

0

0

28 Nov 2025

ThetaEvolve: Test-time Learning on Open Problems

ThetaEvolve: Test-time Learning on Open Problems

...

248

0

0

28 Nov 2025

Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning

Adapting Like Humans: A Metacognitive Agent with Test-time Reasoning

Zhuhanling Xiao

159

0

0

28 Nov 2025

Real-Time Procedural Learning From Experience for AI Agents

Real-Time Procedural Learning From Experience for AI Agents

Mohammed N. Nasir

86

0

0

27 Nov 2025

Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

Dominik Hintersdorf

Hannah Struppek

Kristian Kersting

104

0

0

27 Nov 2025

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Jong Myoung Kim

206

2

0

27 Nov 2025

DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA

DocVAL: Validated Chain-of-Thought Distillation for Grounded Document VQA

Ahmad Mohammadshirazi

Pinaki Prasad Guha Neogi

Dheeraj Kulshrestha

138

0

0

27 Nov 2025

TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning

TTSnap: Test-Time Scaling of Diffusion Models via Noise-Aware Pruning

Vinay Kumar Verma

96

0

0

27 Nov 2025

BAMAS: Structuring Budget-Aware Multi-Agent Systems

BAMAS: Structuring Budget-Aware Multi-Agent Systems

328

0

0

26 Nov 2025

On the Limits of Innate Planning in Large Language Models

On the Limits of Innate Planning in Large Language Models

Charles Schepanowski

438

0

0

26 Nov 2025

EWE: An Agentic Framework for Extreme Weather Analysis

EWE: An Agentic Framework for Extreme Weather Analysis

167

1

0

26 Nov 2025

A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization

A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization

Hassan Almosapeeh

157

0

0

25 Nov 2025

More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering

More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering

Cong-Duy Nguyen

Viet-Anh Nguyen

343

0

0

25 Nov 2025

Nonparametric Instrumental Variable Regression with Observed Covariates

Nonparametric Instrumental Variable Regression with Observed Covariates

Dimitri Meunier

81

0

0

24 Nov 2025

Majority of the Bests: Improving Best-of-N via Bootstrapping

Majority of the Bests: Improving Best-of-N via Bootstrapping

Amir-massoud Farahmand

Amir Khasahmadi

145

0

0

23 Nov 2025

$A^2Flow:$ Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators

A^2Flow:

Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators

124

0

0

23 Nov 2025

SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization

SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization

Daniel F. Schmidt

110

0

0

22 Nov 2025

Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures

Learning to Debug: LLM-Organized Knowledge Trees for Solving RTL Assertion Failures

105

0

0

21 Nov 2025

Budget-Aware Tool-Use Enables Effective Agent Scaling

Budget-Aware Tool-Use Enables Effective Agent Scaling

...

William Y. Wang

228

4

0

21 Nov 2025

MultiGA: Leveraging Multi-Source Seeding in Genetic Algorithms

MultiGA: Leveraging Multi-Source Seeding in Genetic Algorithms

Isabelle Diana May-Xin Ng

Tharindu Cyril Weerasooriya

55

0

0

21 Nov 2025

InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution

134

0

0

20 Nov 2025

AutoBackdoor: Automating Backdoor Attacks via LLM Agents

AutoBackdoor: Automating Backdoor Attacks via LLM Agents

AAML LLMAG SILM

396

1

0

20 Nov 2025

SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning

278

0

0

20 Nov 2025

PSM: Prompt Sensitivity Minimization via LLM-Guided Black-Box Optimization

146

0

0

20 Nov 2025

Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions

Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions

90

0

0

19 Nov 2025

From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs

From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs

179

0

0

19 Nov 2025

Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn

Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn

Eugene Vinitsky

142

0

0

18 Nov 2025

SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification

SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification

129

0

0

18 Nov 2025

Dynamic Template Selection for Output Token Generation Optimization: MLP-Based and Transformer Approaches

Dynamic Template Selection for Output Token Generation Optimization: MLP-Based and Transformer Approaches

Bharadwaj Yadavalli

202

0

0

17 Nov 2025

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection

Branislav Kisacanin

Shubham Toshniwal

George Armstrong

Christos Thrampoulidis

282

2

0

17 Nov 2025

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Harold Haodong Chen

...

366

3

0

17 Nov 2025

From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models

From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models

452

0

0

17 Nov 2025

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

...

252

0

0

17 Nov 2025

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

246

1

0

17 Nov 2025

Cost-Driven Synthesis of Sound Abstract Interpreters

Cost-Driven Synthesis of Sound Abstract Interpreters

Gagandeep Singh

88

0

0

17 Nov 2025

1 2 3 4...32 33 34

Page 1 of 34

Pageof 34