Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2303.17651
Cited By

Self-Refine: Iterative Refinement with Self-Feedback

v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Skyler Hallinan

Sarah Wiegreffe

Shrimai Prabhumoye

Bodhisattwa Prasad Majumder

Katherine Hermann

Amir Yazdanbakhsh

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,674 papers shown

PrediQL: Automated Testing of GraphQL APIs with LLMs

PrediQL: Automated Testing of GraphQL APIs with LLMs

Mohammad A. Tayebi

113

0

0

12 Oct 2025

Towards Self-Refinement of Vision-Language Models with Triangular Consistency

Towards Self-Refinement of Vision-Language Models with Triangular Consistency

177

2

0

12 Oct 2025

Answer-Consistent Chain-of-thought Reinforcement Learning For Multi-modal Large Langauge Models

Answer-Consistent Chain-of-thought Reinforcement Learning For Multi-modal Large Langauge Models

Chuanyang Zheng

125

0

0

11 Oct 2025

Failure-Driven Workflow Refinement

Failure-Driven Workflow Refinement

115

12

0

11 Oct 2025

MedAgentAudit: Diagnosing and Quantifying Collaborative Failure Modes in Medical Multi-Agent Systems

MedAgentAudit: Diagnosing and Quantifying Collaborative Failure Modes in Medical Multi-Agent Systems

Ewen M. Harrison

114

1

0

11 Oct 2025

MatryoshkaThinking: Recursive Test-Time Scaling Enables Efficient Reasoning

MatryoshkaThinking: Recursive Test-Time Scaling Enables Efficient Reasoning

...

135

0

0

11 Oct 2025

Mitigating Hallucination in Multimodal Reasoning via Functional Attention Control

Mitigating Hallucination in Multimodal Reasoning via Functional Attention Control

135

0

0

11 Oct 2025

Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning

Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning

161

0

0

10 Oct 2025

Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise

Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise

114

0

0

10 Oct 2025

Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics

...

LLMAG LM&Ro AI4CE

182

3

0

10 Oct 2025

MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction

^3

O: Multi-Expert Consensus for Code Time Complexity Prediction

108

0

0

10 Oct 2025

Fundamentals of Building Autonomous LLM Agents

Fundamentals of Building Autonomous LLM Agents

Victor de Lamo Castrillo

Habtom Kahsay Gidey

204

2

0

10 Oct 2025

Training-Free Group Relative Policy Optimization

Training-Free Group Relative Policy Optimization

...

230

6

0

09 Oct 2025

Agent Learning via Early Experience

Agent Learning via Early Experience

...

Eric Fosler-Lussier

198

8

0

09 Oct 2025

Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains

Haibu Mathematical-Medical Intelligent Agent:Enhancing Large Language Model Reliability in Medical Tasks via Verifiable Reasoning Chains

52

0

0

09 Oct 2025

PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting

PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting

117

5

0

09 Oct 2025

ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation

ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation

112

0

0

09 Oct 2025

MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding

MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding

Siddeshwar Raghavan

136

0

0

09 Oct 2025

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation

Towards Reliable LLM-based Robot Planning via Combined Uncertainty Estimation

118

0

0

09 Oct 2025

LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?

LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?

96

0

0

09 Oct 2025

COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context

COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context

101

1

0

09 Oct 2025

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation

88

0

0

09 Oct 2025

Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models

Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models

218

0

0

08 Oct 2025

AgentAsk: Multi-Agent Systems Need to Ask

AgentAsk: Multi-Agent Systems Need to Ask

...

Yang Wang

116

0

0

08 Oct 2025

Inspection Planning Primitives with Implicit Models

Inspection Planning Primitives with Implicit Models

Hanna Kurniawati

Lashika Medagoda

109

2

0

08 Oct 2025

BG-FlipIn: A Bayesian game framework for FlipIt-insider models in advanced persistent threats

BG-FlipIn: A Bayesian game framework for FlipIt-insider models in advanced persistent threats

103

0

0

08 Oct 2025

MAPRO: Recasting Multi-Agent Prompt Optimization as Maximum a Posteriori Inference

MAPRO: Recasting Multi-Agent Prompt Optimization as Maximum a Posteriori Inference

129

1

0

08 Oct 2025

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

150

1

0

07 Oct 2025

RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases

RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases

Pengcheng Jiang

172

0

0

07 Oct 2025

ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems

ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems

Shiva Krishna Reddy Malay

152

0

0

07 Oct 2025

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

134

1

0

06 Oct 2025

AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems

AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems

Shambhavi Mishra

Laurent Charlin

Christopher Pal

99

0

0

06 Oct 2025

Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading

Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading

129

1

0

06 Oct 2025

Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization

Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization

Mohammad Mahdi Samiei Paqaleh

Arash Marioriyad

Arman Tahmasebi-Zadeh

Mohamadreza Fereydooni

Mahdi Ghaznavai

Mahdieh Soleymani Baghshah

120

0

0

06 Oct 2025

Large Language Models Hallucination: A Comprehensive Survey

Large Language Models Hallucination: A Comprehensive Survey

461

1

0

05 Oct 2025

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

...

Masashi Sugiyama

117

0

0

05 Oct 2025

Searching Meta Reasoning Skeleton to Guide LLM Reasoning

Searching Meta Reasoning Skeleton to Guide LLM Reasoning

199

1

0

05 Oct 2025

A global log for medical AI

A global log for medical AI

Alan Karthikesalingam

Bilal A. Mateen

Christopher A. Longhurst

...

172

0

0

05 Oct 2025

Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation

Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation

Patrice Béchard

Orlando Marquez Ayala

Mathieu Reymond

Alexandre Drouin

Alexandre Lacoste

128

1

0

05 Oct 2025

SPOGW: a Score-based Preference Optimization method via Group-Wise comparison for workflows

SPOGW: a Score-based Preference Optimization method via Group-Wise comparison for workflows

159

0

0

05 Oct 2025

Utility-Learning Tension in Self-Modifying Agents

Utility-Learning Tension in Self-Modifying Agents

Charles L. Wang

129

0

0

05 Oct 2025

Adversarial Agent Collaboration for C to Rust Translation

Adversarial Agent Collaboration for C to Rust Translation

Brandon Paulsen

154

2

0

04 Oct 2025

LLM Chemistry Estimation for Multi-LLM Recommendation

LLM Chemistry Estimation for Multi-LLM Recommendation

122

1

0

04 Oct 2025

Self-Reflective Generation at Test Time

Self-Reflective Generation at Test Time

144

1

0

03 Oct 2025

Self-Anchor: Large Language Model Reasoning via Step-by-step Attention Alignment

Self-Anchor: Large Language Model Reasoning via Step-by-step Attention Alignment

Hongxiang Zhang

98

1

0

03 Oct 2025

AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models

AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models

LLMAG LM&Ro AI4CE

160

0

0

03 Oct 2025

Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation

Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation

64

0

0

03 Oct 2025

Lang-PINN: From Language to Physics-Informed Neural Networks via a Multi-Agent Framework

Lang-PINN: From Language to Physics-Informed Neural Networks via a Multi-Agent Framework

214

1

0

03 Oct 2025

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Amrith Rajagopal Setlur

Ruslan Salakhutdinov

137

2

0

02 Oct 2025

GRACE: A Language Model Framework for Explainable Inverse Reinforcement Learning

GRACE: A Language Model Framework for Explainable Inverse Reinforcement Learning

Alexander Toshev

162

0

0

02 Oct 2025

1 2 3 4 5...32 33 34