2303.17651

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Skyler Hallinan

Sarah Wiegreffe

Shrimai Prabhumoye

Bodhisattwa Prasad Majumder

Katherine Hermann

Amir Yazdanbakhsh

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,676 papers shown

Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading

Woongcheol Yang

104

0

0

28 Oct 2025

Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

YuanBing Ouyang

207

1

0

27 Oct 2025

Language Server CLI Empowers Language Agents with Process Rewards

Lanser Contributors

73

0

0

27 Oct 2025

Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model

Byung-Cheol Min

195

0

0

27 Oct 2025

Scalable Supervising Software Agents with Patch Reasoner

153

0

0

26 Oct 2025

Accelerating Materials Design via LLM-Guided Evolutionary Search

Nikhil Abhyankar

Chandan K. Reddy

109

0

0

26 Oct 2025

Scalable Oversight via Partitioned Human Supervision

Masashi Sugiyama

165

0

0

26 Oct 2025

Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration

Maneesh Agrawala

112

2

0

25 Oct 2025

Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning

Ravindra Aribowo Tarunokusumo

Rafael Fernandes Cunha

142

0

0

24 Oct 2025

FLAMES: Fine-tuning LLMs to Synthesize Invariants for Smart Contract Security

Mojtaba Eshghie

Gabriele Morello

Matteo Lauretano

Alexandre Bartel

Martin Monperrus

123

1

0

24 Oct 2025

Finding the Sweet Spot: Trading Quality, Cost, and Speed During Inference-Time LLM Reflection

Gaiar Baimuratov

105

0

0

23 Oct 2025

AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents

Journal of Systems and Software (JSS), 2025

110

0

0

23 Oct 2025

Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents

Brandon Paulsen

Daniel Kroening

120

0

0

23 Oct 2025

Code-enabled language models can outperform reasoning models on diverse tasks

Cedegao E. Zhang

Joshua B. Tenenbaum

ReLM ALM LRM AI4CE

189

0

0

23 Oct 2025

Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication

Sathish Indurthi

117

0

0

22 Oct 2025

Learning Affordances at Inference-Time for Vision-Language-Action Models

Sanjit A. Seshia

132

0

0

22 Oct 2025

WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

118

0

0

21 Oct 2025

PlanU: Large Language Model Reasoning through Planning under Uncertainty

153

0

0

21 Oct 2025

Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents

180

1

0

21 Oct 2025

Chain-of-Conceptual-Thought Elicits Daily Conversation in Large Language Models

292

0

0

21 Oct 2025

Illusions of reflection: open-ended task reveals systematic failures in Large Language Models' reflective reasoning

Sion Weatherhead

195

0

0

21 Oct 2025

SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation

Sion Weatherhead

139

0

0

21 Oct 2025

Prompting the Priorities: A First Look at Evaluating LLMs for Vulnerability Triage and Prioritization

Osama Al Haddad

234

0

0

21 Oct 2025

Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents

...

161

0

0

20 Oct 2025

Deep Self-Evolving Reasoning

171

1

0

20 Oct 2025

Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs

Paula Cordero-Encinar

200

1

0

20 Oct 2025

StreamingThinker: Large Language Models Can Think While Reading

346

2

0

20 Oct 2025

An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems

109

0

0

19 Oct 2025

Before you <think>, monitor: Implementing Flavell's metacognitive framework in LLMs

138

0

0

18 Oct 2025

LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs

Stefanie Jegelka

178

0

0

18 Oct 2025

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs

...

260

1

0

17 Oct 2025

VISTA: A Test-Time Self-Improving Video Generation Agent

Sercan Ö. Arık

252

3

0

17 Oct 2025

Stable but Miscalibrated: A Kantian View on Overconfidence from Filters to Large Language Models

209

0

0

16 Oct 2025

Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models

195

2

0

16 Oct 2025

LLM-ERM: Sample-Efficient Program Learning via LLM-Guided Search

96

1

0

16 Oct 2025

Are My Optimized Prompts Compromised? Exploring Vulnerabilities of LLM-based Optimizers

248

1

0

16 Oct 2025

Training LLM Agents to Empower Humans

Benjamin Eysenbach

194

0

0

15 Oct 2025

Generative Universal Verifier as Multimodal Meta-Reasoner

181

4

0

15 Oct 2025

Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

197

0

0

15 Oct 2025

Optimal Aggregation of LLM and PRM Signals for Efficient Test-Time Scaling

82

0

0

15 Oct 2025

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Carsten Schwemmer

100

1

0

14 Oct 2025

Self-Verifying Reflection Helps Transformers with CoT Reasoning

107

1

0

14 Oct 2025

Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models

111

0

0

14 Oct 2025

LLM Prompt Duel Optimizer: Efficient Label-Free Prompt Optimization

Amel Awadelkarim

118

1

0

14 Oct 2025

FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks

Sabrina McCallum

Alessandro Suglia

117

0

0

13 Oct 2025

LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

143

1

0

13 Oct 2025

ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding

115

3

0

13 Oct 2025

KnowRL: Teaching Language Models to Know What They Know

Devendra Singh Dhami

112

0

0

13 Oct 2025

Generative AI for Biosciences: Emerging Threats and Roadmap to Biosecurity

Souradip Chakraborty

Amrit Singh Bedi

Varsha Saravanan

...

438

1

0

13 Oct 2025

PrediQL: Automated Testing of GraphQL APIs with LLMs

Mohammad A. Tayebi

117

0

0

12 Oct 2025
