v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Bodhisattwa Prasad Majumder

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,563 papers shown

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

16 Nov 2025

Genomic Next-Token Predictors are In-Context Learners

219

16 Nov 2025

Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts

136

15 Nov 2025

LOCA-R: Near-Perfect Performance on the Chinese Physics Olympiad 2025

...

257

13 Nov 2025

Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

Yunzhe Xu

Zhuosheng Zhang

Zhe Liu

169

13 Nov 2025

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal CritiqueConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

121

12 Nov 2025

Chain of Summaries: Summarization Through Iterative Questioning

William Brach

Lukas Galke Poech

HILM

220

12 Nov 2025

Feedback Descent: Open-Ended Text Optimization via Pairwise Comparison

Yoonho Lee

Joseph Boen

Chelsea Finn

163

11 Nov 2025

Investigating CoT Monitorability in Large Reasoning Models

204

11 Nov 2025

Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?

426

11 Nov 2025

General Intelligence-based Fragmentation (GIF): A framework for peak-labeled spectra simulation

Margaret R. Martin

Soha Hassoun

11 Nov 2025

Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging

108

11 Nov 2025

Adaptive Multi-Agent Response Refinement in Conversational Systems

127

11 Nov 2025

Beyond Detection: Exploring Evidence-based Multi-Agent Debate for Misinformation Intervention and Persuasion

200

10 Nov 2025

Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation

Keunhyeung Park

Seunguk Yu

Youngbin Kim

114

10 Nov 2025

S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous ReasoningMachine-mediated learning (ML), 2025

158

10 Nov 2025

FLEX: Continuous Agent Evolution via Forward Learning from Experience

278

09 Nov 2025

ScRPO: From Errors to Insights

155

08 Nov 2025

Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2025

396

08 Nov 2025

Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement

304

08 Nov 2025

Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models

208

07 Nov 2025

Monitor-Generate-Verify (MGV): Formalising Metacognitive Theory for Language Model Reasoning

Nick Oh

Fernand Gobet

LRM

184

06 Nov 2025

Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering

272

06 Nov 2025

Secure Code Generation at Scale with Reflexion

121

05 Nov 2025

The Sequential Edge: Inverse-Entropy Voting Beats Parallel Self-Consistency at Matched Compute

Aman Sharma

Paras Chopra

BDL LRM

222

04 Nov 2025

ReAcTree: Hierarchical LLM Agent Trees with Control Flow for Long-Horizon Task Planning

160

04 Nov 2025

The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models

Joanna Śmietańska-Nowak

ELM ALM LRM

397

04 Nov 2025

Analyzing the Power of Chain of Thought through Memorization Capabilities

208

03 Nov 2025

Context-Guided Decompilation: A Step Towards Re-executability

Xiaohan Wang

Yuxin Hu

Kevin Leach

103

03 Nov 2025

Knowledge Elicitation with Large Language Models for Interpretable Cancer Stage Identification from Pathology Reports

02 Nov 2025

How Focused Are LLMs? A Quantitative Study via Repetitive Deterministic Prediction Tasks

367

02 Nov 2025

Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation

169

01 Nov 2025

Diverse Human Value Alignment for Large Language Models via Ethical Reasoning

112

01 Nov 2025

Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective

149

01 Nov 2025

Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base

...

209

30 Oct 2025

CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions

...

203

30 Oct 2025

QCoder Benchmark: Bridging Language Generation and Quantum Hardware through Simulator-Based Feedback

...

254

30 Oct 2025

RCScore: Quantifying Response Consistency in Large Language Models

Dongjun Jang

Youngchae Ahn

Hyopil Shin

132

30 Oct 2025

InfoFlow: Reinforcing Search Agent Via Reward Density Optimization

113

30 Oct 2025

LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation

205

30 Oct 2025

RECAP: Reproducing Copyrighted Data from LLMs Training with an Agentic Pipeline

127

29 Oct 2025

FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data

187

29 Oct 2025

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

116

29 Oct 2025

A Survey on Efficient Large Language Model Training: From Data-centric PerspectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

...

142

29 Oct 2025

MGA: Memory-Driven GUI Agent for Observation-Centric Interaction

281

28 Oct 2025

StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems

28 Oct 2025

FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning

28 Oct 2025

VDSAgents: A PCS-Guided Multi-Agent System for Veridical Data Science Automation

128

28 Oct 2025

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

...

170

28 Oct 2025

Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading

28 Oct 2025