v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Bodhisattwa Prasad Majumder

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,674 papers shown

Direct Alignment of Language Models via Quality-Aware Self-Refinement

203

31 May 2024

Improving Reward Models with Synthetic Critiques

Zihuiwen Ye

Fraser Greenlee-Scott

268

31 May 2024

Large Language Models Can Self-Improve At Web Agent Tasks

Ajay Patel

M. Hofmarcher

Claudiu Leoveanu-Condrei

Marius-Constantin Dinu

Chris Callison-Burch

Sepp Hochreiter

LLMAG

304

30 May 2024

Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation

256

30 May 2024

Grade Like a Human: Rethinking Automated Assessment with Large Language Models

220

30 May 2024

Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization

Liang Zheng

285

30 May 2024

Preference Learning Algorithms Do Not Learn Preference Rankings

316

29 May 2024

A Theoretical Understanding of Self-Correction through In-context Alignment

Zeming Wei

269

28 May 2024

A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models

Chengxing Xie

Difan Zou

LRM LLMAG

218

28 May 2024

TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models

249

28 May 2024

Self-Guiding Exploration for Combinatorial Problems

116

28 May 2024

MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting

210

28 May 2024

Position: Foundation Agents as the Paradigm Shift for Decision Making

400

27 May 2024

ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation

410

27 May 2024

Code Repair with LLMs gives an Exploration-Exploitation Tradeoff

Hao Tang

197

26 May 2024

RLSF: Fine-tuning LLMs via Symbolic Feedback

373

26 May 2024

Devil's Advocate: Anticipatory Reflection for LLM Agents

520

25 May 2024

Evolutionary Large Language Model for Automated Feature Transformation

173

25 May 2024

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Karl Tamberg

Hayretdin Bahsi

222

24 May 2024

Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search

263

24 May 2024

Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Yiming Chen

Chen Zhang

Haizhou Li

224

23 May 2024

Reinforcing Language Agents via Policy Optimization with Action Decomposition

Muning Wen

247

23 May 2024

RaFe: Ranking Feedback Improves Query Rewriting for RAGConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Peng Wang

Fei Huang

Huajun Chen

Ningyu Zhang

RALM

171

23 May 2024

ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based EvaluationNeural Information Processing Systems (NeurIPS), 2024

347

23 May 2024

Large Language Models Can Self-Correct with Minimal EffortConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

276

23 May 2024

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous AgentsInternational Conference on Learning Representations (ICLR), 2024

...

Daniel Toyama

625

178

23 May 2024

Large Language Models Meet NLP: A Survey

455

119

21 May 2024

DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction

Aimin Zhou

259

20 May 2024

The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving

Pai Zeng

Zhenyu Ning

Jieru Zhao

Mengwei Xu

286

18 May 2024

Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

250

16 May 2024

Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed RealityAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

...

351

16 May 2024

METAREFLECTION: Learning Instructions for Language Agents using Past ReflectionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

153

13 May 2024

MathDivide: Improved mathematical reasoning by large language models

S. Srivastava

Ashutosh Gandhi

LRM ReLM

111

12 May 2024

AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents

190

11 May 2024

LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-ThoughtInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

444

09 May 2024

MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Inderjeet Nair

Lu Wang

LRM

194

08 May 2024

Large Language Models for Cyber Security: A Systematic Literature Review

587

106

08 May 2024

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

332

07 May 2024

Optimizing Language Model's Reasoning Abilities with Weak Supervision

243

07 May 2024

Fleet of Agents: Coordinated Problem Solving with Large Language Models

196

07 May 2024

Self-Improving Customer Review Response Generation Based on LLMs

211

06 May 2024

Large Language Models Synergize with Automated Machine Learning

Jinglue Xu

Jialong Li

Zhen Liu

Nagar Anthel Venkatesh Suryanarayanan

210

06 May 2024

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Matthew Renze

Erhan Guven

LRM LLMAG

340

05 May 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language ModelEuropean Conference on Computer Vision (ECCV), 2024

Shanghang Zhang

319

03 May 2024

General Purpose Verification for Chain of Thought Prompting

Robert Vacareanu

Anurag Pratik

Evangelia Spiliopoulou

183

30 Apr 2024

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

581

29 Apr 2024

CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving

207

26 Apr 2024

LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study

269

26 Apr 2024

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

325

26 Apr 2024

Benchmarking Mobile Device Control Agents across Diverse Configurations

360

25 Apr 2024