v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Bodhisattwa Prasad Majumder

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,674 papers shown

Who is Undercover? Guiding LLMs to Explore Multi-Perspective Team Tactic in the Game

206

20 Oct 2024

The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public DiscourseConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

260

19 Oct 2024

Explaining Graph Neural Networks with Large Language Models: A Counterfactual Perspective for Molecular Property Prediction

Jundong Li

219

19 Oct 2024

Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language ModelsInternational Conference on Machine Learning (ICML), 2024

Qitan Lv

Jie Wang

Hanzhu Chen

Bin Li

Yongdong Zhang

Feng Wu

HILM

336

19 Oct 2024

MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration

352

19 Oct 2024

Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning

Yaling Li

166

18 Oct 2024

Real-time Factuality Assessment from Adversarial FeedbackAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Sanxing Chen

Yukun Huang

Bhuwan Dhingra

265

18 Oct 2024

LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems

Nan Xu

Xuezhe Ma

LRM

390

18 Oct 2024

LoGU: Long-form Generation with Uncertainty ExpressionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

588

18 Oct 2024

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

...

J. H. Liu

318

17 Oct 2024

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

...

243

17 Oct 2024

Utilizing Large Language Models in an iterative paradigm with domain feedback for molecule optimization

Khiem Le

Nitesh Chawla

378

17 Oct 2024

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web NavigationInternational Conference on Learning Representations (ICLR), 2024

403

17 Oct 2024

MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison FeedbackNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

427

17 Oct 2024

Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Qisheng Hu

Quanyu Long

Wenya Wang

934

17 Oct 2024

Retrospective Learning from InteractionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

339

17 Oct 2024

"Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities

Kaveh Eskandari Miandoab

Vasanth Sarathy

LRM ReLM

152

16 Oct 2024

Enhancing Mathematical Reasoning in LLMs by Stepwise CorrectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Zhaoxuan Tan

214

16 Oct 2024

Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning

Vernon Y.H. Toh

Deepanway Ghosal

Soujanya Poria

LRM

176

16 Oct 2024

Divide-Verify-Refine: Can LLMs Self-Align with Complex Instructions?Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Hui Liu

Qi He

Suhang Wang

276

16 Oct 2024

MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation

348

16 Oct 2024

Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-upAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

394

16 Oct 2024

Conformity in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

506

16 Oct 2024

Toolken+: Improving LLM Tool Usage with Reranking and a Reject OptionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Konstantin Yakovlev

Sergey I. Nikolenko

A. Bout

206

15 Oct 2024

Self-adaptive Multimodal Retrieval-Augmented Generation

Wenjia Zhai

VLM

191

15 Oct 2024

MIND: Math Informed syNthetic Dialogues for Pretraining LLMsInternational Conference on Learning Representations (ICLR), 2024

455

15 Oct 2024

Denial-of-Service Poisoning Attacks against Large Language Models

Yong Yang

376

125

14 Oct 2024

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

263

14 Oct 2024

Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

514

14 Oct 2024

Single Ground Truth Is Not Enough: Adding Flexibility to Aspect-Based Sentiment Analysis Evaluation

Jiyoung Lee

351

13 Oct 2024

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

889

12 Oct 2024

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

202

12 Oct 2024

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language ModelsInternational Conference on Learning Representations (ICLR), 2024

360

12 Oct 2024

Mentor-KD: Making Small Language Models Better Multi-step ReasonersConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Hojae Lee

Junho Kim

SangKeun Lee

LRM

210

11 Oct 2024

SocialGaze: Improving the Integration of Human Social Norms in Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

219

11 Oct 2024

Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning TasksInternational Conference on Learning Representations (ICLR), 2024

321

11 Oct 2024

SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-CorrectionInternational Conference on Learning Representations (ICLR), 2024

353

11 Oct 2024

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent SystemAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Weize Chen

Qixin Xu

Chen Qian

Cheng Yang

Zhiyuan Liu

Maosong Sun

LLMAG

268

10 Oct 2024

A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative ThoughtsThinking Skills and Creativity (TSC), 2024

Hung-Fu Chang

Tong Li

KELM LLMAG

150

10 Oct 2024

Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningInternational Conference on Learning Representations (ICLR), 2024

Eunho Yang

371

10 Oct 2024

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven InteractionsInternational Conference on Learning Representations (ICLR), 2024

Shuaiqiang Wang

Jun Xu

Ji-Rong Wen

352

10 Oct 2024

Self-Boosting Large Language Models with Synthetic Preference DataInternational Conference on Learning Representations (ICLR), 2024

Qingxiu Dong

Zhifang Sui

236

09 Oct 2024

Tree of Problems: Improving structured problem solving with compositionalityConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

114

09 Oct 2024

The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Yanjun Chen

Dawei Zhu

Yirong Sun

Xinghao Chen

Wei Zhang

Xiaoyu Shen

ALM

251

09 Oct 2024

Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack

Leo McKee-Reid

Christoph Sträter

Maria Angelica Martinez

Joe Needham

Mikita Balesni

OffRL

179

09 Oct 2024

LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple ConstraintsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Thomas Palmeira Ferraz

Nanyun Peng

295

09 Oct 2024

Uncovering Factor Level Preferences to Improve Human-Model AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

374

09 Oct 2024

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesInternational Conference on Learning Representations (ICLR), 2024

558

09 Oct 2024

Counterfactual Causal Inference in Natural Language with Large Language Models

Michael Witbrock

217

08 Oct 2024

O1 Replication Journey: A Strategic Progress Report -- Part 1

...

335

137

08 Oct 2024