v1v2 (latest)

Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023

30 March 2023

Bodhisattwa Prasad Majumder

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,678 papers shown

Calibrating Large Language Models with Sample Consistency

Marianna Apidianaki

265

21 Feb 2024

CriticBench: Evaluating Large Language Models as Critic

Dahua Lin

181

21 Feb 2024

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

326

21 Feb 2024

Data-driven Discovery with Large Generative Models

Bodhisattwa Prasad Majumder

268

21 Feb 2024

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

Wei Bi

Lingpeng Kong

LRM

291

21 Feb 2024

RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models

329

21 Feb 2024

Large Language Models for Data Annotation: A Survey

Huan Liu

403

21 Feb 2024

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

...

Kathleen McKeown

242

20 Feb 2024

A Survey on Knowledge Distillation of Large Language Models

469

238

20 Feb 2024

Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models

225

20 Feb 2024

Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation

460

20 Feb 2024

Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

345

19 Feb 2024

How Interpretable are Reasoning Explanations from Prompting Large Language Models?

335

19 Feb 2024

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

Oluwole Fagbohun

Rachel M. Harrison

Anton Dereventsov

287

18 Feb 2024

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents

253

18 Feb 2024

Puzzle Solving using Reasoning of Large Language Models: A Survey

Panagiotis Giadikiaroglou

Maria Lymperaiou

Giorgos Filandrianos

Giorgos Stamou

ELM ReLM LRM

387

17 Feb 2024

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Tianxiang Sun

Jiasheng Ye

Xipeng Qiu

Xuanjing Huang

170

17 Feb 2024

SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization

398

17 Feb 2024

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

Huan Sun

339

16 Feb 2024

Exploring Hybrid Question Answering via Program-based Prompting

200

16 Feb 2024

Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements

320

16 Feb 2024

Rowen: Adaptive Retrieval-Augmented Generation for Hallucination Mitigation in LLMs

466

16 Feb 2024

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

Linfeng Song

297

14 Feb 2024

Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models

124

13 Feb 2024

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

307

13 Feb 2024

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

Kaya Stechly

Subbarao Kambhampati

ReLM LRM

183

100

12 Feb 2024

Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs

Víctor Gallego

SyDa

160

12 Feb 2024

Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate

359

12 Feb 2024

Natural Language Reinforcement Learning

292

11 Feb 2024

Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education

Nischal Ashok Kumar

Andrew Lan

AI4Ed ELM

177

11 Feb 2024

Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate ThoughtInternational Conference on Machine Learning (ICML), 2024

134

10 Feb 2024

UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph ConstructionNeural Information Processing Systems (NeurIPS), 2024

Yansong Ning

Hao Liu

LLMAG

276

10 Feb 2024

Feedback Loops With Language Models Drive In-Context Reward HackingInternational Conference on Machine Learning (ICML), 2024

402

09 Feb 2024

Understanding the Effects of Iterative Prompting on TruthfulnessInternational Conference on Machine Learning (ICML), 2024

Satyapriya Krishna

Chirag Agarwal

Himabindu Lakkaraju

HILM

240

09 Feb 2024

Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement

Muning Wen

Cheng Deng

291

09 Feb 2024

Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity

531

09 Feb 2024

In-Context Principle Learning from Mistakes

Tianjun Zhang

211

08 Feb 2024

Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach

125

07 Feb 2024

FaithLM: Towards Faithful Explanations for Large Language Models

323

07 Feb 2024

QuantAgent: Seeking Holy Grail in Trading by Self-Improving Large Language Model

134

06 Feb 2024

Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment VerificationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Xiang Ren

304

06 Feb 2024

Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language ModelsThe Web Conference (WWW), 2024

351

06 Feb 2024

Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

240

06 Feb 2024

Unified Hallucination Detection for Multimodal Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Ningyu Zhang

Lei Liang

Huajun Chen

459

05 Feb 2024

Understanding the planning of LLM agents: A survey

Xu Huang

Defu Lian

Ruiming Tang

Enhong Chen

LLMAG LM&Ro

322

353

05 Feb 2024

Position: What Can Large Language Models Tell Us about Time Series Analysis

Ming Jin

Kexin Zhang

244

05 Feb 2024

Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision

241

05 Feb 2024

DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging

Matteo Pagliardini

Amirkeivan Mohtashami

François Fleuret

Martin Jaggi

261

04 Feb 2024

Integration of cognitive tasks into artificial general intelligence test for large models

...

185

04 Feb 2024

Aligner: Efficient Alignment by Learning to Correct

Jiaming Ji

Juntao Dai

375

04 Feb 2024