Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM LRM DiffM

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,678 papers shown
Calibrating Large Language Models with Sample Consistency
Qing Lyu
Kumar Shridhar
Chaitanya Malaviya
Li Zhang
Yanai Elazar
Niket Tandon
Marianna Apidianaki
Mrinmaya Sachan
Chris Callison-Burch
265
48
0
21 Feb 2024
CriticBench: Evaluating Large Language Models as Critic
Tian Lan
Wenwei Zhang
Chen Xu
Heyan Huang
Dahua Lin
Kai-xiang Chen
Xian-Ling Mao
ELM AI4MH LRM
181
2
0
21 Feb 2024
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Zhaorui Yang
Tianyu Pang
Hao Feng
Han Wang
Wei Chen
Minfeng Zhu
Qian Liu
ALM
326
78
0
21 Feb 2024
Data-driven Discovery with Large Generative Models
Bodhisattwa Prasad Majumder
Harshit Surana
Dhruv Agarwal
Sanchaita Hazra
Ashish Sabharwal
Peter Clark
268
21
0
21 Feb 2024
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao
Xinting Huang
Tingchen Fu
Qintong Li
Shansan Gong
Lemao Liu
Wei Bi
Lingpeng Kong
LRM
291
4
0
21 Feb 2024
RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
Jianhao Yan
Yun Luo
Yue Zhang
ALM LRM
329
12
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
403
87
0
21 Feb 2024
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Liyan Tang
Igor Shalyminov
Amy Wing-mei Wong
Jon Burnsky
Jake W. Vincent
...
Hang Su
Lijia Sun
Yi Zhang
Saab Mansour
Kathleen McKeown
HILM
242
77
0
20 Feb 2024
A Survey on Knowledge Distillation of Large Language Models
Xiaohan Xu
Ming Li
Chongyang Tao
Tao Shen
Reynold Cheng
Jinyang Li
Can Xu
Dacheng Tao
Wanrong Zhu
KELM VLM
469
238
0
20 Feb 2024
Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models
Che Zhang
Zhenyang Xiao
Chengcheng Han
Yixin Lian
Yuejian Fang
LRM
225
0
0
20 Feb 2024
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation
Dongjin Kang
Sunghwan Kim
Taeyoon Kwon
Seungjun Moon
Hyunsouk Cho
Youngjae Yu
Dongha Lee
Jinyoung Yeo
460
51
0
20 Feb 2024
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models
Loka Li
Zhenhao Chen
Guan-Hong Chen
Yixuan Zhang
Yusheng Su
Eric P. Xing
Kun Zhang
LRM
345
34
0
19 Feb 2024
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Yeo Wei Jie
Frank Xing
Rick Mong
Xiaoshi Zhong
ReLM LRM
335
39
0
19 Feb 2024
An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide
Oluwole Fagbohun
Rachel M. Harrison
Anton Dereventsov
287
19
0
18 Feb 2024
Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents
Renxi Wang
Jinyan Su
Xudong Han
Yixuan Zhang
Timothy Baldwin
LLMAG
253
38
0
18 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM ReLM LRM
387
52
0
17 Feb 2024
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Siyin Wang
Shimin Li
Tianxiang Sun
Jinlan Fu
Qinyuan Cheng
Jiasheng Ye
Junjie Ye
Xipeng Qiu
Xuanjing Huang
170
9
0
17 Feb 2024
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Damien Lopez
Kamalika Das
Sricharan Kumar
Kumar Sricharan
398
7
0
17 Feb 2024
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Ziru Chen
Michael White
Raymond Mooney
Ali Payani
Yu-Chuan Su
Huan Sun
LLMAG
339
53
0
16 Feb 2024
Exploring Hybrid Question Answering via Program-based Prompting
Qi Shi
Han Cui
Haofeng Wang
Qingfu Zhu
Wanxiang Che
Ting Liu
200
8
0
16 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Wanrong Zhu
320
36
0
16 Feb 2024
Rowen: Adaptive Retrieval-Augmented Generation for Hallucination Mitigation in LLMs
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
HILM RALM
466
26
0
16 Feb 2024
Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
Xiaoying Zhang
Baolin Peng
Ye Tian
Jingyan Zhou
Lifeng Jin
Linfeng Song
Haitao Mi
Chao Yang
HILM
297
97
0
14 Feb 2024
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
Maurice Diesendruck
Jianzhe Lin
Shima Imani
Gayathri Mahalingam
Mingyang Xu
Jie Zhao
124
3
0
13 Feb 2024
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
Haotian Sun
Yuchen Zhuang
Wei Wei
Chao Zhang
Bo Dai
307
6
0
13 Feb 2024
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Kaya Stechly
Subbarao Kambhampati
ReLMLRM
183
100
0
12 Feb 2024
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Víctor Gallego
SyDa
160
9
0
12 Feb 2024
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
Kyungha Kim
Sangyun Lee
Kung-Hsiang Huang
Hou Pong Chan
Pengfei Yu
Chenhui Xu
LRM
359
59
0
12 Feb 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
292
13
0
11 Feb 2024
Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education
Nischal Ashok Kumar
Andrew Lan
AI4Ed ELM
177
9
0
11 Feb 2024
Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought
International Conference on Machine Learning (ICML), 2024
Zhen-Yu Zhang
Siwei Han
Huaxiu Yao
Gang Niu
Masashi Sugiyama
LLMAG LRM
134
4
0
10 Feb 2024
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
Neural Information Processing Systems (NeurIPS), 2024
Yansong Ning
Hao Liu
LLMAG
276
16
0
10 Feb 2024
Feedback Loops With Language Models Drive In-Context Reward Hacking
International Conference on Machine Learning (ICML), 2024
Alexander Pan
Erik Jones
Meena Jagadeesan
Jacob Steinhardt
KELM
402
56
0
09 Feb 2024
Understanding the Effects of Iterative Prompting on Truthfulness
International Conference on Machine Learning (ICML), 2024
Satyapriya Krishna
Chirag Agarwal
Himabindu Lakkaraju
HILM
240
19
0
09 Feb 2024
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement
Muning Wen
Junwei Liao
Cheng Deng
Jun Wang
Weinan Zhang
Ying Wen
291
8
0
09 Feb 2024
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang
Zixu Zhang
J. F. Fisac
LLMAG
531
21
0
09 Feb 2024
In-Context Principle Learning from Mistakes
Tianjun Zhang
Aman Madaan
Luyu Gao
Steven Zheng
Swaroop Mishra
Yiming Yang
Niket Tandon
Uri Alon
KELM ReLM
211
40
0
08 Feb 2024
Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach
Zhuang Li
Levon Haroutunian
Raj Tumuluri
Philip R. Cohen
Gholamreza Haffari
125
5
0
07 Feb 2024
FaithLM: Towards Faithful Explanations for Large Language Models
Yu-Neng Chuang
Guanchu Wang
Chia-Yuan Chang
Ruixiang Tang
Shaochen Zhong
Fan Yang
Mengnan Du
Xuanting Cai
Helen Zhou
Xia Hu
LRM
323
6
0
07 Feb 2024
QuantAgent: Seeking Holy Grail in Trading by Self-Improving Large Language Model
Saizhuo Wang
Hang Yuan
Lionel M. Ni
Jian Guo
LLMAG AIFin
134
26
0
06 Feb 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Soumya Sanyal
Tianyi Xiao
Hamish Ivison
Wenya Wang
Xiang Ren
LRM ReLM
304
19
0
06 Feb 2024
Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
The Web Conference (WWW), 2024
Kelvin J.L. Koa
Yunshan Ma
Ritchie Ng
Tat-Seng Chua
AIFin LLMAG
351
50
0
06 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
240
12
0
06 Feb 2024
Unified Hallucination Detection for Multimodal Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiang Chen
Chenxi Wang
Yida Xue
Ningyu Zhang
Xiaoyan Yang
Qian Li
Yue Shen
Lei Liang
Jinjie Gu
Huajun Chen
HILM
459
67
0
05 Feb 2024
Understanding the planning of LLM agents: A survey
Xu Huang
Weiwen Liu
Xiaolong Chen
Xingmei Wang
Hao Wang
Defu Lian
Yasheng Wang
Ruiming Tang
Enhong Chen
LLMAG LM&Ro
322
353
0
05 Feb 2024
Position: What Can Large Language Models Tell Us about Time Series Analysis
Ming Jin
Yifan Zhang
Wei Chen
Kexin Zhang
Yuxuan Liang
Bin Yang
James Evans
Shirui Pan
Qingsong Wen
AI4TS
244
53
0
05 Feb 2024
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang
Yunxuan Li
Yuexin Wu
Liangchen Luo
Le Hou
Hongkun Yu
Jingbo Shang
LRM
241
42
0
05 Feb 2024
DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging
Matteo Pagliardini
Amirkeivan Mohtashami
François Fleuret
Martin Jaggi
261
15
0
04 Feb 2024
Integration of cognitive tasks into artificial general intelligence test for large models
Youzhi Qu
Chen Wei
Penghui Du
Wenxin Che
Chi Zhang
...
Bin Hu
Kai Du
Haiyan Wu
Jia Liu
Quanying Liu
ELM
185
12
0
04 Feb 2024
Aligner: Efficient Alignment by Learning to Correct
Jiaming Ji
Boyuan Chen
Hantao Lou
Chongye Guo
Borong Zhang
Xuehai Pan
Juntao Dai
Tianyi Qiu
Yaodong Yang
375
76
0
04 Feb 2024