Self-Refine: Iterative Refinement with Self-Feedback

Neural Information Processing Systems (NeurIPS), 2023
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM, LRM, DiffM

Papers citing "Self-Refine: Iterative Refinement with Self-Feedback"

50 / 1,674 papers shown
Who is Undercover? Guiding LLMs to Explore Multi-Perspective Team Tactic in the Game
Ruiqi Dong
Zhixuan Liao
Guangwei Lai
Yuhan Ma
Danni Ma
Chenyou Fan
LLMAG
206
2
0
20 Oct 2024
The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xiaobo Guo
Neil Potnis
Melody Yu
Nabeel Gillani
Soroush Vosoughi
260
2
0
19 Oct 2024
Explaining Graph Neural Networks with Large Language Models: A Counterfactual Perspective for Molecular Property Prediction
Yinhan He
Zaiyi Zheng
Patrick Soga
Yaozhen Zhu
Yushun Dong
Jundong Li
219
2
0
19 Oct 2024
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
International Conference on Machine Learning (ICML), 2024
Qitan Lv
Jie Wang
Hanzhu Chen
Bin Li
Yongdong Zhang
Feng Wu
HILM
336
11
0
19 Oct 2024
MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration
Siyuan Lu
Jiaqi Shao
B. Luo
Tao Lin
LM&Ro, LLMAG, AI4CE
352
6
0
19 Oct 2024
Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning
Pengfei He
Zitao Li
Yue Xing
Yaling Li
Shucheng Zhou
Bolin Ding
LLMAG, LRM
166
5
0
18 Oct 2024
Real-time Factuality Assessment from Adversarial Feedback
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Sanxing Chen
Yukun Huang
Bhuwan Dhingra
265
0
0
18 Oct 2024
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
Nan Xu
Xuezhe Ma
LRM
390
5
0
18 Oct 2024
LoGU: Long-form Generation with Uncertainty Expressions
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ruihan Yang
Caiqi Zhang
Zhisong Zhang
Xinting Huang
Sen Yang
Nigel Collier
Dong Yu
Deqing Yang
HILM
588
18
0
18 Oct 2024
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Siwei Wu
Zhongyuan Peng
Xinrun Du
Tuney Zheng
Minghao Liu
...
Rundong Wang
Wenhao Huang
Ge Zhang
Chenghua Lin
J. H. Liu
ELM, LLMAG, LRM, AI4CE
318
70
0
17 Oct 2024
Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models
Chengyu Du
Jinyi Han
Yizhou Ying
Aili Chen
Qianyu He
...
Haoran Guo
Jiaqing Liang
Zulong Chen
Liangyue Li
Yanghua Xiao
KELM, CLL, LRM
243
5
0
17 Oct 2024
Utilizing Large Language Models in an iterative paradigm with domain feedback for molecule optimization
Khiem Le
Nitesh Chawla
378
0
0
17 Oct 2024
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
International Conference on Learning Representations (ICLR), 2024
Hyungjoo Chae
Namyoung Kim
Kai Tzu-iunn Ong
Minju Gwak
Gwanwoo Song
Jihoon Kim
Seon Gyeom Kim
Dongha Lee
Jinyoung Yeo
LLMAG
403
57
0
17 Oct 2024
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zonghai Yao
Aditya Parashar
Huixue Zhou
Won Seok Jang
Feiyun Ouyang
Zhichao Yang
Hong-ye Yu
ELM
427
14
0
17 Oct 2024
Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Qisheng Hu
Quanyu Long
Wenya Wang
934
21
0
17 Oct 2024
Retrospective Learning from Interactions
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zizhao Chen
Mustafa Omer Gul
Yiwei Chen
Gloria Geng
Anne Wu
Yoav Artzi
LRM
339
3
0
17 Oct 2024
"Let's Argue Both Sides": Argument Generation Can Force Small Models to
  Utilize Previously Inaccessible Reasoning Capabilities
Kaveh Eskandari Miandoab
Vasanth Sarathy
LRM, ReLM
152
2
0
16 Oct 2024
Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhenyu Wu
Qingkai Zeng
Zizhuo Zhang
Zhaoxuan Tan
Chao Shen
Meng Jiang
KELM, LRM
214
8
0
16 Oct 2024
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning
Vernon Y.H. Toh
Deepanway Ghosal
Soujanya Poria
LRM
176
6
0
16 Oct 2024
Divide-Verify-Refine: Can LLMs Self-Align with Complex Instructions?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xianren Zhang
Xianfeng Tang
Hui Liu
Zongyu Wu
Qi He
Dongwon Lee
Suhang Wang
ALM
276
2
0
16 Oct 2024
MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation
Aniket Deroy
Subhankar Maity
Sudeshna Sarkar
LLMAG, LRM
348
5
0
16 Oct 2024
Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiahao Yuan
Dehui Du
Hao Zhang
Zixiang Di
Usman Naseem
LRM
394
9
0
16 Oct 2024
Conformity in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiaochen Zhu
Caiqi Zhang
Tom Stafford
Nigel Collier
Andreas Vlachos
506
8
0
16 Oct 2024
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Konstantin Yakovlev
Sergey I. Nikolenko
A. Bout
206
3
0
15 Oct 2024
Self-adaptive Multimodal Retrieval-Augmented Generation
Wenjia Zhai
VLM
191
3
0
15 Oct 2024
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
International Conference on Learning Representations (ICLR), 2024
Syeda Nahida Akter
Shrimai Prabhumoye
John Kamalu
S. Satheesh
Eric Nyberg
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
LRM, SyDa, ReLM
455
6
0
15 Oct 2024
Denial-of-Service Poisoning Attacks against Large Language Models
Kuofeng Gao
Tianyu Pang
Chao Du
Yong Yang
Shu-Tao Xia
Min Lin
SILM, AAML
376
125
0
14 Oct 2024
Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs
Ishan Jindal
Chandana Badrinath
Pranjal Bharti
Lakkidi Vinay
Sachin Dev Sharma
CLL, ALM
263
6
0
14 Oct 2024
Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Fangru Lin
Shaoguang Mao
Emanuele La Malfa
Valentin Hofmann
Adrian de Wynter
Jing Yao
Si-Qing Chen
Michael Wooldridge
J. Pierrehumbert
Furu Wei
514
3
0
14 Oct 2024
Single Ground Truth Is Not Enough: Adding Flexibility to Aspect-Based Sentiment Analysis Evaluation
Soyoung Yang
Hojun Cho
Jiyoung Lee
Sohee Yoon
E. Choi
Jaegul Choo
Won Ik Cho
351
0
0
13 Oct 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Yuxi Xie
Anirudh Goyal
Xiaobao Wu
Xunjian Yin
Xiao Xu
Min-Yen Kan
Liangming Pan
William Yang Wang
LRM
889
1
0
12 Oct 2024
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Kang Liu
Xiaojian Jiang
Jiexin Xu
Jun Zhao
LRM, KELM
202
1
0
12 Oct 2024
MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
International Conference on Learning Representations (ICLR), 2024
Jiachun Li
Pengfei Cao
Zhuoran Jin
Yubo Chen
Kang Liu
Jun Zhao
LRM, ELM
360
13
0
12 Oct 2024
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hojae Lee
Junho Kim
SangKeun Lee
LRM
210
14
0
11 Oct 2024
SocialGaze: Improving the Integration of Human Social Norms in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Anvesh Rao Vijjini
Rakesh R Menon
Jiayi Fu
Shashank Srivastava
Snigdha Chaturvedi
ALM
219
4
0
11 Oct 2024
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
International Conference on Learning Representations (ICLR), 2024
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM, LRM
321
2
0
11 Oct 2024
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
International Conference on Learning Representations (ICLR), 2024
L. Yang
Zhaochen Yu
Tianze Zhang
Minkai Xu
Alfons Kemper
Tengjiao Wang
Shuicheng Yan
ELM, ReLM, LRM
353
0
0
11 Oct 2024
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Weize Chen
Qixin Xu
Chen Qian
Cheng Yang
Zhiyuan Liu
Maosong Sun
LLMAG
268
17
0
10 Oct 2024
A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative Thoughts
Thinking Skills and Creativity (TSC), 2024
Hung-Fu Chang
Tong Li
KELM, LLMAG
150
19
0
10 Oct 2024
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning
International Conference on Learning Representations (ICLR), 2024
Hyun Ryu
Gyeongman Kim
Hyemin S. Lee
Eunho Yang
LRM
371
24
0
10 Oct 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
International Conference on Learning Representations (ICLR), 2024
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Ji-Rong Wen
352
24
0
10 Oct 2024
Self-Boosting Large Language Models with Synthetic Preference Data
International Conference on Learning Representations (ICLR), 2024
Qingxiu Dong
Li Dong
Xingxing Zhang
Zhifang Sui
Furu Wei
SyDa
236
27
0
09 Oct 2024
Tree of Problems: Improving structured problem solving with compositionality
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
A. Zebaze
Benoît Sagot
Rachel Bawden
LRM
114
5
0
09 Oct 2024
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yanjun Chen
Dawei Zhu
Yirong Sun
Xinghao Chen
Wei Zhang
Xiaoyu Shen
ALM
251
12
0
09 Oct 2024
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack
Leo McKee-Reid
Christoph Sträter
Maria Angelica Martinez
Joe Needham
Mikita Balesni
OffRL
179
7
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
295
26
0
09 Oct 2024
Uncovering Factor Level Preferences to Improve Human-Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Juhyun Oh
Eunsu Kim
Jiseon Kim
Wenda Xu
Inha Cha
William Yang Wang
Alice Oh
374
1
0
09 Oct 2024
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses
International Conference on Learning Representations (ICLR), 2024
Zonglin Yang
Wanhao Liu
Ben Gao
Tong Xie
You Li
Wanli Ouyang
Soujanya Poria
Xiaoshi Zhong
Dongzhan Zhou
LRM
558
44
0
09 Oct 2024
Counterfactual Causal Inference in Natural Language with Large Language Models
Gaël Gendron
Jože M. Rožanec
Michael Witbrock
Gillian Dobbie
CML
217
7
0
08 Oct 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1
Yiwei Qin
Xuefeng Li
Haoyang Zou
Yixiu Liu
Shijie Xia
...
Yixin Ye
Weizhe Yuan
Hector Liu
Rui Wang
Pengfei Liu
VLM
335
137
0
08 Oct 2024