Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.01904
Cited By
REFINER: Reasoning Feedback on Intermediate Representations
4 April 2023
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"REFINER: Reasoning Feedback on Intermediate Representations"
50 / 140 papers shown
Title
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
60
0
0
05 May 2025
Tina: Tiny Reasoning Models via LoRA
Shangshang Wang
Julian Asilis
Ömer Faruk Akgül
Enes Burak Bilgin
Ollie Liu
W. Neiswanger
OffRL
LRM
29
1
0
22 Apr 2025
CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models
Feiyang Li
Peng Fang
Zhan Shi
Arijit Khan
Fang Wang
D. Feng
Weihao Wang
Xin Zhang
Yongjian Cui
ReLM
LRM
40
0
0
18 Apr 2025
Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback
Nearchos Potamitis
Akhil Arora
LRM
42
0
0
17 Apr 2025
Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification
Anqi Zhang
Yulin Chen
Jane Pan
Chen Zhao
Aurojit Panda
Jinyang Li
He He
ReLM
LRM
32
2
0
07 Apr 2025
A Survey of Scaling in Large Language Model Reasoning
Zihan Chen
Song Wang
Zhen Tan
Xingbo Fu
Zhenyu Lei
Peng Wang
Huan Liu
Cong Shen
Jundong Li
LRM
84
0
0
02 Apr 2025
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis
Anjiang Wei
Tarun Suresh
Jiannan Cao
Naveen Kannan
Yuheng Wu
Kai Yan
Thiago S. F. X. Teixeira
Ke Wang
Alex Aiken
ELM
LRM
34
0
0
29 Mar 2025
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback
Taewon Yun
Jihwan Oh
Hyangsuk Min
Yuho Lee
Jihwan Bang
Jason (Jinglun) Cai
Hwanjun Song
OffRL
LRM
34
0
0
27 Mar 2025
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
David Wan
Justin Chih-Yao Chen
Elias Stengel-Eskin
Mohit Bansal
LLMAG
LRM
60
1
0
19 Mar 2025
Visualizing Thought: Conceptual Diagrams Enable Robust Planning in LMMs
Nasim Borazjanizadeh
Roei Herzig
Eduard Oks
Trevor Darrell
Rogerio Feris
Leonid Karlinsky
LRM
46
0
0
14 Mar 2025
Self-Taught Self-Correction for Small Language Models
Viktor Moskvoretskii
Chris Biemann
Irina Nikishina
LRM
64
0
0
11 Mar 2025
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
Zichong Li
Xinyu Feng
Yuheng Cai
Zixuan Zhang
Tianyi Liu
Chen Liang
Weizhu Chen
Haoyu Wang
T. Zhao
LRM
50
1
0
06 Mar 2025
Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent
Xingzuo Li
Kehai Chen
Yunfei Long
X. Bai
Yong-mei Xu
Min Zhang
LRM
LLMAG
79
1
0
04 Mar 2025
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
Liping Liu
Chunhong Zhang
Likang Wu
Chuang Zhao
Zheng Hu
Ming He
Jianping Fan
LLMAG
LRM
36
0
0
02 Mar 2025
Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time
Jiazheng Li
Yuxiang Zhou
Junru Lu
Gladys Tyen
Lin Gui
Cesare Aloisi
Yulan He
LRM
33
2
0
26 Feb 2025
How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities
M. Lin
Hui Liu
X. Tang
Jingying Zeng
Zhenwei Dai
Chen Luo
Zheng Li
Xiang Zhang
Qi He
Suhang Wang
OffRL
LRM
41
0
0
25 Feb 2025
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics
Ting-Ruen Wei
Haowei Liu
Xuyang Wu
Yi Fang
LRM
AI4CE
ReLM
KELM
108
1
0
21 Feb 2025
S
2
^2
2
R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Ruotian Ma
Peisong Wang
Cheng Liu
Xingyan Liu
Jiaqi Chen
Bang Zhang
Xin Zhou
Nan Du
Jia Li
LRM
54
2
0
18 Feb 2025
LLMs can implicitly learn from mistakes in-context
Lisa Alazraki
Maximilian Mozes
Jon Ander Campos
Yi Chern Tan
Marek Rei
Max Bartolo
ReLM
LRM
88
0
0
12 Feb 2025
Towards Internet-Scale Training For Agents
Brandon Trabucco
Gunnar A. Sigurdsson
Robinson Piramuthu
Ruslan Salakhutdinov
ALM
98
2
0
10 Feb 2025
Cascaded Self-Evaluation Augmented Training for Lightweight Multimodal LLMs
Zheqi Lv
Wenkai Wang
Jiawei Wang
Shengyu Zhang
Fei Wu
LRM
ReLM
51
0
0
10 Jan 2025
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting
Dong-Hai Zhu
Yu-Jie Xiong
Jia-Chen Zhang
Xi-Jiong Xie
Chun-Ming Xia
ReLM
LRM
37
0
0
08 Jan 2025
Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models
Sijia Chen
Baochun Li
Di Niu
LLMAG
LRM
AI4CE
67
11
0
08 Jan 2025
Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models
Kaleem Ullah Qasim
Jiashu Zhang
Tariq Alsahfi
Ateeq Ur Rehman Butt
LRM
ReLM
61
1
0
03 Jan 2025
Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network
Ritik Mehta
Olha Jurecková
Mark Stamp
57
30
0
25 Dec 2024
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning
Huchen Jiang
Yangyang Ma
Chaofan Ding
Kexin Luan
Xinhan Di
ReLM
LRM
31
2
0
23 Dec 2024
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases
Meng-Chieh Lee
Qi Zhu
Costas Mavromatis
Zhen Han
Soji Adeshina
V. Ioannidis
Huzefa Rangwala
Christos Faloutsos
RALM
73
2
0
20 Dec 2024
Refining Answer Distributions for Improved Large Language Model Reasoning
Soumyasundar Pal
Didier Chetelat
Yingxue Zhang
Mark Coates
ReLM
LRM
79
0
0
17 Dec 2024
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning
Di Zhang
Jingdi Lei
Junxian Li
Xunzhi Wang
Y. Liu
...
S. M. I. Simon X. Yang
Jianbo Wu
Peng Ye
Wanli Ouyang
Dongzhan Zhou
OffRL
LRM
105
6
0
27 Nov 2024
Metacognition for Unknown Situations and Environments (MUSE)
Rodolfo Valiente
Praveen K. Pilly
64
0
0
20 Nov 2024
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Adam Fourney
Gagan Bansal
Hussein Mozannar
Cheng Tan
Eduardo Salinas
...
Victor C. Dibia
Ahmed Hassan Awadallah
Ece Kamar
Rafah Hosn
Saleema Amershi
AI4CE
LRM
LLMAG
38
34
0
07 Nov 2024
Teaching Models to Improve on Tape
L. Bezalel
Eyal Orgad
Amir Globerson
27
0
0
03 Nov 2024
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Zhiwei Liu
Weiran Yao
Jianguo Zhang
Rithesh Murthy
Liangwei Yang
...
Juan Carlos Niebles
Shelby Heinecke
Huan Wang
Silvio Savarese
Caiming Xiong
15
0
0
24 Oct 2024
CorrectionLM: Self-Corrections with SLM for Dialogue State Tracking
Chia-Hsuan Lee
Hao Cheng
Mari Ostendorf
LRM
21
0
0
23 Oct 2024
Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning
Pengfei He
Zitao Li
Yue Xing
Yaling Li
Jiliang Tang
Bolin Ding
LLMAG
LRM
20
1
0
18 Oct 2024
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Konstantin Yakovlev
Sergey I. Nikolenko
A. Bout
21
0
0
15 Oct 2024
A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative Thoughts
Hung-Fu Chang
Tong Li
KELM
LLMAG
34
0
0
10 Oct 2024
Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback
Sanjiban Choudhury
Paloma Sodhi
LLMAG
19
3
0
07 Oct 2024
Rationale-Aware Answer Verification by Pairwise Self-Evaluation
Akira Kawabata
Saku Sugawara
LRM
28
2
0
07 Oct 2024
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
Siyuan Huang
Zhiyuan Ma
Jintao Du
Changhua Meng
Weiqiang Wang
Zhouhan Lin
LRM
24
3
0
07 Oct 2024
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
Di Zhang
Jianbo Wu
Jingdi Lei
Tong Che
Jiatong Li
...
Shufei Zhang
Marco Pavone
Yuqiang Li
Wanli Ouyang
Dongzhan Zhou
LRM
27
42
0
03 Oct 2024
Multimodal Auto Validation For Self-Refinement in Web Agents
Ruhana Azam
Tamer Abuelsaad
Aditya Vempaty
Ashish Jagmohan
21
1
0
01 Oct 2024
Instance-adaptive Zero-shot Chain-of-Thought Prompting
Xiaosong Yuan
Chen Shen
Shaotian Yan
Xiaofeng Zhang
Liang Xie
Wenxiao Wang
Renchu Guan
Ying Wang
Jieping Ye
ReLM
LRM
41
4
0
30 Sep 2024
Path-Consistency: Prefix Enhancement for Efficient Inference in LLM
Jiace Zhu
Yingtao Shen
Jie Zhao
An Zou
LLMAG
LRM
22
4
0
25 Aug 2024
Enhancing Robustness in Large Language Models: Prompting for Mitigating the Impact of Irrelevant Information
Ming Jiang
Tingting Huang
Biao Guo
Yao Lu
Feng Zhang
LRM
29
1
0
20 Aug 2024
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Shichao Song
Zifan Zheng
Hanyu Wang
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Feiyu Xiong
Zhiyu Li
HILM
LRM
56
24
0
19 Jul 2024
Learning to Refine with Fine-Grained Natural Language Feedback
Manya Wadhwa
Xinyu Zhao
Junyi Jessy Li
Greg Durrett
18
11
0
02 Jul 2024
Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation
Jihyun Janice Ahn
Ryo Kamoi
Lu Cheng
Rui Zhang
Wenpeng Yin
30
1
0
27 Jun 2024
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs
Xueqing Wu
Zongyu Lin
Songyan Zhao
Te-Lin Wu
Pan Lu
Nanyun Peng
Kai-Wei Chang
LRM
45
2
0
19 Jun 2024
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang
Moxin Li
Wenjie Wang
Hui Lin
Fuli Feng
LRM
47
5
0
17 Jun 2024
1
2
3
Next