Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.01904
Cited By
REFINER: Reasoning Feedback on Intermediate Representations
4 April 2023
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"REFINER: Reasoning Feedback on Intermediate Representations"
50 / 140 papers shown
Title
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Ruihan Yang
Jiangjie Chen
Yikai Zhang
Siyu Yuan
Aili Chen
Kyle Richardson
Yanghua Xiao
Deqing Yang
AI4CE
LM&Ro
41
8
0
07 Jun 2024
Are LLMs classical or nonmonotonic reasoners? Lessons from generics
Alina Leidinger
R. Rooij
Ekaterina Shutova
LRM
21
3
0
05 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
40
57
0
03 Jun 2024
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Xiaoyuan Li
Wenjie Wang
Moxin Li
Junrong Guo
Yang Zhang
Fuli Feng
ELM
LRM
33
15
0
02 Jun 2024
A Theoretical Understanding of Self-Correction through In-context Alignment
Yifei Wang
Yuyang Wu
Zeming Wei
Stefanie Jegelka
Yisen Wang
LRM
28
13
0
28 May 2024
RLSF: Reinforcement Learning via Symbolic Feedback
Piyush Jha
Prithwish Jana
Arnav Arora
Vijay Ganesh
LRM
36
3
0
26 May 2024
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
Chuanneng Sun
Songjun Huang
D. Pompili
LLMAG
29
25
0
17 May 2024
AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents
Shuyuan Xu
Zelong Li
Kai Mei
Yongfeng Zhang
26
3
0
11 May 2024
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models
Leonardo Ranaldi
André Freitas
LRM
ReLM
29
8
0
01 May 2024
Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models
Houjun Liu
LM&Ro
LRM
19
0
0
29 Apr 2024
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Moontae Lee
Honglak Lee
Lu Wang
LRM
KELM
ReLM
23
31
0
26 Apr 2024
No more optimization rules: LLM-enabled policy-based multi-modal query optimizer
Yifan Wang
Haodi Ma
Daisy Zhe Wang
28
1
0
20 Mar 2024
StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows
Yiran Wu
Tianwei Yue
Shaokun Zhang
Chi Wang
Qingyun Wu
40
21
0
17 Mar 2024
ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph
Xukun Liu
Zhiyuan Peng
Xiaoyuan Yi
Xing Xie
Lirong Xiang
Yuchen Liu
Dongkuan Xu
CLL
LLMAG
50
12
0
29 Feb 2024
Small But Funny: A Feedback-Driven Approach to Humor Distillation
Sahithya Ravi
Patrick Huber
Akshat Shrivastava
Aditya Sagar
Ahmed Aly
Vered Shwartz
Arash Einolghozati
33
5
0
28 Feb 2024
Self-Refinement of Language Models from External Proxy Metrics Feedback
Keshav Ramji
Young-Suk Lee
R. Astudillo
M. Sultan
Tahira Naseem
Asim Munawar
Radu Florian
Salim Roukos
HILM
20
3
0
27 Feb 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Y. Zhuang
Weiming Lu
LLMAG
23
33
0
27 Feb 2024
ByteComposer: a Human-like Melody Composition Method based on Language Model Agent
Xia Liang
Xingjian Du
Jiaju Lin
Pei Zou
Yuan Wan
Bilei Zhu
24
4
0
24 Feb 2024
Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes
Yezeng Chen
Zui Chen
Yi Zhou
LRM
23
2
0
23 Feb 2024
Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning
Hanqi Yan
Qinglin Zhu
Xinyu Wang
Lin Gui
Yulan He
LRM
LLMAG
24
4
0
22 Feb 2024
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul
Robert West
Antoine Bosselut
Boi Faltings
ReLM
LRM
25
5
0
21 Feb 2024
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models
Loka Li
Zhenhao Chen
Guan-Hong Chen
Yixuan Zhang
Yusheng Su
Eric P. Xing
Kun Zhang
LRM
36
15
0
19 Feb 2024
An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide
Oluwole Fagbohun
Rachel M. Harrison
Anton Dereventsov
41
6
0
18 Feb 2024
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Siyin Wang
Shimin Li
Tianxiang Sun
Jinlan Fu
Qinyuan Cheng
Jiasheng Ye
Junjie Ye
Xipeng Qiu
Xuanjing Huang
8
4
0
17 Feb 2024
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Alex Havrilla
Sharath Raparthy
Christoforus Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Roberta Railneau
ReLM
LRM
25
49
0
13 Feb 2024
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang
Zixu Zhang
J. F. Fisac
LLMAG
33
5
0
09 Feb 2024
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang
Yunxuan Li
Yuexin Wu
Liangchen Luo
Le Hou
Hongkun Yu
Jingbo Shang
LRM
29
18
0
05 Feb 2024
Integration of cognitive tasks into artificial general intelligence test for large models
Youzhi Qu
Chen Wei
Penghui Du
Wenxin Che
Chi Zhang
...
Bin Hu
Kai Du
Haiyan Wu
Jia Liu
Quanying Liu
ELM
26
6
0
04 Feb 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
10
14
0
31 Jan 2024
Scalable Qualitative Coding with LLMs: Chain-of-Thought Reasoning Matches Human Performance in Some Hermeneutic Tasks
Zackary Dunivin
17
16
0
26 Jan 2024
Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
Haochen Li
Jonathan Leung
Zhiqi Shen
LM&MA
LLMAG
LRM
12
0
0
25 Jan 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELM
LRM
LLMAG
ReLM
35
63
0
23 Jan 2024
Small Language Model Can Self-correct
Haixia Han
Jiaqing Liang
Jie Shi
Qi He
Yanghua Xiao
LRM
SyDa
ReLM
KELM
23
11
0
14 Jan 2024
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Dennis Ulmer
Elman Mansimov
Kaixiang Lin
Justin Sun
Xibin Gao
Yi Zhang
LLMAG
19
27
0
10 Jan 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Qiuying Peng
Jun Wang
Y. Zhuang
Weiming Lu
LRM
LLMAG
22
37
0
04 Jan 2024
LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models
Qianxi Li
Yingyue Cao
Jikun Kang
Tianpei Yang
Xi Chen
Jun Jin
Matthew E. Taylor
14
2
0
31 Dec 2023
LDM
2
^2
2
: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Xingjin Wang
Linjing Li
D. Zeng
16
0
0
13 Dec 2023
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce
Tongxin Hu
Zhuang Li
Xin Jin
Lizhen Qu
Xin Zhang
20
2
0
08 Dec 2023
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan
Ellen Zhang
Weijin Zou
Yilun Zhao
Wenfei Zhou
Arman Cohan
LLMAG
33
13
0
16 Nov 2023
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models
Weize Liu
Guocong Li
Kai Zhang
Bang Du
Qiyuan Chen
Xuming Hu
Hongxia Xu
Jintai Chen
Jian Wu
LRM
13
6
0
15 Nov 2023
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Shumin Deng
Ningyu Zhang
Nay Oo
Bryan Hooi
LRM
25
1
0
15 Nov 2023
Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment
Philippe Laban
Lidiya Murakhovs'ka
Caiming Xiong
Chien-Sheng Wu
LRM
19
17
0
14 Nov 2023
LLMs cannot find reasoning errors, but can correct them given the error location
Gladys Tyen
Hassan Mansoor
Victor Carbune
Peter Chen
Tony Mak
LRM
9
70
0
14 Nov 2023
SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks
Mengsay Loem
Masahiro Kaneko
Naoaki Okazaki
LRM
19
5
0
14 Nov 2023
The ART of LLM Refinement: Ask, Refine, and Trust
Kumar Shridhar
Koustuv Sinha
Andrew Cohen
Tianlu Wang
Ping Yu
Ramakanth Pasunuru
Mrinmaya Sachan
Jason Weston
Asli Celikyilmaz
LLMAG
ReLM
LRM
20
24
0
14 Nov 2023
From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models
Junbing Yan
Chengyu Wang
Taolin Zhang
Xiaofeng He
Jun Huang
Wei Zhang
ReLM
LRM
21
7
0
12 Nov 2023
Prompt Engineering a Prompt Engineer
Qinyuan Ye
Maxamed Axmed
Reid Pryzant
Fereshte Khani
VLM
LLMAG
LRM
19
28
0
09 Nov 2023
Defining a New NLP Playground
Sha Li
Chi Han
Pengfei Yu
Carl N. Edwards
Manling Li
...
Yi Ren Fung
Charles Yu
Joel R. Tetreault
Eduard H. Hovy
Heng Ji
31
5
0
31 Oct 2023
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics
Sajad Mousavi
Ricardo Luna Gutierrez
Desik Rengarajan
Vineet Gundecha
Ashwin Ramesh Babu
Avisek Naug
Antonio Guillen-Perez
S. Sarkar
LRM
HILM
KELM
18
6
0
28 Oct 2023
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
23
98
0
25 Oct 2023
Previous
1
2
3
Next