Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.03078
Cited By
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
6 October 2022
Jiacheng Liu
Skyler Hallinan
Ximing Lu
Pengfei He
Sean Welleck
Hannaneh Hajishirzi
Yejin Choi
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering"
48 / 48 papers shown
Title
ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses
Esmail Gumaan
MoE
28
0
0
23 Mar 2025
Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models
Kaleem Ullah Qasim
Jiashu Zhang
Tariq Alsahfi
Ateeq Ur Rehman Butt
LRM
ReLM
61
1
0
03 Jan 2025
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Kang-Jun Liu
Xiaojian Jiang
Jiexin Xu
Jun Zhao
LRM
KELM
29
0
0
12 Oct 2024
ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering
Francesco Maria Molfese
Simone Conia
Riccardo Orlando
Roberto Navigli
ReLM
LRM
RALM
20
0
0
07 Oct 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Hamish Ivison
Yizhong Wang
Jiacheng Liu
Zeqiu Wu
Valentina Pyatkin
Nathan Lambert
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
34
38
0
13 Jun 2024
Large Language Models Can Self-Correct with Minimal Effort
Zhenyu Wu
Qingkai Zeng
Zhihan Zhang
Zhaoxuan Tan
Chao Shen
Meng-Long Jiang
KELM
LRM
ReLM
22
3
0
23 May 2024
Authorship Style Transfer with Policy Optimization
Shuai Liu
Shantanu Agarwal
Jonathan May
35
5
0
12 Mar 2024
Teaching Large Language Models to Reason with Reinforcement Learning
Alex Havrilla
Yuqing Du
Sharath Chandra Raparthy
Christoforos Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Sainbayar Sukhbaatar
Roberta Raileanu
ReLM
LRM
29
67
0
07 Mar 2024
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
Yuqi Zhu
Shuofei Qiao
Yixin Ou
Shumin Deng
N. Zhang
Shiwei Lyu
Yue Shen
Lei Liang
Jinjie Gu
H. Chen
LLMAG
LM&Ro
72
25
0
05 Mar 2024
Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Daojian Zeng
Kang Liu
Jun Zhao
LRM
38
8
0
28 Feb 2024
Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?
Ning Bian
Xianpei Han
Hongyu Lin
Yaojie Lu
Ben He
Le Sun
21
1
0
22 Feb 2024
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul
Robert West
Antoine Bosselut
Boi Faltings
ReLM
LRM
25
5
0
21 Feb 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
Dawei Yin
Pengjie Ren
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
16
5
0
17 Feb 2024
GenDec: A robust generative Question-decomposition method for Multi-hop reasoning
Jian Wu
Linyi Yang
Yuliang Ji
Wenhao Huang
Börje F. Karlsson
Manabu Okumura
16
2
0
17 Feb 2024
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Haritz Puerto
Martin Tutek
Somak Aditya
Xiaodan Zhu
Iryna Gurevych
ReCod
ReLM
LRM
43
9
0
18 Jan 2024
Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models
Jinyoung Park
Ameen Patel
Omar Zia Khan
Hyunwoo J. Kim
Jooyeon Kim
KELM
LRM
ReLM
23
4
0
16 Nov 2023
Digital Socrates: Evaluating LLMs through Explanation Critiques
Yuling Gu
Oyvind Tafjord
Peter Clark
ELM
LRM
19
2
0
16 Nov 2023
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Lu Wang
RALM
21
37
0
22 Oct 2023
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
28
19
0
20 Oct 2023
Robust Training for Conversational Question Answering Models with Reinforced Reformulation Generation
Magdalena Kaiser
Rishiraj Saha Roy
G. Weikum
8
3
0
20 Oct 2023
Generating Summaries with Controllable Readability Levels
Leonardo F. R. Ribeiro
Mohit Bansal
Markus Dreyer
54
19
0
16 Oct 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
Jiacheng Liu
Ramakanth Pasunuru
Hannaneh Hajishirzi
Yejin Choi
Asli Celikyilmaz
LRM
ReLM
19
22
0
07 Oct 2023
Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
Jiacheng Liu
Andrew Cohen
Ramakanth Pasunuru
Yejin Choi
Hannaneh Hajishirzi
Asli Celikyilmaz
6
22
0
26 Sep 2023
Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering
Yike Wu
Nan Hu
Sheng Bi
Guilin Qi
J. Ren
Anhuan Xie
Wei Song
RALM
16
53
0
20 Sep 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
21
19
0
31 Jul 2023
Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference
Junhao Zheng
Qianli Ma
Shengjie Qiu
Yue Wu
Peitian Ma
Junlong Liu
Hu Feng
Xichen Shang
Haibin Chen
AAML
KELM
CML
CLL
76
15
0
19 Jun 2023
FLamE: Few-shot Learning from Natural Language Explanations
Yangqiaoyu Zhou
Yiming Zhang
Chenhao Tan
LRM
FAtt
17
9
0
13 Jun 2023
From Words to Wires: Generating Functioning Electronic Devices from Natural Language Descriptions
Peter Alexander Jansen
19
2
0
24 May 2023
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
Boshi Wang
Xiang Yue
Huan Sun
ELM
LRM
13
58
0
22 May 2023
Making Language Models Better Tool Learners with Execution Feedback
Shuofei Qiao
Honghao Gui
Chengfei Lv
Qianghuai Jia
Huajun Chen
Ningyu Zhang
LLMAG
27
45
0
22 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
20
93
0
18 May 2023
A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot
Aanisha Bhattacharya
Yaman Kumar Singla
Balaji Krishnamurthy
R. Shah
Changyou Chen
VGen
17
11
0
16 May 2023
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Afra Feyza Akyürek
Ekin Akyürek
Aman Madaan
A. Kalyan
Peter Clark
Derry Wijaya
Niket Tandon
ALM
KELM
34
85
0
15 May 2023
Knowledge Rumination for Pre-trained Language Models
Yunzhi Yao
Peng Wang
Shengyu Mao
Chuanqi Tan
Fei Huang
Huajun Chen
Ningyu Zhang
KELM
17
3
0
15 May 2023
Large Language Model Programs
Imanol Schlag
Sainbayar Sukhbaatar
Asli Celikyilmaz
Wen-tau Yih
Jason Weston
Jürgen Schmidhuber
Xian Li
LRM
29
14
0
09 May 2023
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
Jiacheng Liu
Wenya Wang
Dianzhuo Wang
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
VLM
31
31
0
05 May 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
35
1,389
0
30 Mar 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
19
15
0
17 Feb 2023
Augmented Language Models: a Survey
Grégoire Mialon
Roberto Dessì
Maria Lomeli
Christoforos Nalmpantis
Ramakanth Pasunuru
...
Jane Dwivedi-Yu
Asli Celikyilmaz
Edouard Grave
Yann LeCun
Thomas Scialom
LRM
KELM
22
362
0
15 Feb 2023
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning
Yunhu Ye
Binyuan Hui
Min Yang
Binhua Li
Fei Huang
Yongbin Li
LMTD
ReLM
LRM
11
142
0
31 Jan 2023
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations
Valentina Pyatkin
Jena D. Hwang
Vivek Srikumar
Ximing Lu
Liwei Jiang
Yejin Choi
Chandra Bhagavatula
21
21
0
20 Dec 2022
Reasoning with Language Model Prompting: A Survey
Shuofei Qiao
Yixin Ou
Ningyu Zhang
Xiang Chen
Yunzhi Yao
Shumin Deng
Chuanqi Tan
Fei Huang
Huajun Chen
ReLM
ELM
LRM
44
295
0
19 Dec 2022
Elaboration-Generating Commonsense Question Answering at Scale
Wenya Wang
Vivek Srikumar
Hannaneh Hajishirzi
Noah A. Smith
ELM
LRM
17
15
0
02 Sep 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations
Jaehun Jung
Lianhui Qin
Sean Welleck
Faeze Brahman
Chandra Bhagavatula
Ronan Le Bras
Yejin Choi
ReLM
LRM
206
189
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge
Bill Yuchen Lin
Ziyi Wu
Yichi Yang
Dong-Ho Lee
Xiang Ren
ReLM
LRM
227
62
0
02 Jan 2021
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,724
0
26 Sep 2016
1