Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.13829
Cited By
Learning from Mistakes via Cooperative Study Assistant for Large Language Models
23 May 2023
Danqing Wang
Lei Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning from Mistakes via Cooperative Study Assistant for Large Language Models"
6 / 6 papers shown
Title
Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
36
5
0
30 May 2024
Recover: A Neuro-Symbolic Framework for Failure Detection and Recovery
Cristina Cornelio
Mohammed Diab
OffRL
22
9
0
31 Mar 2024
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Ziyang Chen
Dongfang Li
Xiang Zhao
Baotian Hu
Min Zhang
LRM
17
7
0
15 Nov 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
223
2,413
0
06 Oct 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
BBQ: A Hand-Built Bias Benchmark for Question Answering
Alicia Parrish
Angelica Chen
Nikita Nangia
Vishakh Padmakumar
Jason Phang
Jana Thompson
Phu Mon Htut
Sam Bowman
210
364
0
15 Oct 2021
1