Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.03244
Cited By
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
5 February 2024
Kolby Nottingham
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Sameer Singh
Peter Clark
Roy Fox
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills"
3 / 3 papers shown
Title
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
208
2,413
0
06 Oct 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
226
89
0
27 Sep 2021
1