Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.00001
Cited By
Reward Design with Language Models
27 February 2023
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reward Design with Language Models"
50 / 159 papers shown
Title
On Pre-training of Multimodal Language Models Customized for Chart Understanding
Wan-Cyuan Fan
Yen-Chun Chen
Mengchen Liu
Lu Yuan
Leonid Sigal
36
5
0
19 Jul 2024
Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath
Dylan Slack
Jeff Da
Yuntao Ma
Hugh Zhang
Spencer Whitehead
Sean Hendryx
24
0
0
18 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
24
7
0
18 Jul 2024
E
2
C
F
D
\mathrm{E^{2}CFD}
E
2
CFD
: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
19
0
0
08 Jul 2024
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs
Xianfu Chen
Celimuge Wu
Yi Shen
Yusheng Ji
Tsutomu Yoshinaga
Qiang Ni
Charilaos C. Zarakovitis
Honggang Zhang
31
1
0
06 Jul 2024
Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs
Zichao Shen
Tianchen Zhu
Qingyun Sun
Shiqi Gao
Jianxin Li
OffRL
18
1
0
28 Jun 2024
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Kenneth Li
Yiming Wang
Fernanda Viégas
Martin Wattenberg
30
6
0
17 Jun 2024
Bayesian Statistical Modeling with Predictors from LLMs
Michael Franke
Polina Tsvilodub
Fausto Carcassi
34
4
0
13 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
30
1
0
09 Jun 2024
REvolve: Reward Evolution with Large Language Models using Human Feedback
Rishi Hazra
Alkis Sygkounas
A. Persson
Amy Loutfi
Pedro Zuidberg Dos Martires
27
3
0
03 Jun 2024
Controlling Large Language Model Agents with Entropic Activation Steering
Nate Rahn
P. DÓro
Marc G. Bellemare
LLMSV
16
6
0
01 Jun 2024
Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning
Zhihao Liu
Xianliang Yang
Zichuan Liu
Yifan Xia
Wei Jiang
Yuanyu Zhang
Lijuan Li
Guoliang Fan
Lei Song
Bian Jiang
LLMAG
23
1
0
27 May 2024
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making
Chuanhao Li
Runhan Yang
Tiankai Li
Milad Bafarassat
Kourosh Sharifi
Dirk Bergemann
Zhuoran Yang
LLMAG
22
5
0
25 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
35
55
0
17 May 2024
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment
Yuwei Zeng
Yao Mu
Lin Shao
26
12
0
12 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
Murtaza Dalal
Tarun Chiruvolu
Devendra Singh Chaplot
Ruslan Salakhutdinov
LM&Ro
29
37
0
02 May 2024
Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs
Bahar Radmehr
Adish Singla
Tanja Kaser
LLMAG
AI4CE
25
5
0
29 Apr 2024
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations
Puhao Li
Tengyu Liu
Yuyang Li
Muzhi Han
Haoran Geng
Shu Wang
Yixin Zhu
Song-Chun Zhu
Siyuan Huang
39
16
0
26 Apr 2024
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Xun Wu
Shaohan Huang
Furu Wei
42
8
0
23 Apr 2024
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts
Jing-Cheng Pang
Si-Hang Yang
Kaiyuan Li
Jiaji Zhang
Xiong-Hui Chen
Nan Tang
Yang Yu
OffRL
KELM
LLMAG
36
4
0
14 Apr 2024
Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation
Hanlin Tian
Kethan Reddy
Yuxiang Feng
Mohammed Quddus
Y. Demiris
Panagiotis Angeloudis
27
10
0
12 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
69
49
0
02 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
27
48
0
30 Mar 2024
RAIL: Robot Affordance Imagination with Large Language Models
Ceng Zhang
Xin Meng
Dongchen Qi
Gregory S. Chirikjian
LM&Ro
27
3
0
28 Mar 2024
LORD: Large Models based Opposite Reward Design for Autonomous Driving
Xin Ye
Feng Tao
Abhirup Mallik
Burhaneddin Yaman
Liu Ren
OffRL
26
2
0
27 Mar 2024
Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games
Yikuan Yan
Yaolun Zhang
Keman Huang
24
10
0
26 Mar 2024
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Abhaysinh Zala
Jaemin Cho
Han Lin
Jaehong Yoon
Mohit Bansal
26
13
0
18 Mar 2024
Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Joonhyung Lee
Sangbeom Park
Yongin Kwon
Jemin Lee
Minwook Ahn
Sungjoon Choi
24
0
0
18 Mar 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
33
7
0
14 Mar 2024
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Shikhar Murty
Christopher D. Manning
Peter Shaw
Mandar Joshi
Kenton Lee
LM&Ro
LLMAG
26
14
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
35
8
0
11 Mar 2024
Playing NetHack with LLMs: Potential & Limitations as Zero-Shot Agents
Dominik Jeurissen
Diego Perez-Liebana
Jeremy Gow
Duygu Cakmak
James Kwan
LLMAG
18
7
0
01 Mar 2024
Learning with Language-Guided State Abstractions
Andi Peng
Ilia Sucholutsky
Belinda Z. Li
T. Sumers
Thomas L. Griffiths
Jacob Andreas
Julie A. Shah
LM&Ro
43
13
0
28 Feb 2024
Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social Dilemmas
Hao Guo
Chunjiang Mu
Yang Chen
Chen Shen
Shuyue Hu
Zhen Wang
42
5
0
27 Feb 2024
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
19
0
0
24 Feb 2024
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
Nikhil Behari
Edwin Zhang
Yunfan Zhao
Aparna Taneja
Dheeraj M. Nagaraj
Milind Tambe
51
10
0
22 Feb 2024
From Reals to Logic and Back: Inventing Symbolic Vocabularies, Actions, and Models for Planning from Raw Data
Naman Shah
Jayesh Nagpal
Pulkit Verma
Siddharth Srivastava
NAI
34
9
0
19 Feb 2024
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Jacky Liang
Fei Xia
Wenhao Yu
Andy Zeng
Montse Gonzalez Arenas
...
N. Heess
Kanishka Rao
Nik Stewart
Jie Tan
Carolina Parada
LM&Ro
52
32
0
18 Feb 2024
LLM-driven Imitation of Subrational Behavior : Illusion or Reality?
Andrea Coletta
Kshama Dwarakanath
Penghang Liu
Svitlana Vyetrenko
T. Balch
11
10
0
13 Feb 2024
Natural Language Reinforcement Learning
Xidong Feng
Ziyu Wan
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
35
3
0
11 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
41
45
0
08 Feb 2024
Code as Reward: Empowering Reinforcement Learning with VLMs
David Venuto
Sami Nur Islam
Martin Klissarov
Doina Precup
Sherry Yang
Ankit Anand
VLM
18
9
0
07 Feb 2024
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
19
0
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
53
48
0
06 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
16
6
0
02 Feb 2024
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Subbarao Kambhampati
Karthik Valmeekam
L. Guan
Mudit Verma
Kaya Stechly
Siddhant Bhambri
Lucas Saldyt
Anil Murthy
LRM
84
108
0
02 Feb 2024
Institutional Platform for Secure Self-Service Large Language Model Exploration
V. Bumgardner
Mitchell A. Klusty
W. V. Logan
Samuel E. Armstrong
Caylin D. Hickey
Jeff Talbert
Caylin Hickey
Jeff Talbert
44
1
0
01 Feb 2024
Hierarchical Continual Reinforcement Learning via Large Language Model
Chaofan Pan
Xin Yang
Hao Wang
Wei Wei
Tianrui Li
20
2
0
25 Jan 2024
Previous
1
2
3
4
Next