Reward Design with Language Models

27 February 2023
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
    LM&Ro

Papers citing "Reward Design with Language Models"

50 / 159 papers shown
Towards Socially and Morally Aware RL agent: Reward Design With LLM
Zhaoyue Wang
8
2
0
23 Jan 2024
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Quentin Delfosse
Sebastian Sztwiertnia
M. Rothermel
Wolfgang Stammer
Kristian Kersting
41
18
0
11 Jan 2024
Evaluating Language Model Agency through Negotiations
Tim R. Davidson
V. Veselovsky
Martin Josifoski
Maxime Peyrard
Antoine Bosselut
Michal Kosinski
Robert West
LLMAG
29
22
0
09 Jan 2024
RePLan: Robotic Replanning with Perception and Language Models
Marta Skreta
Zihan Zhou
Jia Lin Yuan
Kourosh Darvish
Alán Aspuru-Guzik
Animesh Garg
LM&Ro
LRM
32
26
0
08 Jan 2024
Behavioural Cloning in VizDoom
Ryan Spick
Timothy Bradley
Ayush Raina
P. Amadori
Guy Moss
LM&Ro
15
0
0
08 Jan 2024
A Survey on Robotic Manipulation of Deformable Objects: Recent Advances, Open Challenges and New Frontiers
Feida Gu
Yanmin Zhou
Zhipeng Wang
Shuo Jiang
Bin He
AI4CE
16
8
0
16 Dec 2023
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
29
32
0
14 Dec 2023
Vision-Language Models as a Source of Rewards
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
Harris Chan
Gheorghe Comanici
...
Yannick Schroecker
Stephen Spencer
Richie Steigerwald
Luyu Wang
Lei Zhang
VLM
LRM
32
26
0
14 Dec 2023
Foundation Models in Robotics: Applications, Challenges, and the Future
Roya Firoozi
Johnathan Tucker
Stephen Tian
Anirudha Majumdar
Jiankai Sun
...
Brian Ichter
Danny Driess
Jiajun Wu
Cewu Lu
Mac Schwager
LM&Ro
AI4CE
LRM
VLM
35
137
0
13 Dec 2023
Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
Zhiting Hu
Tianmin Shu
LLMAG
LM&Ro
LRM
102
34
0
08 Dec 2023
FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana
Johann Brehmer
P. D. Haan
Taco S. Cohen
OffRL
LRM
33
2
0
06 Dec 2023
Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents
Zihao Zhou
Bin-Bin Hu
Chenyang Zhao
Pu Zhang
Bin Liu
LLMAG
25
8
0
22 Nov 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
23
36
0
17 Nov 2023
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
Ruomeng Ding
Chaoyun Zhang
Lu Wang
Yong Xu
Ming-Jie Ma
Wei Zhang
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LRM
33
59
0
07 Nov 2023
Make a Donut: Hierarchical EMD-Space Planning for Zero-Shot Deformable Manipulation with Tools
Yang You
Bokui Shen
Congyue Deng
Haoran Geng
Songlin Wei
He-Nan Wang
Leonidas J. Guibas
16
1
0
05 Nov 2023
Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun-Mo Chu
Xufeng Zhao
C. Weber
Mengdi Li
Stefan Wermter
LLMAG
LM&Ro
22
14
0
04 Nov 2023
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
23
78
0
29 Oct 2023
Learning Reward for Physical Skills using Large Language Model
Yuwei Zeng
Yiqing Xu
23
6
0
21 Oct 2023
Eureka: Human-Level Reward Design via Coding Large Language Models
Yecheng Jason Ma
William Liang
Guanzhi Wang
De-An Huang
Osbert Bastani
Dinesh Jayaraman
Yuke Zhu
Linxi Fan
A. Anandkumar
19
286
0
19 Oct 2023
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Alvaro Velasquez
Robert Wright
Jivko Sinapov
16
9
0
14 Oct 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
S. Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
11
63
0
11 Oct 2023
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
Xihuai Wang
Shao Zhang
Wenhao Zhang
Wentao Dong
Jingxiao Chen
Ying Wen
Weinan Zhang
28
8
0
08 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
107
159
0
04 Oct 2023
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Chak Lam Shek
Xiyang Wu
Wesley A. Suttle
Carl E. Busart
Erin Zaroukian
Dinesh Manocha
Pratap Tokekar
Amrit Singh Bedi
LLMAG
41
8
0
30 Sep 2023
Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Martin Klissarov
P. D'Oro
Shagun Sodhani
Roberta Raileanu
Pierre-Luc Bacon
Pascal Vincent
Amy Zhang
Mikael Henaff
LRM
LLMAG
16
54
0
29 Sep 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
24
8
0
28 Sep 2023
SPOTS: Stable Placement of Objects with Reasoning in Semi-Autonomous Teleoperation Systems
Joonhyung Lee
Sangbeom Park
Jeongeun Park
Kyungjae Lee
Sungjoon Choi
15
2
0
25 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
18
9
0
19 Sep 2023
An Interactive Framework for Profiling News Media Sources
Nikhil Mehta
Dan Goldwasser
15
4
0
14 Sep 2023
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics
Jiayang Song
Zhehua Zhou
Jiawei Liu
Chunrong Fang
Zhan Shu
Lei Ma
15
27
0
13 Sep 2023
Gesture-Informed Robot Assistance via Foundation Models
Li-Heng Lin
Yuchen Cui
Yilun Hao
Fei Xia
Dorsa Sadigh
LM&Ro
SLR
13
19
0
06 Sep 2023
Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity
Achkan Salehi
Stéphane Doncieux
18
0
0
25 Aug 2023
SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge
Rishi Hazra
Pedro Zuidberg Dos Martires
Luc de Raedt
LM&Ro
LLMAG
13
31
0
24 Aug 2023
From Instructions to Intrinsic Human Values -- A Survey of Alignment Goals for Big Models
Jing Yao
Xiaoyuan Yi
Xiting Wang
Jindong Wang
Xing Xie
ALM
14
42
0
23 Aug 2023
LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying
T. G. Karimpanal
Laknath Semage
Santu Rana
Hung Le
T. Tran
Sunil R. Gupta
Svetha Venkatesh
RALM
LLMAG
8
2
0
21 Aug 2023
Pre-Trained Large Language Models for Industrial Control
Lei Song
Chuheng Zhang
Li Zhao
Jiang Bian
LM&Ro
AI4CE
22
12
0
06 Aug 2023
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Wenlong Huang
Chen Wang
Ruohan Zhang
Yunzhu Li
Jiajun Wu
Li Fei-Fei
LM&Ro
28
477
0
12 Jul 2023
RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
Zhao Mandi
Shreeya Jain
Shuran Song
LM&Ro
LLMAG
26
121
0
10 Jul 2023
Large Language Models as General Pattern Machines
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
39
183
0
10 Jul 2023
Preference Ranking Optimization for Human Alignment
Feifan Song
Yu Bowen
Minghao Li
Haiyang Yu
Fei Huang
Yongbin Li
Houfeng Wang
ALM
13
234
0
30 Jun 2023
Inferring the Goals of Communicating Agents from Actions and Instructions
Lance Ying
Zhi-Xuan Tan
Vikash K. Mansinghka
J. Tenenbaum
LLMAG
25
18
0
28 Jun 2023
LARG, Language-based Automatic Reward and Goal Generation
Julien Perez
Denys Proux
Claude Roux
Michael Niemaz
LM&Ro
14
1
0
19 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
16
9
0
15 Jun 2023
Toward Grounded Commonsense Reasoning
Minae Kwon
Hengyuan Hu
Vivek Myers
Siddharth Karamcheti
Anca Dragan
Dorsa Sadigh
LM&Ro
ReLM
LRM
22
9
0
14 Jun 2023
Language to Rewards for Robotic Skill Synthesis
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
30
267
0
14 Jun 2023
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Alexandre Ramé
Guillaume Couairon
Mustafa Shukor
Corentin Dancette
Jean-Baptiste Gaya
Laure Soulier
Matthieu Cord
MoMe
30
135
0
07 Jun 2023
OMNI: Open-endedness via Models of human Notions of Interestingness
Jenny Zhang
Joel Lehman
Kenneth O. Stanley
Jeff Clune
LRM
6
27
0
02 Jun 2023
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&Ro
LRM
35
35
0
30 May 2023
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought
Huaxiaoyue Wang
Gonzalo Gonzalez-Pumariega
Yash Sharma
Sanjiban Choudhury
LM&Ro
26
33
0
26 May 2023
ALGO: Synthesizing Algorithmic Programs with LLM-Generated Oracle Verifiers
Kexun Zhang
Danqing Wang
Jingtao Xia
William Yang Wang
Lei Li
23
39
0
24 May 2023