ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.07297
  4. Cited By
Language Instructed Reinforcement Learning for Human-AI Coordination

Language Instructed Reinforcement Learning for Human-AI Coordination

13 April 2023
Hengyuan Hu
Dorsa Sadigh
    LM&Ro
ArXivPDFHTML

Papers citing "Language Instructed Reinforcement Learning for Human-AI Coordination"

50 / 53 papers shown
Title
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
In-Chang Baek
Sung-Hyun Kim
Seo-Young Lee
Dong-Hyeun Kim
Kyung-Joong Kim
56
0
0
16 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
39
0
0
03 Mar 2025
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning
Sheila Schoepp
Masoud Jafaripour
Yingyue Cao
Tianpei Yang
Fatemeh Abdollahi
Shadan Golestan
Zahin Sufiyan
Osmar Zaiane
Matthew E. Taylor
OffRL
LM&Ro
46
0
0
24 Feb 2025
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression
Kai Yoshida
M. Mizukami
Seiya Kawano
Canasai Kruengkrai
Hiroaki Sugiyama
Koichiro Yoshino
ALM
OffRL
76
1
0
28 Jan 2025
Efficient Language-instructed Skill Acquisition via Reward-Policy
  Co-Evolution
Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution
Changxin Huang
Yanbin Chang
Junfan Lin
Junyang Liang
Runhao Zeng
Jianqiang Li
70
0
0
18 Dec 2024
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption
  In Zero-Shot Coordination Games
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
Usman Anwar
Ashish Pandian
Jia Wan
David M. Krueger
Jakob N. Foerster
34
0
0
07 Nov 2024
SYNERGAI: Perception Alignment for Human-Robot Collaboration
SYNERGAI: Perception Alignment for Human-Robot Collaboration
Yixin Chen
Guoxi Zhang
Yaowei Zhang
Hongming Xu
Peiyuan Zhi
Qing Li
Siyuan Huang
32
0
0
24 Sep 2024
MotIF: Motion Instruction Fine-tuning
MotIF: Motion Instruction Fine-tuning
Minyoung Hwang
Joey Hejna
Dorsa Sadigh
Yonatan Bisk
45
1
0
16 Sep 2024
Towards Collaborative Intelligence: Propagating Intentions and Reasoning
  for Multi-Agent Coordination with Large Language Models
Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models
Xihe Qiu
Haoyu Wang
Xiaoyu Tan
Chao Qu
Yujie Xiong
Yuan-Chia Cheng
Yinghui Xu
Wei Chu
Yuan Qi
LLMAG
LM&Ro
30
0
0
17 Jul 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
30
1
0
09 Jun 2024
A Survey of Language-Based Communication in Robotics
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
47
12
0
06 Jun 2024
Knowing What Not to Do: Leverage Language Model Insights for Action
  Space Pruning in Multi-agent Reinforcement Learning
Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning
Zhihao Liu
Xianliang Yang
Zichuan Liu
Yifan Xia
Wei Jiang
Yuanyu Zhang
Lijuan Li
Guoliang Fan
Lei Song
Bian Jiang
LLMAG
31
1
0
27 May 2024
Attaining Human`s Desirable Outcomes in Human-AI Interaction via
  Structural Causal Games
Attaining Human`s Desirable Outcomes in Human-AI Interaction via Structural Causal Games
Anjie Liu
Jianhong Wang
Haoxuan Li
Xu Chen
Jun Wang
Samuel Kaski
Mengyue Yang
40
0
0
26 May 2024
On Bits and Bandits: Quantifying the Regret-Information Trade-off
On Bits and Bandits: Quantifying the Regret-Information Trade-off
Itai Shufaro
Nadav Merlis
Nir Weinberger
Shie Mannor
31
0
0
26 May 2024
Policy Learning with a Language Bottleneck
Policy Learning with a Language Bottleneck
Megha Srivastava
Cédric Colas
Dorsa Sadigh
Jacob Andreas
35
3
0
07 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
PICLe: Eliciting Diverse Behaviors from Large Language Models with
  Persona In-Context Learning
PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning
Hyeong Kyu Choi
Yixuan Li
59
17
0
03 May 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
27
48
0
30 Mar 2024
LORD: Large Models based Opposite Reward Design for Autonomous Driving
LORD: Large Models based Opposite Reward Design for Autonomous Driving
Xin Ye
Feng Tao
Abhirup Mallik
Burhaneddin Yaman
Liu Ren
OffRL
26
2
0
27 Mar 2024
Collaborative AI Teaming in Unknown Environments via Active Goal
  Deduction
Collaborative AI Teaming in Unknown Environments via Active Goal Deduction
Zuyuan Zhang
Hanhan Zhou
Mahdi Imani
Taeyoung Lee
Tian-Shing Lan
24
11
0
22 Mar 2024
A Survey on Human-AI Teaming with Large Pre-Trained Models
A Survey on Human-AI Teaming with Large Pre-Trained Models
Vanshika Vats
Marzia Binta Nizam
Minghao Liu
Ziyuan Wang
Richard Ho
...
Celeste Shen
Rachel Shen
Nafisa Hussain
Kesav Ravichandran
James Davis
LM&MA
36
8
0
07 Mar 2024
Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social
  Dilemmas
Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social Dilemmas
Hao Guo
Chunjiang Mu
Yang Chen
Chen Shen
Shuyue Hu
Zhen Wang
42
6
0
27 Feb 2024
How Can LLM Guide RL? A Value-Based Approach
How Can LLM Guide RL? A Value-Based Approach
Shenao Zhang
Sirui Zheng
Shuqi Ke
Zhihan Liu
Wanxin Jin
Jianbo Yuan
Yingxiang Yang
Hongxia Yang
Zhaoran Wang
22
8
0
25 Feb 2024
Learning to Learn Faster from Human Feedback with Language Model
  Predictive Control
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Jacky Liang
Fei Xia
Wenhao Yu
Andy Zeng
Montse Gonzalez Arenas
...
N. Heess
Kanishka Rao
Nik Stewart
Jie Tan
Carolina Parada
LM&Ro
54
32
0
18 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
16
6
0
02 Feb 2024
True Knowledge Comes from Practice: Aligning LLMs with Embodied
  Environments via Reinforcement Learning
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
Weihao Tan
Wentao Zhang
Shanqi Liu
Longtao Zheng
Xinrun Wang
Bo An
OffRL
36
16
0
25 Jan 2024
Safe Reinforcement Learning with Free-form Natural Language Constraints
  and Pre-Trained Language Models
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Xingzhou Lou
Junge Zhang
Ziyan Wang
Kaiqi Huang
Yali Du
25
3
0
15 Jan 2024
Large Language Models for Robotics: Opportunities, Challenges, and
  Perspectives
Large Language Models for Robotics: Opportunities, Challenges, and Perspectives
Jiaqi Wang
Zihao Wu
Yiwei Li
Hanqi Jiang
Peng Shu
...
Lin Zhao
Bao Ge
Xiang Li
Tianming Liu
Shu Zhang
LM&Ro
27
60
0
09 Jan 2024
Vision-Language Models as a Source of Rewards
Vision-Language Models as a Source of Rewards
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
Harris Chan
Gheorghe Comanici
...
Yannick Schroecker
Stephen Spencer
Richie Steigerwald
Luyu Wang
Lei Zhang
VLM
LRM
32
26
0
14 Dec 2023
Toward General-Purpose Robots via Foundation Models: A Survey and
  Meta-Analysis
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
Yafei Hu
Quanting Xie
Vidhi Jain
Jonathan M Francis
Jay Patrikar
...
Xiaolong Wang
Sebastian A. Scherer
Z. Kira
Fei Xia
Yonatan Bisk
LM&Ro
AI4CE
30
63
0
14 Dec 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation
  via Language Corrections
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
23
36
0
17 Nov 2023
Imitation Bootstrapped Reinforcement Learning
Imitation Bootstrapped Reinforcement Learning
Hengyuan Hu
Suvir Mirchandani
Dorsa Sadigh
30
24
0
03 Nov 2023
Efficient Human-AI Coordination via Preparatory Language-based
  Convention
Efficient Human-AI Coordination via Preparatory Language-based Convention
Cong Guan
Lichao Zhang
Chunpeng Fan
Yi-Chen Li
Feng Chen
Lihe Li
Yunjia Tian
Lei Yuan
Yang Yu
LM&Ro
26
8
0
01 Nov 2023
Large Language Models as Generalizable Policies for Embodied Tasks
Large Language Models as Generalizable Policies for Embodied Tasks
Andrew Szot
Max Schwarzer
Harsh Agrawal
Bogdan Mazoure
Walter A. Talbott
Katherine Metcalf
Natalie Mackraz
Devon Hjelm
Alexander Toshev
LM&Ro
22
57
0
26 Oct 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
S. Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
14
63
0
11 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
107
159
0
04 Oct 2023
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in
  Unstructured Environments
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Chak Lam Shek
Xiyang Wu
Wesley A. Suttle
Carl E. Busart
Erin Zaroukian
Dinesh Manocha
Pratap Tokekar
Amrit Singh Bedi
LLMAG
46
8
0
30 Sep 2023
SPOTS: Stable Placement of Objects with Reasoning in Semi-Autonomous
  Teleoperation Systems
SPOTS: Stable Placement of Objects with Reasoning in Semi-Autonomous Teleoperation Systems
Joonhyung Lee
Sangbeom Park
Jeongeun Park
Kyungjae Lee
Sungjoon Choi
26
2
0
25 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
18
9
0
19 Sep 2023
Self-Refined Large Language Model as Automated Reward Function Designer
  for Deep Reinforcement Learning in Robotics
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics
Jiayang Song
Zhehua Zhou
Jiawei Liu
Chunrong Fang
Zhan Shu
Lei Ma
25
27
0
13 Sep 2023
Gesture-Informed Robot Assistance via Foundation Models
Gesture-Informed Robot Assistance via Foundation Models
Li-Heng Lin
Yuchen Cui
Yilun Hao
Fei Xia
Dorsa Sadigh
LM&Ro
SLR
13
19
0
06 Sep 2023
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with
  Language Models
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Wenlong Huang
Chen Wang
Ruohan Zhang
Yunzhu Li
Jiajun Wu
Li Fei-Fei
LM&Ro
28
477
0
12 Jul 2023
Large Language Models as General Pattern Machines
Large Language Models as General Pattern Machines
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
44
183
0
10 Jul 2023
Toward Grounded Commonsense Reasoning
Toward Grounded Commonsense Reasoning
Minae Kwon
Hengyuan Hu
Vivek Myers
Siddharth Karamcheti
Anca Dragan
Dorsa Sadigh
LM&Ro
ReLM
LRM
31
9
0
14 Jun 2023
Language to Rewards for Robotic Skill Synthesis
Language to Rewards for Robotic Skill Synthesis
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
30
267
0
14 Jun 2023
Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via
  Reinforcement Learning
Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning
Michael Villarreal
Bibek Poudel
Weizi Li
22
23
0
13 Jun 2023
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
293
4,077
0
24 May 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Improving Intrinsic Exploration with Language Abstractions
Improving Intrinsic Exploration with Language Abstractions
Jesse Mu
Victor Zhong
Roberta Raileanu
Minqi Jiang
Noah D. Goodman
Tim Rocktaschel
Edward Grefenstette
95
63
0
17 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
12
Next