1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
Papers citing
"DeepMind Control Suite"
50 / 791 papers shown
Title
TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation
Hangyu Li
Qin Zhao
Haoran Xu
Xinyu Jiang
Qingwei Ben
...
Jia Zeng
Hanqing Wang
Bo Dai
Junting Dong
Jiangmiao Pang
24
0
0
19 May 2025
TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion
Khang Nguyen
Khai Nguyen
An T. Le
Jan Peters
Manfred Huber
Ngo Anh Vien
Minh Nhat Vu
12
0
0
19 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
16
0
0
17 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
14
0
0
16 May 2025
Zero-Shot Visual Generalization in Robot Manipulation
Sumeet Batra
Gaurav Sukhatme
19
0
0
16 May 2025
Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning
Zengxia Guo
Bohui An
Zhongqi Lu
FedML
26
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Learning Diverse Natural Behaviors for Enhancing the Agility of Quadrupedal Robots
Huiqiao Fu
Haoyu Dong
Wentao Xu
Zhehao Zhou
Guizhou Deng
Kaiqiang Tang
D. Dong
Chunlin Chen
24
0
0
15 May 2025
ADD: Physics-Based Motion Imitation with Adversarial Differential Discriminators
Ziyu Zhang
S. Bashkirov
Dun Yang
Michael Taylor
Xue Bin Peng
40
0
0
08 May 2025
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
Bang You
Chenxu Wang
Huaping Liu
29
0
0
07 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
71
0
0
04 May 2025
Wasserstein Policy Optimization
David Pfau
Ian Davies
Diana Borsa
Joao G. M. Araujo
Brendan D. Tracey
H. V. Hasselt
29
0
0
01 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
47
0
0
30 Apr 2025
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Mingqi Yuan
Qi Wang
Guozheng Ma
Bo-wen Li
Xin Jin
Yunbo Wang
Xiaokang Yang
Wenjun Zeng
D. Tao
OffRL
AI4CE
40
0
0
24 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
38
1
0
21 Apr 2025
MAPLE: Encoding Dexterous Robotic Manipulation Priors Learned From Egocentric Videos
Alexey Gavryushin
Xi Wang
Robert J. S. Malate
Chenyu Yang
Xiaojun Jia
Shubh Goel
Davide Liconti
René Zurbrugg
Robert K. Katzschmann
Marc Pollefeys
44
1
0
08 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
Xuguang Lan
40
0
0
05 Apr 2025
Adapting World Models with Latent-State Dynamics Residuals
JB Lanier
Kyungmin Kim
Armin Karamzade
Yifei Liu
Ankita Sinha
Kat He
Davide Corsi
Roy Fox
49
0
0
03 Apr 2025
Bootstrapped Model Predictive Control
Yuhang Wang
Hanwei Guo
Sizhe Wang
Long Qian
Xuguang Lan
59
0
0
24 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
49
0
0
23 Mar 2025
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning
Chan Kim
Seung-Woo Seo
Seong-Woo Kim
OODD
238
0
0
21 Mar 2025
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Pratap Tokekar
66
0
0
18 Mar 2025
Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
Peter Böhm
Pauline Pounds
Archie C. Chapman
45
0
0
14 Mar 2025
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
Qian Wang
Zhe Zhang
Baao Xie
Xin Jin
Yansen Wang
Shiyu Wang
Liaomo Zheng
Xiaokang Yang
Wenjun Zeng
OffRL
68
0
0
11 Mar 2025
Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning
Varad Vaidya
Jishnu Keshavan
43
0
0
09 Mar 2025
Knowledge Retention for Continual Model-Based Reinforcement Learning
Yixiang Sun
Haotian Fu
M. L. Littman
George Konidaris
OffRL
CLL
VLM
59
0
0
06 Mar 2025
Boosting Offline Optimizers with Surrogate Sensitivity
Manh Cuong Dao
Phi Le Nguyen
Thao Nguyen Truong
Trong Nghia Hoang
OffRL
62
4
0
06 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
72
0
0
06 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
39
0
0
04 Mar 2025
Discrete Codebook World Models for Continuous Control
Aidan Scannell
Mohammadreza Nakhaei
Kalle Kujanpää
Yi Zhao
Kevin Sebastian Luck
Dieter Büchler
Joni Pajarinen
OffRL
50
1
0
01 Mar 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
70
1
0
24 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
77
5
0
21 Feb 2025
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Bryan L. M. de Oliveira
Murilo L. da Luz
Bruno Brandão
Luana G. B. Martins
Telma W. de L. Soares
Luckeciano C. Melo
OffRL
72
1
0
17 Feb 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL
Hongye Cao
Fan Feng
Meng Fang
Shaokang Dong
Tianpei Yang
Jing Huo
Yang Gao
61
1
0
14 Feb 2025
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
Xuefeng Liu
Songhao Jiang
Siyu Chen
Zhuoran Yang
Yuxin Chen
Ian Foster
Rick L. Stevens
LM&MA
OffRL
60
0
0
11 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
64
4
0
09 Feb 2025
Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
Yuhan Zhang
Guoqing Ma
Guangfu Hao
Liangxuan Guo
Yang Chen
S. Yu
OnRL
74
0
0
08 Feb 2025
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang
Yao Li
Xin Li
Hongyu Zang
Romain Laroche
Riashat Islam
OffRL
54
0
0
03 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
60
2
0
29 Jan 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. D'Oro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
46
3
0
28 Jan 2025
Episodic Novelty Through Temporal Distance
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
45
0
0
28 Jan 2025
Dream to Fly: Model-Based Reinforcement Learning for Vision-Based Drone Flight
Angel Romero
Ashwin Shenai
Ismail Geles
Elie Aljalbout
Davide Scaramuzza
84
1
0
24 Jan 2025
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu
Devin White
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
OffRL
39
0
0
13 Jan 2025
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
Yun-Hai Liu
Bowen Yang
Licheng Zhong
He Wang
Li Yi
50
5
0
23 Dec 2024
Environment Descriptions for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
85
1
0
22 Dec 2024
Equivariant Action Sampling for Reinforcement Learning and Planning
Linfeng Zhao
Owen Howell
Xupeng Zhu
Jung Yeon Park
Zhewen Zhang
Robin Walters
Lawson L. S. Wong
111
2
0
16 Dec 2024
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Miles Hutson
Isaac Kauvar
Nick Haber
75
0
0
08 Dec 2024
Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control
Seongmin Park
Hyungmin Kim
Wonseok Jeon
Juyoung Yang
Byeongwook Jeon
Yoonseon Oh
Jungwook Choi
93
1
0
02 Dec 2024
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan
Anan Schütt
Jill-Jênn Vie
Arun-Balajiee Lekshmi-Narayanan
Elisabeth André
Nicolas Vayatis
AI4Ed
76
0
0
18 Nov 2024
World Models: The Safety Perspective
Zifan Zeng
Chongzhe Zhang
Feng Liu
Joseph Sifakis
Qunli Zhang
Shiming Liu
Peng Wang
KELM
LLMAG
47
1
0
12 Nov 2024