Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.00378
Cited By
Time Limits in Reinforcement Learning
1 December 2017
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Time Limits in Reinforcement Learning"
35 / 35 papers shown
Title
A General Approach of Automated Environment Design for Learning the Optimal Power Flow
Thomas Wolgast
Astrid Nieße
AI4CE
18
0
0
01 May 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
2
0
22 Jan 2025
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
30
0
0
03 Jun 2024
Learning Goal-Directed Object Pushing in Cluttered Scenes with Location-Based Attention
Nils Dengler
Juan Del Aguila Ferrandis
João Moura
S. Vijayakumar
Maren Bennewitz
59
0
0
26 Mar 2024
Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning
Zida Wu
Mathieu Lauriere
Samuel Jia Cong Chua
M. Geist
Olivier Pietquin
Ankur M. Mehta
37
5
0
06 Mar 2024
EgoGen: An Egocentric Synthetic Data Generator
Gen Li
Kai Zhao
Siwei Zhang
X. Lyu
Mihai Dusmanu
Yan Zhang
Marc Pollefeys
Siyu Tang
EgoV
VGen
42
14
0
16 Jan 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
Stephen Mak
Liming Xu
Tim Pearce
Michael Ostroumov
Alexandra Brintrup
29
11
0
26 Oct 2023
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
Xihuai Wang
Shao Zhang
Wenhao Zhang
Wentao Dong
Jingxiao Chen
Ying Wen
Weinan Zhang
30
8
0
08 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
33
28
0
28 Jul 2023
Eco-evolutionary Dynamics of Non-episodic Neuroevolution in Large Multi-agent Environments
Hamon Gautier
Eleni Nisioti
Clément Moulin-Frier
31
2
0
18 Feb 2023
MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks
Raffaele Galliera
A. Morelli
Roberto Fronteddu
N. Suri
26
4
0
02 Feb 2023
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan
Abdullah O. Alomar
John R. Williams
33
6
0
14 Nov 2022
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning
Caroline Wang
Garrett A. Warnell
Peter Stone
40
3
0
26 Oct 2022
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning
Carlos M. Casas
B. Carro
Antonio J. Sánchez-Esguevillas
14
1
0
06 Oct 2022
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
60
100
0
06 Nov 2021
Learning to Iteratively Solve Routing Problems with Dual-Aspect Collaborative Transformer
Yining Ma
Jingwen Li
Zhiguang Cao
Wen Song
Le Zhang
Zhenghua Chen
Jing Tang
83
129
0
06 Oct 2021
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning
Nikita Rudin
David Hoeller
Philipp Reist
Marco Hutter
115
546
0
24 Sep 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
29
9
0
04 Aug 2021
Tianshou: a Highly Modularized Deep Reinforcement Learning Library
Jiayi Weng
Huayu Chen
Dong Yan
Kaichao You
Alexis Duburcq
Minghao Zhang
Yi Su
Hang Su
Jun Zhu
NoLa
OffRL
30
194
0
29 Jul 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
32
5
0
11 Jun 2021
Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning
Guillaume Bellegarda
Yiyu Chen
Zhuochen Liu
Quan Nguyen
34
44
0
11 Mar 2021
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
18
11
0
25 Oct 2020
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Cédric Colas
B. Hejblum
S. Rouillon
R. Thiébaut
Pierre-Yves Oudeyer
Clément Moulin-Frier
M. Prague
13
22
0
09 Oct 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
213
0
10 Jun 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
32
57
0
12 May 2020
Learning Variable Ordering Heuristics for Solving Constraint Satisfaction Problems
Wen Song
Zhiguang Cao
Jie Zhang
Andrew Lim
21
33
0
23 Dec 2019
Learning Improvement Heuristics for Solving Routing Problems
Yaoxin Wu
Wen Song
Zhiguang Cao
Jie Zhang
Andrew Lim
33
281
0
12 Dec 2019
MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
V. Sivakumar
Olivier Delalleau
Tim Rocktaschel
Alexander H. Miller
Heinrich Küttler
Nantas Nardelli
Michael G. Rabbat
Joelle Pineau
Sebastian Riedel
10
36
0
09 Oct 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
32
285
0
12 Jun 2019
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
15
40
0
07 Feb 2019
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks
Fabio Pardo
Vitaly Levdik
Petar Kormushev
25
4
0
06 Oct 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
16
126
0
27 Feb 2018
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
1