Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2106.02757
Cited By
v1
v2 (latest)
Heuristic-Guided Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
5 June 2021
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Heuristic-Guided Reinforcement Learning"
39 / 39 papers shown
Title
Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach
Sebastian Reboul
Hélène Halconruy
Randal Douc
OffRL
84
0
0
22 Oct 2025
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Mirko Stappert
Bernhard Lutz
Niklas Goby
Dirk Neumann
OffRL
196
0
0
03 Apr 2025
Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning
International Conference on Enterprise Information Systems (ICEIS), 2025
Victor Ulisses Pugliese
Oséias F. de A. Ferreira
Fabio A. Faria
OffRL
190
0
0
21 Mar 2025
Towards Bio-inspired Heuristically Accelerated Reinforcement Learning for Adaptive Underwater Multi-Agents Behaviour
Antoine Vivien
Thomas Chaffre
Matthew Stephenson
Eva Artusi
Paulo E. Santos
Benoit Clement
Anne-Gwenn Bosser
AI4CE
174
0
0
10 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
International Conference on Learning Representations (ICLR), 2025
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
367
8
0
04 Feb 2025
MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking
Sebastian Farquhar
Vikrant Varma
David Lindner
David Elson
Caleb Biddulph
Ian Goodfellow
Rohin Shah
364
10
0
22 Jan 2025
Online inductive learning from answer sets for efficient reinforcement learning exploration
Celeste Veronese
Daniele Meli
Alessandro Farinelli
OnRL
180
2
0
13 Jan 2025
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
David Meger
406
0
0
03 Jan 2025
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Conference on Learning for Dynamics & Control (L4DC), 2024
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep Chinchali
Ufuk Topcu
OffRL
378
1
0
02 Dec 2024
On the Modeling Capabilities of Large Language Models for Sequential Decision Making
International Conference on Learning Representations (ICLR), 2024
Martin Klissarov
Devon Hjelm
Alexander Toshev
Bogdan Mazoure
LM&Ro
ELM
OffRL
LRM
256
5
0
08 Oct 2024
Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Scheduling
Kunal Jain
Anjaly Parayil
Ankur Mallick
Esha Choukse
Xiaoting Qin
...
Chetan Bansal
Victor Rühle
Anoop Kulkarni
Steve Kofsky
Saravan Rajmohan
153
2
0
24 Aug 2024
Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis
Zhihao Zhou
Qile Liu
Jiyuan Wang
Zhen Liang
144
0
0
22 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2024
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
573
17
0
06 Aug 2024
Proofread: Fixes All Errors with One Tap
Renjie Liu
Yanxiang Zhang
Yun Zhu
Haicheng Sun
Yuanbo Zhang
Michael Xuelin Huang
Shanqing Cai
Lei Meng
Shumin Zhai
ALM
163
4
0
06 Jun 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
243
1
0
06 May 2024
On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning
Giuseppe Canonaco
Leo Ardon
Alberto Pozanco
Daniel Borrajo
OffRL
210
2
0
11 Apr 2024
Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method
Kyuwon Choi
Cheolkyun Rho
Taeyoun Kim
D. Choi
OffRL
130
0
0
21 Mar 2024
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning
Hao Chen
Weiwei Wan
Masaki Matsushita
Takeyuki Kotaka
Kensuke Harada
147
4
0
18 Jan 2024
Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping
Lauren H. Cooke
Harvey Klyne
Edwin Zhang
Cassidy Laidlaw
Milind Tambe
Finale Doshi-Velez
303
2
0
15 Dec 2023
On Using Admissible Bounds for Learning Forward Search Heuristics
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Carlos Núnez-Molina
Masataro Asai
Pablo Mesejo
Juan Fernández-Olivares
248
4
0
23 Aug 2023
Towards an On-device Agent for Text Rewriting
Yun Zhu
Yinxiao Liu
Felix Stahlberg
Shankar Kumar
Yu-hui Chen
Liangchen Luo
Lei Shu
Renjie Liu
Jindong Chen
Lei Meng
LLMAG
151
8
0
22 Aug 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
205
19
0
06 Jul 2023
SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially Observable Multi-Agent Path Finding
IEEE Robotics and Automation Letters (RA-L), 2023
Qiushi Lin
Hang Ma
250
27
0
05 Jul 2023
Improving Offline RL by Blending Heuristics
International Conference on Learning Representations (ICLR), 2023
Sinong Geng
Aldo Pacchiano
Andrey Kolobov
Ching-An Cheng
OffRL
185
9
0
01 Jun 2023
Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach
IEEE Conference on Network Softwarization (NetSoft), 2023
Jamila Alsayed Kassem
Li Zhong
Arie Taal
Paola Grosso
76
1
0
25 Apr 2023
Accelerating exploration and representation learning with offline pre-training
Bogdan Mazoure
Jake Bruce
Doina Precup
Rob Fergus
Ankit Anand
OffRL
221
7
0
31 Mar 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction
International Conference on Machine Learning (ICML), 2023
Hoai-An Nguyen
Ching-An Cheng
OffRL
292
3
0
06 Jan 2023
Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Conference on Robot Learning (CoRL), 2022
Yanjiang Guo
Jingyue Gao
Zheng Wu
Chengming Shi
Jianyu Chen
OffRL
222
7
0
03 Dec 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Neural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
189
91
0
18 Oct 2022
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data
Neural Information Processing Systems (NeurIPS), 2022
Allen Nie
Yannis Flet-Berliac
Deon R. Jordan
William Steenbergen
Emma Brunskill
OffRL
225
13
0
16 Oct 2022
Making Reinforcement Learning Work on Swimmer
Mael Franceschetti
Coline Lacoux
Ryan Ohouens
Antonin Raffin
Olivier Sigaud
OffRL
123
8
0
16 Aug 2022
Lyapunov Design for Robust and Efficient Robotic Reinforcement Learning
Conference on Robot Learning (CoRL), 2022
T. Westenbroek
F. Castañeda
Ayush Agrawal
S. Shankar Sastry
Koushil Sreenath
220
34
0
13 Aug 2022
Hindsight Learning for MDPs with Exogenous Inputs
International Conference on Machine Learning (ICML), 2022
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
191
25
0
13 Jul 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Neural Information Processing Systems (NeurIPS), 2022
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
304
81
0
03 Jun 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
243
144
0
05 Feb 2022
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Neural Information Processing Systems (NeurIPS), 2021
Bogdan Mazoure
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OffRL
156
25
0
29 Nov 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Artificial Intelligence Review (AIR), 2021
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
213
143
0
29 Jun 2021
Safe Reinforcement Learning Using Advantage-Based Intervention
Nolan Wagener
Byron Boots
Ching-An Cheng
253
63
0
16 Jun 2021
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
176
4
0
01 Jun 2021
1