ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.02121
  4. Cited By
Learn What Not to Learn: Action Elimination with Deep Reinforcement
  Learning

Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning

6 September 2018
Tom Zahavy
Matan Haroush
Nadav Merlis
D. Mankowitz
Shie Mannor
ArXivPDFHTML

Papers citing "Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning"

41 / 41 papers shown
Title
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
50
0
0
24 Jan 2025
Safety through Permissibility: Shield Construction for Fast and Safe
  Reinforcement Learning
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
28
0
0
29 May 2024
FaaSched: A Jitter-Aware Serverless Scheduler
FaaSched: A Jitter-Aware Serverless Scheduler
Abhisek Panda
S. Sarangi
36
0
0
11 Mar 2023
Learning to Follow Instructions in Text-Based Games
Learning to Follow Instructions in Text-Based Games
Mathieu Tuli
Andrew C. Li
Pashootan Vaezipoor
Toryn Q. Klassen
Scott Sanner
Sheila A. McIlraith
36
13
0
08 Nov 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
11
6
0
31 Oct 2022
Knowledge-Guided Exploration in Deep Reinforcement Learning
Knowledge-Guided Exploration in Deep Reinforcement Learning
Sahisnu Mazumder
Bing-Quan Liu
Shuai Wang
Yingxuan Zhu
Xiaotian Yin
Lifeng Liu
Jian Li
46
4
0
26 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing:
  Benchmarks, Baselines, and Building Blocks for Natural Language Policy
  Optimization
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
240
0
03 Oct 2022
Selective Token Generation for Few-shot Natural Language Generation
Selective Token Generation for Few-shot Natural Language Generation
DaeJin Jo
Taehwan Kwon
Eun-Sol Kim
Sungwoong Kim
35
1
0
17 Sep 2022
An Analysis of Deep Reinforcement Learning Agents for Text-based Games
An Analysis of Deep Reinforcement Learning Agents for Text-based Games
Chen Chen
Yue Dai
Josiah Poon
Caren Han
LLMAG
19
2
0
09 Sep 2022
Aligning to Social Norms and Values in Interactive Narratives
Aligning to Social Norms and Values in Interactive Narratives
Prithviraj Ammanabrolu
Liwei Jiang
Maarten Sap
Hannaneh Hajishirzi
Yejin Choi
AI4CE
28
46
0
04 May 2022
A Practical AoI Scheduler in IoT Networks with Relays
A Practical AoI Scheduler in IoT Networks with Relays
Biplav Choudhury
Prasenjit Karmakar
Vijay K. Shah
Jeffrey H. Reed
11
1
0
08 Mar 2022
Active Learning of Quantum System Hamiltonians yields Query Advantage
Active Learning of Quantum System Hamiltonians yields Query Advantage
Arko Dutt
E. Pednault
C. Wu
S. Sheldon
J. Smolin
L. Bishop
I. Chuang
14
11
0
29 Dec 2021
Towards Autonomous Satellite Communications: An AI-based Framework to
  Address System-level Challenges
Towards Autonomous Satellite Communications: An AI-based Framework to Address System-level Challenges
J. Luis
Skylar Eiskowitz
Nils Pachler de la Osa
E. Crawley
B. Cameron
20
5
0
11 Dec 2021
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement
  Learning
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning
H. Chung
Jungtaek Kim
Boris Knyazev
Jinhwi Lee
Graham W. Taylor
Jaesik Park
Minsu Cho
SSL
OffRL
18
20
0
29 Oct 2021
Situated Dialogue Learning through Procedural Environment Generation
Situated Dialogue Learning through Procedural Environment Generation
Prithviraj Ammanabrolu
Renee Jia
Mark O. Riedl
109
14
0
07 Oct 2021
Generalization in Text-based Games via Hierarchical Reinforcement
  Learning
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
40
20
0
21 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural
  Language
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
32
24
0
20 Sep 2021
Deep hierarchical reinforcement agents for automated penetration testing
Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran
Ashlesha Akella
Maxwell Standen
Junae Kim
David Bowman
Toby J. Richer
Chin-Teng Lin Institution One
46
38
0
14 Sep 2021
A Systematic Survey of Text Worlds as Embodied Natural Language
  Environments
A Systematic Survey of Text Worlds as Embodied Natural Language Environments
Peter Alexander Jansen
LM&Ro
26
21
0
08 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Planning Spatial Networks with Monte Carlo Tree Search
Planning Spatial Networks with Monte Carlo Tree Search
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
27
7
0
12 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
14
117
0
11 Jun 2021
Training Value-Aligned Reinforcement Learning Agents Using a Normative
  Prior
Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior
Md Sultan al Nahian
Spencer Frazier
Brent Harrison
Mark O. Riedl
27
17
0
19 Apr 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
19
18
0
07 Feb 2021
Reinforcement Learning with Combinatorial Actions: An Application to
  Vehicle Routing
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
A. Delarue
Ross Anderson
Christian Tjandraatmadja
35
93
0
22 Oct 2020
Deep Reinforcement Learning with Stacked Hierarchical Attention for
  Text-based Games
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games
Yunqiu Xu
Meng Fang
Ling-Hao Chen
Yali Du
Qiufeng Wang
Chengqi Zhang
OffRL
25
44
0
22 Oct 2020
Text-based RL Agents with Commonsense Knowledge: New Challenges,
  Environments and Baselines
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
K. Murugesan
Mattia Atzeni
Pavan Kapanipathi
Pushkar Shukla
Sadhana Kumaravel
Gerald Tesauro
Kartik Talamadupula
Mrinmaya Sachan
Murray Campbell
LM&Ro
LLMAG
OffRL
32
54
0
08 Oct 2020
Keep CALM and Explore: Language Models for Action Generation in
  Text-based Games
Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Shunyu Yao
Rohan Rao
Matthew J. Hausknecht
Karthik Narasimhan
LLMAG
LM&Ro
19
127
0
06 Oct 2020
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and
  Act in Fantasy Worlds
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds
Prithviraj Ammanabrolu
Jack Urbanek
Margaret Li
Arthur Szlam
Tim Rocktaschel
Jason Weston
LM&Ro
19
44
0
01 Oct 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
27
310
0
25 Jun 2020
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement
  Learning
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiaolin Hu
Feng Chen
24
81
0
20 Jun 2020
How to Avoid Being Eaten by a Grue: Structured Exploration Strategies
  for Textual Worlds
How to Avoid Being Eaten by a Grue: Structured Exploration Strategies for Textual Worlds
Prithviraj Ammanabrolu
Ethan Tien
Matthew J. Hausknecht
Mark O. Riedl
LLMAG
24
50
0
12 Jun 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement
  Learning and Hierarchical Actions Filtering
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
17
39
0
31 Oct 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved
  Performance in Dense and Sparse Reward Environments
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
16
31
0
09 Oct 2019
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from
  forbidden action
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
16
12
0
04 Oct 2019
Interactive Fiction Games: A Colossal Adventure
Interactive Fiction Games: A Colossal Adventure
Matthew J. Hausknecht
Prithviraj Ammanabrolu
Marc-Alexandre Côté
Xingdi Yuan
LLMAG
LM&Ro
AI4CE
18
191
0
11 Sep 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through
  Likelihood Matching
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching
Tom Zahavy
Shie Mannor
HAI
23
30
0
24 Jan 2019
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
214
1,327
0
05 Jun 2016
Convolutional Neural Networks for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
255
13,368
0
25 Aug 2014
1