ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.00183
  4. Cited By
Learning Action Representations for Reinforcement Learning

Learning Action Representations for Reinforcement Learning

1 February 2019
Yash Chandak
Georgios Theocharous
James E. Kostas
Scott M. Jordan
Philip S. Thomas
ArXivPDFHTML

Papers citing "Learning Action Representations for Reinforcement Learning"

36 / 36 papers shown
Title
Learning Actionable World Models for Industrial Process Control
Learning Actionable World Models for Industrial Process Control
Peng Yan
Ahmed Abdulkadir
Gerrit A. Schatte
Giulia Anguzzi
Joonsu Gha
Nikola Pascher
Matthias Rosenthal
Yunlong Gao
Benjamin Grewe
Thilo Stadelmann
DRL
AI4CE
49
0
0
03 Mar 2025
Action Tokenizer Matters in In-Context Imitation Learning
An Vuong
M. Vu
Dong An
Ian Reid
66
1
0
03 Mar 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
55
0
0
24 Jan 2025
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
35
9
0
24 May 2024
Off-Policy Evaluation of Slate Bandit Policies via Optimizing
  Abstraction
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
27
5
0
03 Feb 2024
Distributional Off-Policy Evaluation for Slate Recommendations
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
46
0
0
27 Aug 2023
Policy Gradient Methods in the Presence of Symmetries and State
  Abstractions
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
38
2
0
09 May 2023
Generative Slate Recommendation with Reinforcement Learning
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Render
Maarten de Rijke
27
23
0
20 Jan 2023
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
37
9
0
18 Sep 2022
Selective Token Generation for Few-shot Natural Language Generation
Selective Token Generation for Few-shot Natural Language Generation
DaeJin Jo
Taehwan Kwon
Eun-Sol Kim
Sungwoong Kim
40
1
0
17 Sep 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
35
18
0
15 Sep 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation
  with Residual Actor
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
38
24
0
01 Jun 2022
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill
  Acquisition
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Dylan Slack
Yinlam Chow
Bo Dai
Nevan Wichers
OffRL
40
7
0
10 Feb 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Discrete and continuous representations and processing in deep learning:
  Looking forward
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
38
20
0
04 Jan 2022
Learning Large Neighborhood Search Policy for Integer Programming
Learning Large Neighborhood Search Policy for Integer Programming
Yaoxin Wu
Wen Song
Zhiguang Cao
Jie Zhang
32
41
0
01 Nov 2021
How to Sense the World: Leveraging Hierarchy in Multimodal Perception
  for Robust Reinforcement Learning Agents
How to Sense the World: Leveraging Hierarchy in Multimodal Perception for Robust Reinforcement Learning Agents
Miguel Vasco
Hang Yin
Francisco S. Melo
Ana Paiva
39
7
0
07 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
41
93
0
14 Sep 2021
Deep hierarchical reinforcement agents for automated penetration testing
Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran
Ashlesha Akella
Maxwell Standen
Junae Kim
David Bowman
Toby J. Richer
Chin-Teng Lin Institution One
51
38
0
14 Sep 2021
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via
  Hybrid Action Representation
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
37
42
0
12 Sep 2021
Low-Dimensional State and Action Representation Learning with MDP
  Homomorphism Metrics
Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics
N. Botteghi
M. Poel
B. Sirmaçek
C. Brune
24
3
0
04 Jul 2021
Generalization to New Actions in Reinforcement Learning
Generalization to New Actions in Reinforcement Learning
Ayush Jain
Andrew Szot
Joseph J. Lim
AI4CE
35
34
0
03 Nov 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
35
204
0
04 Oct 2020
State Action Separable Reinforcement Learning
State Action Separable Reinforcement Learning
Ziyao Zhang
Liang Ma
K. Leung
Konstantinos Poularakis
Mudhakar Srivatsa
31
2
0
05 Jun 2020
Temporally-Extended ε-Greedy Exploration
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
34
0
02 Jun 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward
  Estimation
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
69
18
0
09 May 2020
Shared Autonomy with Learned Latent Actions
Shared Autonomy with Learned Latent Actions
Hong Jun Jeon
Dylan P. Losey
Dorsa Sadigh
16
78
0
07 May 2020
Parameterizing Branch-and-Bound Search Trees to Learn Branching Policies
Parameterizing Branch-and-Bound Search Trees to Learn Branching Policies
Giulia Zarpellon
Jason Jo
Andrea Lodi
Yoshua Bengio
24
96
0
12 Feb 2020
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement
  Learning and Hierarchical Actions Filtering
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
25
39
0
31 Oct 2019
Dynamics Learning with Cascaded Variational Inference for Multi-Step
  Manipulation
Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation
Kuan Fang
Yuke Zhu
Animesh Garg
Silvio Savarese
Li Fei-Fei
DRL
23
47
0
29 Oct 2019
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from
  forbidden action
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
23
12
0
04 Oct 2019
The Differentiable Cross-Entropy Method
The Differentiable Cross-Entropy Method
Brandon Amos
Denis Yarats
34
54
0
27 Sep 2019
Controlling Assistive Robots with Learned Latent Actions
Controlling Assistive Robots with Learned Latent Actions
Dylan P. Losey
K. Srinivasan
Ajay Mandlekar
Animesh Garg
Dorsa Sadigh
34
69
0
20 Sep 2019
Dynamics-aware Embeddings
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
25
53
0
25 Aug 2019
Off-Policy Actor-Critic
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012
1