arXiv:1512.01124
Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
3 December 2015
P. Sunehag
Richard Evans
Gabriel Dulac-Arnold
Yori Zwols
D. Visentin
Ben Coppin
Papers citing
"Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions"
13 / 13 papers shown
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Federico Tomasi
Joseph Cauteruccio
Surya Kanoria
K. Ciosek
Matteo Rinaldi
Zhenwen Dai
OffRL
30
5
0
13 Oct 2023
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
44
0
0
27 Aug 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Renders
Maarten de Rijke
27
23
0
20 Jan 2023
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
38
41
0
25 Apr 2022
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
54
20
0
15 Oct 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
30
179
0
11 Sep 2019
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology
Eugene Ie
Vihan Jain
Jing Wang
Sanmit Narvekar
Ritesh Agarwal
...
Vince Gatto
Paul Covington
Jim McFadden
Tushar Chandra
Craig Boutilier
OffRL
24
69
0
29 May 2019
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Yangchen Pan
Amir-massoud Farahmand
Martha White
S. Nabi
P. Grover
D. Nikovski
51
18
0
13 Jun 2018
Beyond Greedy Ranking: Slate Optimization via List-CVAE
Ray Jiang
Sven Gowal
Timothy A. Mann
Danilo Jimenez Rezende
22
49
0
05 Mar 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
29
328
0
19 Feb 2018
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
20
569
0
24 Dec 2015
Matroid Bandits: Fast Combinatorial Optimization with Learning
B. Kveton
Zheng Wen
Azin Ashkan
Hoda Eydgahi
Brian Eriksson
46
119
0
20 Mar 2014