Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.03845
Cited By
v1
v2
v3 (latest)
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
10 November 2019
Xueying Bai
Jian Guan
Hongning Wang
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation"
33 / 33 papers shown
Title
AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
Lingyue Fu
Ting Long
Jianghao Lin
Wei Xia
Xinyi Dai
Ruiming Tang
Yun Wang
Weinan Zhang
Yong Yu
OffRL
69
0
0
07 Apr 2025
Federated Control in Markov Decision Processes
Hao Jin
Yang Peng
Liangyu Zhang
Zhihua Zhang
FedML
67
0
0
07 May 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
86
2
0
26 Mar 2024
Aligning GPTRec with Beyond-Accuracy Goals with Reinforcement Learning
Aleksandr V. Petrov
Craig MacDonald
39
2
0
07 Mar 2024
ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning
Xiangyu Yin
Sihao Wu
Jiaxu Liu
Meng Fang
Xingyu Zhao
Xiaowei Huang
Wenjie Ruan
AAML
79
5
0
12 Dec 2023
Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
Jialin Liu
Xinyan Su
Zeyu He
Xiangyu Zhao
Jun Li
OffRL
51
0
0
30 Oct 2023
A General Neural Causal Model for Interactive Recommendation
Jialin Liu
Xinyan Su
Peng Zhou
Xiangyu Zhao
Jun Li
CML
46
0
0
30 Oct 2023
Reward Dropout Improves Control: Bi-objective Perspective on Reinforced LM
Changhun Lee
Chiehyeon Lim
68
0
0
06 Oct 2023
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
53
1
0
06 Oct 2023
A General Offline Reinforcement Learning Framework for Interactive Recommendation
Teng Xiao
Donglin Wang
OffRL
106
74
0
01 Oct 2023
Model-based Offline Policy Optimization with Adversarial Network
Junming Yang
Xingguo Chen
Shengyuan Wang
Bolei Zhang
OffRL
60
2
0
05 Sep 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
63
2
0
25 Aug 2023
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions
Tongxin Li
Yiheng Lin
Shaolei Ren
Adam Wierman
AAML
OffRL
93
8
0
20 Jul 2023
Robust Reinforcement Learning Objectives for Sequential Recommender Systems
Melissa Mozifian
Tristan Sylvain
David Evans
Li Meng
OffRL
51
0
0
30 May 2023
Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang
Xiaocong Chen
Dietmar Jannach
Lina Yao
CML
OffRL
114
28
0
17 Apr 2023
Effective Dimension in Bandit Problems under Censorship
G. Guinet
Saurabh Amin
Patrick Jaillet
44
1
0
14 Feb 2023
Multi-Task Recommendations with Reinforcement Learning
Ziru Liu
Jiejie Tian
Qingpeng Cai
Xiangyu Zhao
Jingtong Gao
...
Da Chen
Tonghao He
Dong Zheng
Peng Jiang
Kun Gai
114
44
0
07 Feb 2023
Optimizing DDPM Sampling with Shortcut Fine-Tuning
Ying Fan
Kangwook Lee
111
60
0
31 Jan 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Render
Maarten de Rijke
88
24
0
20 Jan 2023
Synthetic Data-Based Simulators for Recommender Systems: A Survey
Elizaveta Stavinova
A. Grigorievskiy
A. Volodkevich
P. Chunaev
Klavdiya Olegovna Bochenina
D. Bugaychenko
SyDa
69
8
0
22 Jun 2022
A generative recommender system with GMM prior for cancer drug generation and sensitivity prediction
Krzysztof Koras
Marcin Mo.zejko
Paula Szymczak
E. Staub
Ewa Szczurek
51
0
0
07 Jun 2022
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
125
44
0
25 Apr 2022
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
43
9
0
18 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
257
46
0
06 Oct 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation
Luo Ji
Qin Qi
Bingqing Han
Hongxia Yang
OffRL
55
28
0
20 Aug 2021
Conditional Sequential Slate Optimization
Yipeng Zhang
Mingjian Lu
Saratchandra Indrakanti
M. Kannadasan
A. Bagherjeiran
113
0
0
12 Aug 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
48
8
0
03 May 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
253
283
0
23 Jan 2021
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation
Diksha Garg
Priyanka Gupta
Pankaj Malhotra
Lovekesh Vig
Gautam M. Shroff
OffRL
37
8
0
16 Dec 2020
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation
Yanan Wang
Yong Ge
Li Li
Rui Chen
Tong Xu
OffRL
61
7
0
04 Dec 2020
Learning from eXtreme Bandit Feedback
Romain Lopez
Inderjit S. Dhillon
Michael I. Jordan
OffRL
91
25
0
27 Sep 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
69
9
0
25 May 2020
1