ResearchTrend.AI
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation

10 November 2019
Xueying Bai
Jian Guan
Hongning Wang
    OffRL

Papers citing "Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation"

33 / 33 papers shown
AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
Lingyue Fu
Ting Long
Jianghao Lin
Wei Xia
Xinyi Dai
Ruiming Tang
Yun Wang
Weinan Zhang
Yong Yu
OffRL
69
0
0
07 Apr 2025
Federated Control in Markov Decision Processes
Hao Jin
Yang Peng
Liangyu Zhang
Zhihua Zhang
FedML
67
0
0
07 May 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
86
2
0
26 Mar 2024
Aligning GPTRec with Beyond-Accuracy Goals with Reinforcement Learning
Aleksandr V. Petrov
Craig MacDonald
39
2
0
07 Mar 2024
ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning
Xiangyu Yin
Sihao Wu
Jiaxu Liu
Meng Fang
Xingyu Zhao
Xiaowei Huang
Wenjie Ruan
AAML
79
5
0
12 Dec 2023
Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
Jialin Liu
Xinyan Su
Zeyu He
Xiangyu Zhao
Jun Li
OffRL
51
0
0
30 Oct 2023
A General Neural Causal Model for Interactive Recommendation
Jialin Liu
Xinyan Su
Peng Zhou
Xiangyu Zhao
Jun Li
CML
46
0
0
30 Oct 2023
Reward Dropout Improves Control: Bi-objective Perspective on Reinforced LM
Changhun Lee
Chiehyeon Lim
68
0
0
06 Oct 2023
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
53
1
0
06 Oct 2023
A General Offline Reinforcement Learning Framework for Interactive Recommendation
Teng Xiao
Donglin Wang
OffRL
106
74
0
01 Oct 2023
Model-based Offline Policy Optimization with Adversarial Network
Junming Yang
Xingguo Chen
Shengyuan Wang
Bolei Zhang
OffRL
60
2
0
05 Sep 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
63
2
0
25 Aug 2023
Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions
Tongxin Li
Yiheng Lin
Shaolei Ren
Adam Wierman
AAML, OffRL
93
8
0
20 Jul 2023
Robust Reinforcement Learning Objectives for Sequential Recommender Systems
Melissa Mozifian
Tristan Sylvain
David Evans
Li Meng
OffRL
51
0
0
30 May 2023
Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang
Xiaocong Chen
Dietmar Jannach
Lina Yao
CML, OffRL
114
28
0
17 Apr 2023
Effective Dimension in Bandit Problems under Censorship
G. Guinet
Saurabh Amin
Patrick Jaillet
44
1
0
14 Feb 2023
Multi-Task Recommendations with Reinforcement Learning
Ziru Liu
Jiejie Tian
Qingpeng Cai
Xiangyu Zhao
Jingtong Gao
...
Da Chen
Tonghao He
Dong Zheng
Peng Jiang
Kun Gai
114
44
0
07 Feb 2023
Optimizing DDPM Sampling with Shortcut Fine-Tuning
Ying Fan
Kangwook Lee
111
60
0
31 Jan 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Renders
Maarten de Rijke
88
24
0
20 Jan 2023
Synthetic Data-Based Simulators for Recommender Systems: A Survey
Elizaveta Stavinova
A. Grigorievskiy
A. Volodkevich
P. Chunaev
Klavdiya Olegovna Bochenina
D. Bugaychenko
SyDa
69
8
0
22 Jun 2022
A generative recommender system with GMM prior for cancer drug generation and sensitivity prediction
Krzysztof Koras
Marcin Możejko
Paula Szymczak
E. Staub
Ewa Szczurek
51
0
0
07 Jun 2022
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
125
44
0
25 Apr 2022
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
43
9
0
18 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
257
46
0
06 Oct 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation
Luo Ji
Qin Qi
Bingqing Han
Hongxia Yang
OffRL
55
28
0
20 Aug 2021
Conditional Sequential Slate Optimization
Yipeng Zhang
Mingjian Lu
Saratchandra Indrakanti
M. Kannadasan
A. Bagherjeiran
113
0
0
12 Aug 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
48
8
0
03 May 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
253
283
0
23 Jan 2021
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation
Diksha Garg
Priyanka Gupta
Pankaj Malhotra
Lovekesh Vig
Gautam M. Shroff
OffRL
37
8
0
16 Dec 2020
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation
Yanan Wang
Yong Ge
Li Li
Rui Chen
Tong Xu
OffRL
61
7
0
04 Dec 2020
Learning from eXtreme Bandit Feedback
Romain Lopez
Inderjit S. Dhillon
Michael I. Jordan
OffRL
91
25
0
27 Sep 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
69
9
0
25 May 2020