1811.05869
Cited By
Large-scale Interactive Recommendation with Tree-structured Policy Gradient
14 November 2018
Haokun Chen
Xinyi Dai
Han Cai
Weinan Zhang
Xuejian Wang
Ruiming Tang
Yuzhou Zhang
Yong Yu
Papers citing
"Large-scale Interactive Recommendation with Tree-structured Policy Gradient"
15 / 15 papers shown
Title
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
36
0
0
12 May 2025
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
B. Peng
Xia Ning
38
0
0
12 Oct 2024
Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems
Hojoon Lee
Dongyoon Hwang
Kyushik Min
Jaegul Choo
18
6
0
22 Aug 2023
AutoAssign+: Automatic Shared Embedding Assignment in Streaming Recommendation
Ziru Liu
Kecheng Chen
Fengyi Song
Bo Chen
Xiangyu Zhao
Huifeng Guo
Ruiming Tang
21
3
0
14 Aug 2023
Conversational Tree Search: A New Hybrid Dialog Task
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
51
7
0
17 Mar 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
33
41
0
03 Feb 2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
...
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
OffRL
CML
27
44
0
03 Feb 2023
Constrained Reinforcement Learning for Short Video Recommendation
Qingpeng Cai
Ruohan Zhan
Chi Zhang
Jie Zheng
Guangwei Ding
Pinghua Gong
Dong Zheng
Peng Jiang
33
6
0
26 May 2022
KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems
Chongming Gao
Shijun Li
Wenqiang Lei
Jiawei Chen
Biao Li
Peng Jiang
Xiangnan He
Jiaxin Mao
Tat-Seng Chua
37
131
0
22 Feb 2022
Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures
Anshumali Shrivastava
Zhao Song
Zhaozhuo Xu
27
28
0
30 Nov 2021
Reinforcement Learning based Path Exploration for Sequential Explainable Recommendation
Yicong Li
Hongxu Chen
Yile Li
Lin Li
Philip S. Yu
Guandong Xu
23
15
0
24 Nov 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
Toward Simulating Environments in Reinforcement Learning Based Recommendations
Xiangyu Zhao
Long Xia
Zhuoye Ding
Dawei Yin
Jiliang Tang
30
25
0
27 Jun 2019
Deep reinforcement learning for search, recommendation, and online advertising: a survey
Xiangyu Zhao
Long Xia
Jiliang Tang
Dawei Yin
OffRL
30
82
0
18 Dec 2018