Policy Gradients for Contextual Recommendations

12 February 2018
Feiyang Pan
Qingpeng Cai
Pingzhong Tang
Fuzhen Zhuang
Qing He
    OffRL

Papers citing "Policy Gradients for Contextual Recommendations"

13 / 13 papers shown
Contrastive Representation for Interactive Recommendation
Jingyu Li
Zhiyong Feng
Dongxiao He
Hongqi Chen
Qinghang Gao
Guoli Wu
59
0
0
24 Dec 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
42
0
0
16 Jun 2024
Contextual Decision Trees
Tommaso Aldinucci
Enrico Civitelli
Leonardo Di Gangi
Alessandro Sestini
46
3
0
13 Jul 2022
Personalized Transfer of User Preferences for Cross-domain Recommendation
Yongchun Zhu
Zhenwei Tang
Yudan Liu
Fuzhen Zhuang
Ruobing Xie
Xu-Yao Zhang
Leyu Lin
Qing He
65
175
0
21 Oct 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
161
101
0
01 Jul 2021
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
52
4
0
01 Jun 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
48
8
0
03 May 2021
Deep Reinforcement Learning-Based Product Recommender for Online Advertising
Milad Vaali Esfahaani
Yanbo Xue
P. Setoodeh
OffRL
28
3
0
30 Jan 2021
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points
Long Yang
Qian Zheng
Gang Pan
92
21
0
02 Dec 2020
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
65
23
0
24 May 2020
Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions
Feiyang Pan
Xiang Ao
Pingzhong Tang
Min Lu
Dapeng Liu
Lei Xiao
Qing He
72
22
0
26 May 2019
Warm Up Cold-start Advertisements: Improving CTR Predictions via Learning to Learn ID Embeddings
Feiyang Pan
Shuokai Li
Xiang Ao
Pingzhong Tang
Qing He
80
188
0
25 Apr 2019