ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.00720
  4. Cited By
RecoGym: A Reinforcement Learning Environment for the problem of Product
  Recommendation in Online Advertising
v1v2 (latest)

RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising

2 August 2018
D. Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
    OffRL
ArXiv (abs)PDFHTML

Papers citing "RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising"

31 / 81 papers shown
Title
Understanding Longitudinal Dynamics of Recommender Systems with
  Agent-Based Modeling and Simulation
Understanding Longitudinal Dynamics of Recommender Systems with Agent-Based Modeling and Simulation
G. Adomavicius
Dietmar Jannach
Stephan Leitner
Jingjing Zhang
63
8
0
25 Aug 2021
What are you optimizing for? Aligning Recommender Systems with Human
  Values
What are you optimizing for? Aligning Recommender Systems with Human Values
J. Stray
Ivan Vendrov
Jeremy Nixon
Steven Adler
Dylan Hadfield-Menell
OffRL
71
55
0
22 Jul 2021
T-RECS: A Simulation Tool to Study the Societal Impact of Recommender
  Systems
T-RECS: A Simulation Tool to Study the Societal Impact of Recommender Systems
Eli Lucherini
Matthew Sun
Amy A. Winecoff
Arvind Narayanan
85
23
0
19 Jul 2021
Learning-To-Ensemble by Contextual Rank Aggregation in E-Commerce
Learning-To-Ensemble by Contextual Rank Aggregation in E-Commerce
Xuesi Wang
Guangda Huzhang
Qianying Lin
Qing Da
30
1
0
19 Jul 2021
Imitate TheWorld: A Search Engine Simulation Platform
Imitate TheWorld: A Search Engine Simulation Platform
Yongqing Gao
Guangda Huzhang
Weijie Shen
Yawen Liu
Wen-Ji Zhou
Qing Da
Yang Yu
61
3
0
16 Jul 2021
Improving Long-Term Metrics in Recommendation Systems using
  Short-Horizon Reinforcement Learning
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
65
4
0
01 Jun 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender
  Ecosystems
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
56
33
0
14 Mar 2021
No-Regret Algorithms for Private Gaussian Process Bandit Optimization
No-Regret Algorithms for Private Gaussian Process Bandit Optimization
Abhimanyu Dubey
66
13
0
24 Feb 2021
Deep Reinforcement Learning-Based Product Recommender for Online
  Advertising
Deep Reinforcement Learning-Based Product Recommender for Online Advertising
Milad Vaali Esfahaani
Yanbo Xue
P. Setoodeh
OffRL
28
3
0
30 Jan 2021
Do Offline Metrics Predict Online Performance in Recommender Systems?
Do Offline Metrics Predict Online Performance in Recommender Systems?
K. Krauth
Sarah Dean
Alex Zhao
Wenshuo Guo
Mihaela Curmei
Benjamin Recht
Michael I. Jordan
OffRL
73
41
0
07 Nov 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
38
27
0
04 Nov 2020
Generalization to New Actions in Reinforcement Learning
Generalization to New Actions in Reinforcement Learning
Ayush Jain
Andrew Szot
Joseph J. Lim
AI4CE
94
35
0
03 Nov 2020
MARS-Gym: A Gym framework to model, train, and evaluate Recommender
  Systems for Marketplaces
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces
Marlesson R. O. Santana
Luckeciano C. Melo
Fernando H. F. Camargo
Bruno Brandão
Anderson Soares
Renan M. Oliveira
Sandor Caetano
OffRL
37
15
0
30 Sep 2020
From Clicks to Conversions: Recommendation for long-term reward
From Clicks to Conversions: Recommendation for long-term reward
Philomene Chagniot
Flavian Vasile
D. Rohde
OffRL
20
2
0
01 Sep 2020
BLOB : A Probabilistic Model for Recommendation that Combines Organic
  and Bandit Signals
BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Otmane Sakhi
Stephen Bonner
D. Rohde
Flavian Vasile
94
37
0
28 Aug 2020
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible
  Off-Policy Evaluation
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
201
75
0
17 Aug 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSLOffRL
157
202
0
10 Jun 2020
AliExpress Learning-To-Rank: Maximizing Online Model Performance without
  Going Online
AliExpress Learning-To-Rank: Maximizing Online Model Performance without Going Online
Guangda Huzhang
Zhen-Jia Pang
Yongqing Gao
Yawen Liu
Weijie Shen
...
Qing Da
Anxiang Zeng
Han Yu
Yang Yu
Zhi Zhou
108
4
0
25 Mar 2020
IKEA Furniture Assembly Environment for Long-Horizon Complex
  Manipulation Tasks
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
96
124
0
17 Nov 2019
Reconsidering Analytical Variational Bounds for Output Layers of Deep
  Networks
Reconsidering Analytical Variational Bounds for Output Layers of Deep Networks
Otmane Sakhi
Stephen Bonner
D. Rohde
Flavian Vasile
BDL
20
1
0
02 Oct 2019
Learning from Bandit Feedback: An Overview of the State-of-the-art
Learning from Bandit Feedback: An Overview of the State-of-the-art
Olivier Jeunen
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Alexandre Gilotte
Martin Bompaire
OffRL
37
10
0
18 Sep 2019
Towards Sharing Task Environments to Support Reproducible Evaluations of
  Interactive Recommender Systems
Towards Sharing Task Environments to Support Reproducible Evaluations of Interactive Recommender Systems
Andrea Barraza-Urbina
Mathieu d’Aquin
LRM
16
2
0
13 Sep 2019
How robust is MovieLens? A dataset analysis for recommender systems
How robust is MovieLens? A dataset analysis for recommender systems
A. Tousch
13
5
0
12 Sep 2019
RecSim: A Configurable Simulation Platform for Recommender Systems
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
118
183
0
11 Sep 2019
Recommendation System-based Upper Confidence Bound for Online
  Advertising
Recommendation System-based Upper Confidence Bound for Online Advertising
Nhan Nguyen-Thanh
D. Marinca
K. Khawam
D. Rohde
Flavian Vasile
E. Lohan
Steven Martin
Dominique Quadri
OffRL
56
13
0
09 Sep 2019
DEAR: Deep Reinforcement Learning for Online Advertising Impression in
  Recommender Systems
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
Xiangyu Zhao
Changsheng Gu
Haoshenglun Zhang
Xiwang Yang
Xiaobing Liu
Jiliang Tang
Hui Liu
OffRL
81
102
0
09 Sep 2019
On the Value of Bandit Feedback for Offline Recommender System
  Evaluation
On the Value of Bandit Feedback for Offline Recommender System Evaluation
Olivier Jeunen
D. Rohde
Flavian Vasile
OffRL
51
10
0
26 Jul 2019
Tripartite Heterogeneous Graph Propagation for Large-scale Social
  Recommendation
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation
KyungHyun Kim
Donghyun Kwak
Hanock Kwak
Young-Jin Park
Sangkwon Sim
Jae-Han Cho
Minkyu Kim
Jihun Kwon
Nako Sung
Jung-Woo Ha
71
19
0
24 Jul 2019
Three Methods for Training on Bandit Feedback
Three Methods for Training on Bandit Feedback
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Martin Bompaire
Olivier Jeunen
OffRL
39
7
0
24 Apr 2019
Latent Variable Session-Based Recommendation
Latent Variable Session-Based Recommendation
D. Rohde
Stephen Bonner
BDL
57
3
0
24 Apr 2019
Deep reinforcement learning for search, recommendation, and online
  advertising: a survey
Deep reinforcement learning for search, recommendation, and online advertising: a survey
Xiangyu Zhao
Long Xia
Jiliang Tang
Dawei Yin
OffRL
95
85
0
18 Dec 2018
Previous
12