ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.01187
  4. Cited By
Thompson Sampling Algorithms for Cascading Bandits
v1v2v3v4 (latest)

Thompson Sampling Algorithms for Cascading Bandits

2 October 2018
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
ArXiv (abs)PDFHTML

Papers citing "Thompson Sampling Algorithms for Cascading Bandits"

10 / 10 papers shown
Title
Optimal Design for Human Feedback
Optimal Design for Human Feedback
Subhojyoti Mukherjee
Anusha Lalitha
Kousha Kalantari
Aniket Deshmukh
Ge Liu
Yifei Ma
Branislav Kveton
65
0
0
22 Apr 2024
Cascading Reinforcement Learning
Cascading Reinforcement Learning
Yihan Du
R. Srikant
Wei Chen
50
1
0
17 Jan 2024
AdaptEx: A Self-Service Contextual Bandit Platform
AdaptEx: A Self-Service Contextual Bandit Platform
W. Black
Ercüment Ilhan
A. Marchini
Vilda K. Markeviciute
43
3
0
08 Aug 2023
Ranking with Popularity Bias: User Welfare under Self-Amplification
  Dynamics
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics
Guy Tennenholtz
Martin Mladenov
Nadav Merlis
Robert L. Axtell
Craig Boutilier
53
0
0
24 May 2023
Multiplier Bootstrap-based Exploration
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
Branislav Kveton
R. Song
52
3
0
03 Feb 2023
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
Yunlong Hou
Vincent Y. F. Tan
Zixin Zhong
57
1
0
31 Jan 2023
Overcoming Prior Misspecification in Online Learning to Rank
Overcoming Prior Misspecification in Online Learning to Rank
Javad Azizi
Ofer Meshi
M. Zoghi
Maryam Karimzadehgan
70
1
0
25 Jan 2023
Minimax Regret for Cascading Bandits
Minimax Regret for Cascading Bandits
Daniel Vial
Sujay Sanghavi
Sanjay Shakkottai
R. Srikant
57
14
0
23 Mar 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning
  Framework
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
69
14
0
26 Feb 2022
Achieving the Pareto Frontier of Regret Minimization and Best Arm
  Identification in Multi-Armed Bandits
Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
51
5
0
16 Oct 2021
1