Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.12624
Cited By
v1
v2 (latest)
Top-k Combinatorial Bandits with Full-Bandit Feedback
28 May 2019
Idan Rejwan
Yishay Mansour
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Top-k Combinatorial Bandits with Full-Bandit Feedback"
35 / 35 papers shown
Title
Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback
Vaneet Aggarwal
Shweta Jain
Subham Pokhriyal
Christopher J. Quinn
519
0
0
15 Mar 2025
Bandit and Delayed Feedback in Online Structured Prediction
Yuki Shibukawa
Taira Tsuchiya
Shinsaku Sakaue
Kenji Yamanishi
OffRL
102
0
0
26 Feb 2025
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
Xiaoyi Wu
Bo Ji
Bin Li
FaML
113
0
0
01 Jan 2025
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Tatsuhiro Shimizu
Koichi Tanaka
Ren Kishimoto
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
CML
OffRL
88
1
0
20 Aug 2024
Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents
Lucia Gordon
Esther Rolf
Milind Tambe
50
1
0
06 Aug 2024
A Contextual Combinatorial Bandit Approach to Negotiation
Yexin Li
Zhancun Mu
Siyuan Qi
54
2
0
30 Jun 2024
No-Regret M
♮
{}^{\natural}
♮
-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting
Taihei Oki
Shinsaku Sakaue
88
0
0
21 May 2024
Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Joongkyu Lee
Min-hwan Oh
93
7
0
16 May 2024
Combinatorial Stochastic-Greedy Bandit
Fares Fourati
Christopher J. Quinn
Mohamed-Slim Alouini
Vaneet Aggarwal
78
9
0
13 Dec 2023
Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Han Huang
Li Shen
Deheng Ye
Wei Liu
29
0
0
24 Aug 2023
Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback
Yiliu Wang
Wei Chen
Milan Vojnović
21
3
0
25 May 2023
Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback
M. Pedramfar
Vaneet Aggarwal
79
2
0
23 Mar 2023
Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy
Alexandre Larouche
Audrey Durand
Richard Khoury
C. Sirois
18
0
0
10 Dec 2022
Top-k data selection via distributed sample quantile inference
Xu Zhang
M. Vasconcelos
57
1
0
01 Dec 2022
Mixed-Effect Thompson Sampling
Imad Aouali
Branislav Kveton
S. Katariya
OffRL
90
12
0
30 May 2022
Gaussian Process Bandits with Aggregated Feedback
Mengyan Zhang
Russell Tsuchida
Cheng Soon Ong
53
6
0
24 Dec 2021
Scaling Blockchains: Can Committee-Based Consensus Help?
Alon Benhaim
B. Falk
Gerry Tsoukalas
48
14
0
16 Oct 2021
Pure Exploration and Regret Minimization in Matching Bandits
Flore Sentenac
Jialin Yi
Clément Calauzènes
Vianney Perchet
Milan Vojnović
31
6
0
31 Jul 2021
Dueling Bandits with Team Comparisons
Lee Cohen
Ulrike Schmidt-Kraepelin
Yishay Mansour
22
1
0
06 Jul 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
70
0
0
12 Apr 2021
Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure
Jin Chen
T. Ge
Gangwei Jiang
Qing Cui
Defu Lian
Kai Zheng
42
5
0
02 Mar 2021
Top-
k
k
k
eXtreme Contextual Bandits with Arm Hierarchy
Rajat Sen
Alexander Rakhlin
Lexing Ying
Rahul Kidambi
Dean Phillips Foster
Daniel N. Hill
Inderjit Dhillon
116
10
0
15 Feb 2021
Adversarial Combinatorial Bandits with General Non-linear Reward Functions
Xi Chen
Yanjun Han
Yining Wang
66
17
0
05 Jan 2021
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Yuko Kuroki
Junya Honda
Masashi Sugiyama
OffRL
48
1
0
31 Dec 2020
Fully Gap-Dependent Bounds for Multinomial Logit Bandit
Jiaqi Yang
108
2
0
19 Nov 2020
DART: aDaptive Accept RejecT for non-linear top-K subset identification
Mridul Agarwal
Vaneet Aggarwal
Christopher J. Quinn
A. Umrawal
16
4
0
16 Nov 2020
Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
AAML
54
13
0
15 Oct 2020
Online Dense Subgraph Discovery via Blurred-Graph Feedback
Yuko Kuroki
Atsushi Miyauchi
Junya Honda
Masashi Sugiyama
66
16
0
24 Jun 2020
Preference-based Reinforcement Learning with Finite-Time Guarantees
Yichong Xu
Ruosong Wang
Lin F. Yang
Aarti Singh
A. Dubrawski
106
60
0
16 Jun 2020
Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback
Yihan Du
Yuko Kuroki
Wei Chen
OffRL
24
4
0
14 Jun 2020
Statistically Efficient, Polynomial Time Algorithms for Combinatorial Semi Bandits
Thibaut Cuvelier
Richard Combes
É. Gourdin
44
21
0
17 Feb 2020
Tight Lower Bounds for Combinatorial Multi-Armed Bandits
Nadav Merlis
Shie Mannor
42
19
0
13 Feb 2020
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
65
8
0
23 Jan 2020
Thompson Sampling for Combinatorial Network Optimization in Unknown Environments
Alihan Huyuk
Cem Tekin
69
16
0
07 Jul 2019
Stochastic Top-
K
K
K
Subset Bandits with Linear Space and Non-Linear Feedback
Mridul Agarwal
Vaneet Aggarwal
Christopher J. Quinn
A. Umrawal
37
10
0
29 Nov 2018
1