1908.05814
Linear Stochastic Bandits Under Safety Constraints
Neural Information Processing Systems (NeurIPS), 2019
16 August 2019
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
Papers citing
"Linear Stochastic Bandits Under Safety Constraints"
50 / 91 papers shown
Multi-Armed Bandits with Minimum Aggregated Revenue Constraints
Ahmed Ben Yahmed
Hafedh El Ferchichi
Marc Abeille
Vianney Perchet
97
0
0
14 Oct 2025
On the Regularity and Fairness of Combinatorial Multi-Armed Bandit
Xiaoyi Wu
Bin Li
FaML
335
0
0
15 Sep 2025
Secure Best Arm Identification in the Presence of a Copycat
Asaf Cohen
Onur Günlü
235
1
0
25 Jul 2025
Adapting to Heterophilic Graph Data with Structure-Guided Neighbor Discovery
Victor M. Tenorio
Madeline Navarro
Samuel Rey
Santiago Segarra
Antonio G. Marques
170
1
0
10 Jun 2025
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Jie Bian
Vincent Y. F. Tan
260
0
0
03 Jun 2025
Adversarial bandit optimization for approximately linear functions
IFIP Working Conference on Database Semantics (IWDS), 2025
Zhuoyu Cheng
Kohei Hatano
Eiji Takimoto
528
1
0
27 May 2025
The Safety-Privacy Tradeoff in Linear Bandits
International Symposium on Information Theory (ISIT), 2025
Arghavan Zibaie
Spencer Hutchinson
Ramtin Pedarsani
Mahnoosh Alizadeh
226
0
0
23 Apr 2025
Constrained Linear Thompson Sampling
Aditya Gangrade
Venkatesh Saligrama
362
0
0
03 Mar 2025
Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces
Amirhossein Roknilamouki
A. Ghosh
Ming Shi
Fatemeh Nourzad
Eylem Ekici
Ness B. Shroff
372
5
0
25 Feb 2025
Near-Linear MIR Algorithms for Stochastically-Ordered Priors
Algorithmic Game Theory (AGT), 2025
Gal Bahar
Omer Ben-Porat
Kevin Leyton-Brown
Moshe Tennenholtz
454
0
0
18 Feb 2025
Improved Regret Bounds for Online Fair Division with Bandit Learning
AAAI Conference on Artificial Intelligence (AAAI), 2025
Benjamin G. Schiffer
Shirley Zhang
264
5
0
13 Jan 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
International Conference on Learning Representations (ICLR), 2025
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
294
4
0
03 Jan 2025
Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
Udvas Das
Debabrota Basu
301
0
0
24 Oct 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Neural Information Processing Systems (NeurIPS), 2024
Xun Shen
Shuo Jiang
Akifumi Wachi
Kazumune Hashimoto
Sebastien Gros
149
2
0
09 Oct 2024
Minimax-optimal trust-aware multi-armed bandits
Changxiao Cai
Jiacheng Zhang
242
0
0
04 Oct 2024
Honor Among Bandits: No-Regret Learning for Online Fair Division
Ariel D. Procaccia
Benjamin Schiffer
Shirley Zhang
FaML
294
9
0
01 Jul 2024
Testing the Feasibility of Linear Programs with Bandit Feedback
Aditya Gangrade
Aditya Gopalan
Venkatesh Saligrama
Clayton Scott
307
3
0
21 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
294
0
0
04 Jun 2024
Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget
Dengwang Tang
Rahul Jain
Ashutosh Nayyar
Pierluigi Nuzzo
295
2
0
23 May 2024
Safe Exploration Using Bayesian World Models and Log-Barrier Optimization
Yarden As
Bhavya Sukhija
Andreas Krause
OffRL
211
2
0
09 May 2024
On Safety in Safe Bayesian Optimization
Christian Fiedler
Johanna Menn
Lukas Kreisköther
Sebastian Trimpe
358
17
0
19 Mar 2024
Optimistic Safety for Online Convex Optimization with Unknown Linear Constraints
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Spencer Hutchinson
Tianyi Chen
Mahnoosh Alizadeh
333
1
0
09 Mar 2024
Truly No-Regret Learning in Constrained MDPs
Adrian Müller
Pragnya Alatur
Volkan Cevher
Giorgia Ramponi
Niao He
436
17
0
24 Feb 2024
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
IEEE Transactions on Signal and Information Processing over Networks (TSIPN), 2024
Jiabin Lin
Shana Moothedath
489
2
0
21 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
233
0
0
24 Dec 2023
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Honghao Wei
Xin Liu
Lei Ying
217
7
0
22 Dec 2023
Risk-Aware Continuous Control with Neural Contextual Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2023
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Pérez Costa
262
4
0
15 Dec 2023
A safe exploration approach to constrained Markov decision processes
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Tingting Ni
Maryam Kamgarpour
398
5
0
01 Dec 2023
Robust Best-arm Identification in Linear Bandits
Wei Wang
Sattar Vakili
Ilija Bogunovic
231
0
0
08 Nov 2023
Convex Methods for Constrained Linear Bandits
Amirhossein Afsharrad
Ahmadreza Moradipari
Sanjay Lall
238
5
0
07 Nov 2023
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Neural Information Processing Systems (NeurIPS), 2023
Akifumi Wachi
Wataru Hashimoto
Xun Shen
Kazumune Hashimoto
323
27
0
05 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
297
5
0
15 Sep 2023
Directional Optimism for Safe Linear Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Spencer Hutchinson
Berkay Turan
M. Alizadeh
336
9
0
29 Aug 2023
Clustered Linear Contextual Bandits with Knapsacks
Yichuan Deng
M. Mamakos
Zhao Song
216
0
0
21 Aug 2023
Online Ad Procurement in Non-stationary Autobidding Worlds
Neural Information Processing Systems (NeurIPS), 2023
Jason Cheuk Nam Liang
Haihao Lu
Baoyu Zhou
170
9
0
10 Jul 2023
Pure Exploration in Bandits with Linear Constraints
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Emil Carlsson
Debabrota Basu
Fredrik D. Johansson
Devdatt Dubhashi
365
12
0
22 Jun 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
International Conference on Machine Learning (ICML), 2023
Donghao Li
Ruiquan Huang
Cong Shen
Jing Yang
305
4
0
09 Jun 2023
Disincentivizing Polarization in Social Networks
C. Borgs
J. Chayes
Christian Ikeokwu
Ellen Vitercik
191
0
0
23 May 2023
The Impact of the Geometric Properties of the Constraint Set in Safe Optimization with Bandit Feedback
Conference on Learning for Dynamics & Control (L4DC), 2023
Spencer Hutchinson
Berkay Turan
M. Alizadeh
314
7
0
01 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
209
1
0
29 Apr 2023
Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems
Ting-Jui Chang
Sapana Chaudhary
D. Kalathil
Shahin Shahrampour
385
6
0
23 Feb 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
International Conference on Machine Learning (ICML), 2023
Ming Shi
Yitao Liang
Ness B. Shroff
220
17
0
08 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits
Alekh Agarwal
Claudio Gentile
T. V. Marinov
204
0
0
07 Feb 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback
International Conference on Machine Learning (ICML), 2023
Wonyoung Hedge Kim
G. Iyengar
A. Zeevi
205
4
0
31 Jan 2023
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
International Conference on Machine Learning (ICML), 2023
Yunlong Hou
Vincent Y. F. Tan
Zixin Zhong
212
1
0
31 Jan 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
226
6
0
27 Jan 2023
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget
Fathima Zarin Faizal
Jayakrishnan Nair
142
11
0
27 Nov 2022
Benefits of Monotonicity in Safe Exploration with Gaussian Processes
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Arpan Losalka
Jonathan Scarlett
236
1
0
03 Nov 2022
Safe Linear Bandits over Unknown Polytopes
Annual Conference on Computational Learning Theory (COLT), 2022
Aditya Gangrade
Tianrui Chen
Venkatesh Saligrama
419
15
0
27 Sep 2022
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Mohammad Kachuee
Sungjin Lee
301
4
0
17 Sep 2022