Linear Stochastic Bandits Under Safety Constraints

Neural Information Processing Systems (NeurIPS), 2019

16 August 2019

Sanae Amani

M. Alizadeh

Christos Thrampoulidis

Papers citing "Linear Stochastic Bandits Under Safety Constraints"

50 / 91 papers shown

Multi-Armed Bandits with Minimum Aggregated Revenue Constraints

14 Oct 2025

On the Regularity and Fairness of Combinatorial Multi-Armed Bandit

Xiaoyi Wu

Bin Li

FaML

335

15 Sep 2025

Secure Best Arm Identification in the Presence of a Copycat

Asaf Cohen

Onur Günlü

235

25 Jul 2025

Adapting to Heterophilic Graph Data with Structure-Guided Neighbor Discovery

170

10 Jun 2025

Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget

Conference on Uncertainty in Artificial Intelligence (UAI), 2025

Jie Bian

Vincent Y. F. Tan

260

03 Jun 2025

Adversarial bandit optimization for approximately linear functions

IFIP Working Conference on Database Semantics (IWDS), 2025

Zhuoyu Cheng

Kohei Hatano

Eiji Takimoto

528

27 May 2025

The Safety-Privacy Tradeoff in Linear Bandits

International Symposium on Information Theory (ISIT), 2025

226

23 Apr 2025

Constrained Linear Thompson Sampling

Aditya Gangrade

Venkatesh Saligrama

362

03 Mar 2025

Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces

Amirhossein Roknilamouki

372

25 Feb 2025

Near-Linear MIR Algorithms for Stochastically-Ordered Priors

Algorithmic Game Theory (AGT), 2025

454

18 Feb 2025

Improved Regret Bounds for Online Fair Division with Bandit Learning

AAAI Conference on Artificial Intelligence (AAAI), 2025

Benjamin G. Schiffer

Shirley Zhang

264

13 Jan 2025

Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts

International Conference on Learning Representations (ICLR), 2025

294

03 Jan 2025

Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints

Udvas Das

Debabrota Basu

301

24 Oct 2024

Flipping-based Policy for Chance-Constrained Markov Decision Processes

Neural Information Processing Systems (NeurIPS), 2024

149

09 Oct 2024

Minimax-optimal trust-aware multi-armed bandits

Changxiao Cai

Jiacheng Zhang

242

04 Oct 2024

Honor Among Bandits: No-Regret Learning for Online Fair Division

Ariel D. Procaccia

Benjamin Schiffer

Shirley Zhang

FaML

294

01 Jul 2024

Testing the Feasibility of Linear Programs with Bandit Feedback

Aditya Gangrade

Aditya Gopalan

Venkatesh Saligrama

Clayton Scott

307

21 Jun 2024

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Subhojyoti Mukherjee

Josiah P. Hanna

Robert Nowak

OffRL

294

04 Jun 2024

Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget

295

23 May 2024

Safe Exploration Using Bayesian World Models and Log-Barrier Optimization

211

09 May 2024

On Safety in Safe Bayesian Optimization

358

19 Mar 2024

Optimistic Safety for Online Convex Optimization with Unknown Linear Constraints

International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

Spencer Hutchinson

Tianyi Chen

Mahnoosh Alizadeh

333

09 Mar 2024

Truly No-Regret Learning in Constrained MDPs

436

24 Feb 2024

Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints

IEEE Transactions on Signal and Information Processing over Networks (TSIPN), 2024

Jiabin Lin

Shana Moothedath

489

21 Jan 2024

Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation

233

24 Dec 2023

Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration

Honghao Wei

Xin Liu

Lei Ying

217

22 Dec 2023

Risk-Aware Continuous Control with Neural Contextual Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2023

J. Ayala-Romero

A. Garcia-Saavedra

Xavier Pérez Costa

262

15 Dec 2023

A safe exploration approach to constrained Markov decision processes

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Tingting Ni

Maryam Kamgarpour

398

01 Dec 2023

Robust Best-arm Identification in Linear Bandits

Wei Wang

Sattar Vakili

Ilija Bogunovic

231

08 Nov 2023

Convex Methods for Constrained Linear Bandits

Amirhossein Afsharrad

Ahmadreza Moradipari

Sanjay Lall

238

07 Nov 2023

Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms

Neural Information Processing Systems (NeurIPS), 2023

323

05 Oct 2023

Price of Safety in Linear Best Arm Identification

297

15 Sep 2023

Directional Optimism for Safe Linear Bandits

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Spencer Hutchinson

Berkay Turan

M. Alizadeh

336

29 Aug 2023

Clustered Linear Contextual Bandits with Knapsacks

Yichuan Deng

M. Mamakos

Zhao Song

216

21 Aug 2023

Online Ad Procurement in Non-stationary Autobidding Worlds

Neural Information Processing Systems (NeurIPS), 2023

Jason Cheuk Nam Liang

Haihao Lu

Baoyu Zhou

170

10 Jul 2023

Pure Exploration in Bandits with Linear Constraints

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Emil Carlsson

Debabrota Basu

Fredrik D. Johansson

Devdatt Dubhashi

365

22 Jun 2023

Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints

International Conference on Machine Learning (ICML), 2023

305

09 Jun 2023

Disincentivizing Polarization in Social Networks

191

23 May 2023

The Impact of the Geometric Properties of the Constraint Set in Safe Optimization with Bandit Feedback

Conference on Learning for Dynamics & Control (L4DC), 2023

Spencer Hutchinson

Berkay Turan

M. Alizadeh

314

01 May 2023

Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2023

Liangyu Zhang

Yang Peng

Wenhao Yang

Zhihua Zhang

209

29 Apr 2023

Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

385

23 Feb 2023

A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints

International Conference on Machine Learning (ICML), 2023

Ming Shi

Yitao Liang

Ness B. Shroff

220

08 Feb 2023

Leveraging User-Triggered Supervision in Contextual Bandits

Alekh Agarwal

Claudio Gentile

T. V. Marinov

204

07 Feb 2023

Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback

International Conference on Machine Learning (ICML), 2023

Wonyoung Hedge Kim

G. Iyengar

A. Zeevi

205

31 Jan 2023

Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits

International Conference on Machine Learning (ICML), 2023

Yunlong Hou

Vincent Y. F. Tan

Zixin Zhong

212

31 Jan 2023

Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation

K. C. Kalagarla

Rahul Jain

Pierluigi Nuzzo

226

27 Jan 2023

Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget

Fathima Zarin Faizal

Jayakrishnan Nair

142

27 Nov 2022

Benefits of Monotonicity in Safe Exploration with Gaussian Processes

Conference on Uncertainty in Artificial Intelligence (UAI), 2022

Arpan Losalka

Jonathan Scarlett

236

03 Nov 2022

Safe Linear Bandits over Unknown Polytopes

Annual Conference Computational Learning Theory (COLT), 2022

Aditya Gangrade

Tianrui Chen

Venkatesh Saligrama

419

27 Sep 2022

Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Mohammad Kachuee

Sungjin Lee

301

17 Sep 2022