Linear Stochastic Bandits Under Safety Constraints

Neural Information Processing Systems (NeurIPS), 2019
16 August 2019
Sanae Amani
M. Alizadeh
Christos Thrampoulidis

Papers citing "Linear Stochastic Bandits Under Safety Constraints"

50 / 91 papers shown
Multi-Armed Bandits with Minimum Aggregated Revenue Constraints
Ahmed Ben Yahmed
Hafedh El Ferchichi
Marc Abeille
Vianney Perchet
97
0
0
14 Oct 2025
On the Regularity and Fairness of Combinatorial Multi-Armed Bandit
Xiaoyi Wu
Bin Li
FaML
335
0
0
15 Sep 2025
Secure Best Arm Identification in the Presence of a Copycat
Asaf Cohen
Onur Günlü
235
1
0
25 Jul 2025
Adapting to Heterophilic Graph Data with Structure-Guided Neighbor Discovery
Victor M. Tenorio
Madeline Navarro
Samuel Rey
Santiago Segarra
Antonio G. Marques
170
1
0
10 Jun 2025
Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Jie Bian
Vincent Y. F. Tan
260
0
0
03 Jun 2025
Adversarial bandit optimization for approximately linear functions
IFIP Working Conference on Database Semantics (IWDS), 2025
Zhuoyu Cheng
Kohei Hatano
Eiji Takimoto
528
1
0
27 May 2025
The Safety-Privacy Tradeoff in Linear Bandits
International Symposium on Information Theory (ISIT), 2025
Arghavan Zibaie
Spencer Hutchinson
Ramtin Pedarsani
Mahnoosh Alizadeh
226
0
0
23 Apr 2025
Constrained Linear Thompson Sampling
Aditya Gangrade
Venkatesh Saligrama
362
0
0
03 Mar 2025
Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces
Amirhossein Roknilamouki
A. Ghosh
Ming Shi
Fatemeh Nourzad
Eylem Ekici
Ness B. Shroff
372
5
0
25 Feb 2025
Near-Linear MIR Algorithms for Stochastically-Ordered Priors
Algorithmic Game Theory (AGT), 2025
Gal Bahar
Omer Ben-Porat
Kevin Leyton-Brown
Moshe Tennenholtz
454
0
0
18 Feb 2025
Improved Regret Bounds for Online Fair Division with Bandit Learning
AAAI Conference on Artificial Intelligence (AAAI), 2025
Benjamin G. Schiffer
Shirley Zhang
264
5
0
13 Jan 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
International Conference on Learning Representations (ICLR), 2025
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
294
4
0
03 Jan 2025
Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
Udvas Das
Debabrota Basu
301
0
0
24 Oct 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Neural Information Processing Systems (NeurIPS), 2024
Xun Shen
Shuo Jiang
Akifumi Wachi
Kazumune Hashimoto
Sebastien Gros
149
2
0
09 Oct 2024
Minimax-optimal trust-aware multi-armed bandits
Changxiao Cai
Jiacheng Zhang
242
0
0
04 Oct 2024
Honor Among Bandits: No-Regret Learning for Online Fair Division
Ariel D. Procaccia
Benjamin Schiffer
Shirley Zhang
FaML
294
9
0
01 Jul 2024
Testing the Feasibility of Linear Programs with Bandit Feedback
Aditya Gangrade
Aditya Gopalan
Venkatesh Saligrama
Clayton Scott
307
3
0
21 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
294
0
0
04 Jun 2024
Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget
Dengwang Tang
Rahul Jain
Ashutosh Nayyar
Pierluigi Nuzzo
295
2
0
23 May 2024
Safe Exploration Using Bayesian World Models and Log-Barrier Optimization
Yarden As
Bhavya Sukhija
Andreas Krause
OffRL
211
2
0
09 May 2024
On Safety in Safe Bayesian Optimization
Christian Fiedler
Johanna Menn
Lukas Kreisköther
Sebastian Trimpe
358
17
0
19 Mar 2024
Optimistic Safety for Online Convex Optimization with Unknown Linear Constraints
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Spencer Hutchinson
Tianyi Chen
Mahnoosh Alizadeh
333
1
0
09 Mar 2024
Truly No-Regret Learning in Constrained MDPs
Adrian Müller
Pragnya Alatur
Volkan Cevher
Giorgia Ramponi
Niao He
436
17
0
24 Feb 2024
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
IEEE Transactions on Signal and Information Processing over Networks (TSIPN), 2024
Jiabin Lin
Shana Moothedath
489
2
0
21 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
233
0
0
24 Dec 2023
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Honghao Wei
Xin Liu
Lei Ying
217
7
0
22 Dec 2023
Risk-Aware Continuous Control with Neural Contextual Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2023
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Pérez Costa
262
4
0
15 Dec 2023
A safe exploration approach to constrained Markov decision processes
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Tingting Ni
Maryam Kamgarpour
398
5
0
01 Dec 2023
Robust Best-arm Identification in Linear Bandits
Wei Wang
Sattar Vakili
Ilija Bogunovic
231
0
0
08 Nov 2023
Convex Methods for Constrained Linear Bandits
Amirhossein Afsharrad
Ahmadreza Moradipari
Sanjay Lall
238
5
0
07 Nov 2023
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Neural Information Processing Systems (NeurIPS), 2023
Akifumi Wachi
Wataru Hashimoto
Xun Shen
Kazumune Hashimoto
323
27
0
05 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
297
5
0
15 Sep 2023
Directional Optimism for Safe Linear Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Spencer Hutchinson
Berkay Turan
M. Alizadeh
336
9
0
29 Aug 2023
Clustered Linear Contextual Bandits with Knapsacks
Yichuan Deng
M. Mamakos
Zhao Song
216
0
0
21 Aug 2023
Online Ad Procurement in Non-stationary Autobidding Worlds
Neural Information Processing Systems (NeurIPS), 2023
Jason Cheuk Nam Liang
Haihao Lu
Baoyu Zhou
170
9
0
10 Jul 2023
Pure Exploration in Bandits with Linear Constraints
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Emil Carlsson
Debabrota Basu
Fredrik D. Johansson
Devdatt Dubhashi
365
12
0
22 Jun 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
International Conference on Machine Learning (ICML), 2023
Donghao Li
Ruiquan Huang
Cong Shen
Jing Yang
305
4
0
09 Jun 2023
Disincentivizing Polarization in Social Networks
C. Borgs
J. Chayes
Christian Ikeokwu
Ellen Vitercik
191
0
0
23 May 2023
The Impact of the Geometric Properties of the Constraint Set in Safe Optimization with Bandit Feedback
Conference on Learning for Dynamics & Control (L4DC), 2023
Spencer Hutchinson
Berkay Turan
M. Alizadeh
314
7
0
01 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
209
1
0
29 Apr 2023
Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems
Ting-Jui Chang
Sapana Chaudhary
D. Kalathil
Shahin Shahrampour
385
6
0
23 Feb 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
International Conference on Machine Learning (ICML), 2023
Ming Shi
Yitao Liang
Ness B. Shroff
220
17
0
08 Feb 2023
Leveraging User-Triggered Supervision in Contextual Bandits
Alekh Agarwal
Claudio Gentile
T. V. Marinov
204
0
0
07 Feb 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback
International Conference on Machine Learning (ICML), 2023
Wonyoung Hedge Kim
G. Iyengar
A. Zeevi
205
4
0
31 Jan 2023
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
International Conference on Machine Learning (ICML), 2023
Yunlong Hou
Vincent Y. F. Tan
Zixin Zhong
212
1
0
31 Jan 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
226
6
0
27 Jan 2023
Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget
Fathima Zarin Faizal
Jayakrishnan Nair
142
11
0
27 Nov 2022
Benefits of Monotonicity in Safe Exploration with Gaussian Processes
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Arpan Losalka
Jonathan Scarlett
236
1
0
03 Nov 2022
Safe Linear Bandits over Unknown Polytopes
Annual Conference Computational Learning Theory (COLT), 2022
Aditya Gangrade
Tianrui Chen
Venkatesh Saligrama
419
15
0
27 Sep 2022
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Mohammad Kachuee
Sungjin Lee
301
4
0
17 Sep 2022