ResearchTrend.AI
Linear Stochastic Bandits Under Safety Constraints

16 August 2019
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
arXiv: 1908.05814

Papers citing "Linear Stochastic Bandits Under Safety Constraints"

25 / 25 papers shown
Constrained Online Decision-Making: A Unified Framework
Haichen Hu
David Simchi-Levi
Navid Azizan
34
0
0
11 May 2025
Improved Regret Bounds for Online Fair Division with Bandit Learning
Benjamin G. Schiffer
Shirley Zhang
36
0
0
13 Jan 2025
Honor Among Bandits: No-Regret Learning for Online Fair Division
Ariel D. Procaccia
Benjamin Schiffer
Shirley Zhang
FaML
24
2
0
01 Jul 2024
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
Jiabin Lin
Shana Moothedath
45
1
0
21 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
23
0
0
24 Dec 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
15
1
0
29 Apr 2023
Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems
Ting-Jui Chang
Sapana Chaudhary
D. Kalathil
Shahin Shahrampour
23
5
0
23 Feb 2023
Safe Linear Bandits over Unknown Polytopes
Aditya Gangrade
Tianrui Chen
Venkatesh Saligrama
30
6
0
27 Sep 2022
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Mohammad Kachuee
Sungjin Lee
68
4
0
17 Sep 2022
Active Learning with Safety Constraints
Romain Camilleri
Andrew Wagenmaker
Jamie Morgenstern
Lalit P. Jain
Kevin G. Jamieson
23
12
0
22 Jun 2022
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
Subhojyoti Mukherjee
14
1
0
27 May 2022
Stochastic Conservative Contextual Linear Bandits
Jiabin Lin
Xian Yeow Lee
Talukder Jubery
Shana Moothedath
S. Sarkar
Baskar Ganapathysubramanian
8
7
0
29 Mar 2022
Linear Stochastic Bandits over a Bit-Constrained Channel
A. Mitra
Hamed Hassani
George J. Pappas
34
8
0
02 Mar 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
B. Kveton
Rui Song
OffRL
16
10
0
26 Feb 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
51
25
0
31 Jan 2022
Best Arm Identification with Safety Constraints
Zhenlin Wang
Andrew Wagenmaker
Kevin G. Jamieson
21
21
0
23 Nov 2021
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
Yunyue Wei
Yanan Sui
OffRL
22
10
0
09 Nov 2021
Adaptive Data Debiasing through Bounded Exploration
Yifan Yang
Yang Liu
Parinaz Naghizadeh
FaML
30
7
0
25 Oct 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
29
78
0
04 Jun 2021
Stochastic Linear Bandits with Protected Subspace
Advait Parulekar
Soumya Basu
Aditya Gopalan
Karthikeyan Shanmugam
Sanjay Shakkottai
71
2
0
02 Nov 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
18
39
0
01 Aug 2020
Learning under Invariable Bayesian Safety
Gal Bahar
Omer Ben-Porat
Kevin Leyton-Brown
Moshe Tennenholtz
19
0
0
08 Jun 2020
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
18
42
0
06 Nov 2019
Resourceful Contextual Bandits
Ashwinkumar Badanidiyuru
John Langford
Aleksandrs Slivkins
40
117
0
27 Feb 2014
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
78
308
0
22 May 2012