v1v2v3v4 (latest)

Risk-Constrained Thompson Sampling for CVaR Bandits

16 November 2020

Joel Q. L. Chang

Qiuyu Zhu

Vincent Y. F. Tan

ArXiv (abs)PDF HTML

Papers citing "Risk-Constrained Thompson Sampling for CVaR Bandits"

7 / 7 papers shown

Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget

Fathima Zarin Faizal

Jayakrishnan Nair

116

27 Nov 2022

Off-Policy Risk Assessment in Markov Decision Processes

Audrey Huang

Liu Leqi

Zachary Chase Lipton

Kamyar Azizzadenesheli

OffRL

432

21 Sep 2022

Almost Optimal Variance-Constrained Best Arm IdentificationIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

Yunlong Hou

Vincent Y. F. Tan

Zixin Zhong

255

25 Jan 2022

Risk averse non-stationary multi-armed bandits

Leo Benac

Frédéric Godin

181

28 Sep 2021

A Unifying Theory of Thompson Sampling for Continuous Risk-Averse BanditsAAAI Conference on Artificial Intelligence (AAAI), 2021

Joel Q. L. Chang

Vincent Y. F. Tan

474

25 Aug 2021

Thompson Sampling for Gaussian Entropic Risk Bandits

Ming Liang Ang

Eloise Y. Y. Lim

Joel Q. L. Chang

207

14 May 2021

Off-Policy Risk Assessment in Contextual BanditsNeural Information Processing Systems (NeurIPS), 2021

Audrey Huang

Liu Leqi

Zachary Chase Lipton

Kamyar Azizzadenesheli

OffRL

193

18 Apr 2021