ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.05324
  4. Cited By
A Simple Approach for Non-stationary Linear Bandits
v1v2 (latest)

A Simple Approach for Non-stationary Linear Bandits

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
9 March 2021
Peng Zhao
Lijun Zhang
Yuan Jiang
Zhi Zhou
ArXiv (abs)PDFHTML

Papers citing "A Simple Approach for Non-stationary Linear Bandits"

50 / 58 papers shown
Title
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Shaoang Li
Jian Li
104
0
0
18 Sep 2025
Finite-Time Guarantees for Multi-Agent Combinatorial Bandits with Nonstationary Rewards
Finite-Time Guarantees for Multi-Agent Combinatorial Bandits with Nonstationary Rewards
Katherine Adams
J. Boutilier
Qinyang He
Yonatan Dov Mintz
OffRL
80
0
0
28 Aug 2025
Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity
Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of PlasticityKnowledge Discovery and Data Mining (KDD), 2025
Zhiyuan Su
Sunhao Dai
Xiao Zhang
202
0
0
14 Jun 2025
Quick-Draw Bandits: Quickly Optimizing in Nonstationary Environments with Extremely Many Arms
Quick-Draw Bandits: Quickly Optimizing in Nonstationary Environments with Extremely Many ArmsKnowledge Discovery and Data Mining (KDD), 2025
Derek Everett
Fred Lu
Edward Raff
Fernando Camacho
James Holt
266
0
0
30 May 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
157
1
0
04 Apr 2025
Heavy-Tailed Linear Bandits: Huber Regression with One-Pass Update
Heavy-Tailed Linear Bandits: Huber Regression with One-Pass Update
Jing Wang
Yu Zhang
Peng Zhao
Zhi Zhou
313
1
0
01 Mar 2025
Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization
Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization
Linfeng Cao
Ming Shi
Ness B. Shroff
157
0
0
19 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
320
1
0
31 Jan 2025
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPsNeural Information Processing Systems (NeurIPS), 2024
Long-Fei Li
Peng Zhao
Zhi Zhou
224
2
0
05 Nov 2024
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary
  Multi-Armed Bandits
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Kuan-Ta Li
Ping-Chun Hsieh
Yu-Chih Huang
149
2
0
08 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
381
8
0
26 Jul 2024
Causally Abstracted Multi-armed Bandits
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
287
6
0
26 Apr 2024
Adaptive Memory Replay for Continual Learning
Adaptive Memory Replay for Continual Learning
James Seale Smith
Lazar Valkov
Shaunak Halbe
V. Gutta
Rogerio Feris
Z. Kira
Leonid Karlinsky
167
15
0
18 Apr 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
253
1
0
15 Mar 2024
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble
  Sampling
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
217
1
0
11 Oct 2023
Dynamic Embedding Size Search with Minimum Regret for Streaming
  Recommender System
Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender SystemInternational Conference on Information and Knowledge Management (CIKM), 2023
Bowei He
Xu He
Renrui Zhang
Yingxue Zhang
Ruiming Tang
Chen Ma
AI4TS
130
13
0
15 Aug 2023
Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits
Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits
Nicklas Werge
Yi-Shan Wu
Abdullah Akgul
M. Kandemir
301
0
0
07 Jul 2023
A Black-box Approach for Non-stationary Multi-agent Reinforcement
  Learning
A Black-box Approach for Non-stationary Multi-agent Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Haozhe Jiang
Qiwen Cui
Zhihan Xiong
Maryam Fazel
S. Du
195
6
0
12 Jun 2023
Non-stationary Reinforcement Learning under General Function
  Approximation
Non-stationary Reinforcement Learning under General Function ApproximationInternational Conference on Machine Learning (ICML), 2023
Songtao Feng
Ming Yin
Ruiquan Huang
Yu Wang
J. Yang
Yitao Liang
135
9
0
01 Jun 2023
Learning to Seek: Multi-Agent Online Source Seeking Against
  Non-Stochastic Disturbances
Learning to Seek: Multi-Agent Online Source Seeking Against Non-Stochastic Disturbances
Bin Du
Kun Qian
Christian G. Claudel
Dengfeng Sun
281
0
0
29 Apr 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Revisiting Weighted Strategy for Non-stationary Parametric BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jing Wang
Peng Zhao
Zhihong Zhou
193
9
0
05 Mar 2023
MNL-Bandit in non-stationary environments
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
261
3
0
04 Mar 2023
A Definition of Non-Stationary Bandits
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
273
11
0
23 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear
  Contextual Bandits
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
Yue Kang
Cho-Jui Hsieh
T. C. Lee
236
2
0
18 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
135
3
0
16 Feb 2023
Adapting to Continuous Covariate Shift via Online Density Ratio
  Estimation
Adapting to Continuous Covariate Shift via Online Density Ratio EstimationNeural Information Processing Systems (NeurIPS), 2023
Yu Zhang
Zhenyu Zhang
Peng Zhao
Masashi Sugiyama
OOD
265
16
0
06 Feb 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls
  Oracle
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls OracleAllerton Conference on Communication, Control, and Computing (Allerton), 2023
Hyunwook Kang
P. R. Kumar
OffRL
157
1
0
29 Jan 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed
  Bandit with Constraints
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with ConstraintsConference on Learning for Dynamics & Control (L4DC), 2022
Heng Guo
Qi Zhu
Xin Liu
249
15
0
27 Nov 2022
Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal
  Dynamic Regret, Adaptive Detection, and Separation Design
Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation DesignAAAI Conference on Artificial Intelligence (AAAI), 2022
Yuhao Ding
Ming Jin
Javad Lavaei
116
7
0
19 Nov 2022
Competing Bandits in Time Varying Matching Markets
Competing Bandits in Time Varying Matching MarketsConference on Learning for Dynamics & Control (L4DC), 2022
Deepan Muthirayan
C. Maheshwari
Pramod P. Khargonekar
S. Shankar Sastry
203
5
0
21 Oct 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
184
19
0
26 Aug 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
224
16
0
12 Jul 2022
Recent Advances in Bayesian Optimization
Recent Advances in Bayesian OptimizationACM Computing Surveys (ACM CSUR), 2022
Xilu Wang
Yaochu Jin
Sebastian Schmitt
Markus Olhofer
227
377
0
07 Jun 2022
Open-environment Machine Learning
Open-environment Machine LearningNational Science Review (NSR), 2022
Zhi Zhou
VLM
332
168
0
01 Jun 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive SamplingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yueyang Liu
Kuang Xu
Benjamin Van Roy
342
21
0
04 May 2022
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender
  Systems
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
174
0
0
07 Feb 2022
Rotting Infinitely Many-armed Bandits
Rotting Infinitely Many-armed BanditsInternational Conference on Machine Learning (ICML), 2022
Jung-hun Kim
Milan Vojnović
Se-Young Yun
212
5
0
31 Jan 2022
Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for
  Online Convex Optimization
Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex OptimizationJournal of machine learning research (JMLR), 2021
Peng Zhao
Yu Zhang
Lijun Zhang
Zhi Zhou
289
75
0
29 Dec 2021
The Pareto Frontier of model selection for general Contextual Bandits
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
203
25
0
25 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary
  MDPs
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
295
21
0
18 Oct 2021
Distribution-free Contextual Dynamic Pricing
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
335
41
0
15 Sep 2021
Weighted Gaussian Process Bandits for Non-stationary Environments
Weighted Gaussian Process Bandits for Non-stationary Environments
Yuntian Deng
Xingyu Zhou
Baekjin Kim
Ambuj Tewari
Abhishek Gupta
Ness B. Shroff
208
29
0
06 Jul 2021
When and Whom to Collaborate with in a Changing Environment: A
  Collaborative Dynamic Bandit Solution
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit SolutionAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Chuanhao Li
Qingyun Wu
Hongning Wang
161
6
0
14 Apr 2021
Regret Bounds for Generalized Linear Bandits under Parameter Drift
Regret Bounds for Generalized Linear Bandits under Parameter Drift
Louis Faury
Yoan Russac
Marc Abeille
Clément Calauzènes
150
12
0
09 Mar 2021
No-Regret Algorithms for Time-Varying Bayesian Optimization
No-Regret Algorithms for Time-Varying Bayesian OptimizationAnnual Conference on Information Sciences and Systems (CISS), 2021
Xingyu Zhou
Ness B. Shroff
127
24
0
11 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box ApproachAnnual Conference Computational Learning Theory (COLT), 2021
Chen-Yu Wei
Haipeng Luo
OffRL
385
121
0
10 Feb 2021
Non-stationary Online Learning with Memory and Non-stochastic Control
Non-stationary Online Learning with Memory and Non-stochastic ControlInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Peng Zhao
Yu-Hu Yan
Yu Wang
Zhi Zhou
460
50
0
07 Feb 2021
Learning User Preferences in Non-Stationary Environments
Learning User Preferences in Non-Stationary EnvironmentsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Wasim Huleihel
S. Pal
O. Shayevitz
228
13
0
29 Jan 2021
Non-Stationary Latent Bandits
Non-Stationary Latent Bandits
Joey Hong
Branislav Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
307
15
0
01 Dec 2020
Self-Concordant Analysis of Generalized Linear Bandits with Forgetting
Self-Concordant Analysis of Generalized Linear Bandits with ForgettingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Yoan Russac
Louis Faury
Olivier Cappé
Aurélien Garivier
234
19
0
02 Nov 2020
12
Next