ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.06731
  4. Cited By
Bandits with Partially Observable Confounded Data
v1v2 (latest)

Bandits with Partially Observable Confounded Data

11 June 2020
Guy Tennenholtz
Uri Shalit
Shie Mannor
Yonathan Efroni
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Bandits with Partially Observable Confounded Data"

20 / 20 papers shown
Title
Deconfounded Warm-Start Thompson Sampling with Applications to Precision Medicine
Deconfounded Warm-Start Thompson Sampling with Applications to Precision Medicine
Prateek Jaiswal
Esmaeil Keyvanshokooh
Junyu Cao
44
0
0
22 May 2025
Benchmarks for Reinforcement Learning with Biased Offline Data and
  Imperfect Simulators
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
76
1
0
30 Jun 2024
Leveraging Offline Data in Linear Latent Bandits
Leveraging Offline Data in Linear Latent Bandits
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
OffRL
69
2
0
27 May 2024
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Wang Chi Cheung
Lixing Lyu
OffRL
108
6
0
04 May 2024
Predictive Performance Comparison of Decision Policies Under Confounding
Predictive Performance Comparison of Decision Policies Under Confounding
Luke M. Guerdan
Amanda Coston
Kenneth Holstein
Zhiwei Steven Wu
OffRL
193
0
0
01 Apr 2024
Robustly Improving Bandit Algorithms with Confounded and Selection
  Biased Offline Data: A Causal Approach
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach
Wen Huang
Xintao Wu
OffRLCML
51
0
0
20 Dec 2023
Online Decision Mediation
Online Decision Mediation
Daniel Jarrett
Alihan Huyuk
M. Schaar
97
4
0
28 Oct 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden
  Confounding
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
62
7
0
01 Jun 2023
Ranking with Popularity Bias: User Welfare under Self-Amplification
  Dynamics
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics
Guy Tennenholtz
Martin Mladenov
Nadav Merlis
Robert L. Axtell
Craig Boutilier
53
0
0
24 May 2023
A Unified Framework of Policy Learning for Contextual Bandit with
  Confounding Bias and Missing Observations
A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
Siyu Chen
Yitan Wang
Zhaoran Wang
Zhuoran Yang
OffRL
104
2
0
20 Mar 2023
Causal Deep Learning
Causal Deep Learning
Jeroen Berrevoets
Krzysztof Kacprzyk
Zhaozhi Qian
M. Schaar
CMLAI4CE
67
25
0
03 Mar 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
135
30
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous
  Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
54
10
0
01 Feb 2023
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker
Aldo Pacchiano
OffRLOnRL
103
41
0
09 Nov 2022
Dual Instrumental Method for Confounded Kernelized Bandits
Dual Instrumental Method for Confounded Kernelized Bandits
Xueping Gong
Jiheng Zhang
97
1
0
07 Sep 2022
Worst-case Performance of Greedy Policies in Bandits with Imperfect
  Context Observations
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations
Hongju Park
Mohamad Kazem Shirani Faradonbeh
OffRL
59
2
0
10 Apr 2022
Efficient Algorithms for Learning to Control Bandits with Unobserved
  Contexts
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts
Hongju Park
Mohamad Kazem Shirani Faradonbeh
43
6
0
02 Feb 2022
Analysis of Thompson Sampling for Partially Observable Contextual
  Multi-Armed Bandits
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits
Yash J. Patel
Mohamad Kazem Shirani Faradonbeh
55
15
0
23 Oct 2021
On Covariate Shift of Latent Confounders in Imitation and Reinforcement
  Learning
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
OODOffRL
118
16
0
13 Oct 2021
Invariant Policy Learning: A Causal Perspective
Invariant Policy Learning: A Causal Perspective
Sorawit Saengkyongam
Nikolaj Thams
J. Peters
Niklas Pfister
CMLOffRL
85
15
0
01 Jun 2021
1