ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.06272
  4. Cited By
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

International Conference on Learning Representations (ICLR), 2018
15 November 2018
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
    CML
ArXiv (abs)PDFHTML

Papers citing "Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search"

49 / 99 papers shown
Dynamic Causal Bayesian Optimization
Dynamic Causal Bayesian Optimization
Virginia Aglietti
Neil Dhir
Javier I. González
Theodoros Damoulas
128
32
0
26 Oct 2021
Accelerating the Learning of TAMER with Counterfactual Explanations
Accelerating the Learning of TAMER with Counterfactual Explanations
Jakob Karalus
F. Lindner
OffRL
173
6
0
03 Aug 2021
Counterfactual Explanations in Sequential Decision Making Under
  Uncertainty
Counterfactual Explanations in Sequential Decision Making Under Uncertainty
Stratis Tsirtsis
A. De
Manuel Gomez Rodriguez
359
51
0
06 Jul 2021
Next-Gen Machine Learning Supported Diagnostic Systems for Spacecraft
Next-Gen Machine Learning Supported Diagnostic Systems for Spacecraft
Athanasios Vlontzos
Gabriel Sutherland
Siddha Ganju
Frank Soboczenski
108
2
0
10 Jun 2021
Independent mechanism analysis, a new concept?
Independent mechanism analysis, a new concept?Neural Information Processing Systems (NeurIPS), 2021
Luigi Gresele
Julius von Kügelgen
Vincent Stimper
Bernhard Schölkopf
M. Besserve
CML
316
113
0
09 Jun 2021
Causal Influence Detection for Improving Efficiency in Reinforcement
  Learning
Causal Influence Detection for Improving Efficiency in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021
Maximilian Seitzer
Bernhard Schölkopf
Georg Martius
CML
273
95
0
07 Jun 2021
Did I do that? Blame as a means to identify controlled effects in
  reinforcement learning
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
249
3
0
01 Jun 2021
Pathdreamer: A World Model for Indoor Navigation
Pathdreamer: A World Model for Indoor Navigation
Jing Yu Koh
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
344
111
0
18 May 2021
Learning Under Adversarial and Interventional Shifts
Learning Under Adversarial and Interventional Shifts
Harvineet Singh
Shalmali Joshi
Finale Doshi-Velez
Himabindu Lakkaraju
OOD
170
4
0
29 Mar 2021
Synthetic Returns for Long-Term Credit Assignment
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
191
35
0
24 Feb 2021
Towards Causal Representation Learning
Towards Causal Representation Learning
Bernhard Schölkopf
Francesco Locatello
Stefan Bauer
Nan Rosemary Ke
Nal Kalchbrenner
Anirudh Goyal
Yoshua Bengio
OODCMLAI4CE
336
341
0
22 Feb 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement
  Learning
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
278
39
0
19 Feb 2021
Directive Explanations for Actionable Explainability in Machine Learning
  Applications
Directive Explanations for Actionable Explainability in Machine Learning Applications
Ronal Singh
Paul Dourish
Piers Howe
Tim Miller
L. Sonenberg
Eduardo Velloso
F. Vetere
140
42
0
03 Feb 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data
  Augmentation
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Kun Zhang
Bernhard Schölkopf
CMLOODOffRL
213
66
0
16 Dec 2020
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Counterfactual Credit Assignment in Model-Free Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
Thomas Mesnard
T. Weber
Fabio Viola
S. Thakoor
Alaa Saade
...
A. Guez
Éric Moulines
Marcus Hutter
Lars Buesing
Rémi Munos
CMLOffRL
234
67
0
18 Nov 2020
Causal Campbell-Goodhart's law and Reinforcement Learning
Causal Campbell-Goodhart's law and Reinforcement LearningInternational Conference on Agents and Artificial Intelligence (ICAART), 2020
Hal Ashton
CML
147
4
0
02 Nov 2020
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit AssignmentNeural Information Processing Systems (NeurIPS), 2020
Veronica Chelu
Doina Precup
H. V. Hasselt
262
27
0
26 Oct 2020
Disentangling causal effects for hierarchical reinforcement learning
Disentangling causal effects for hierarchical reinforcement learning
Oriol Corcoll
Raul Vicente
CML
212
11
0
03 Oct 2020
Multi-task Causal Learning with Gaussian Processes
Multi-task Causal Learning with Gaussian ProcessesNeural Information Processing Systems (NeurIPS), 2020
Virginia Aglietti
Theodoros Damoulas
Mauricio A. Alvarez
Javier I. González
CML
146
22
0
27 Sep 2020
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with
  Latent Confounders
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent ConfoundersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
167
46
0
27 Jul 2020
Counterfactual Data Augmentation using Locally Factored Dynamics
Counterfactual Data Augmentation using Locally Factored Dynamics
Silviu Pitis
Elliot Creager
Animesh Garg
BDLOffRL
238
105
0
06 Jul 2020
Meta Learning for Causal Direction
Meta Learning for Causal Direction
Jean-François Ton
Dino Sejdinovic
Kenji Fukumizu
CMLOOD
122
23
0
06 Jul 2020
Learning "What-if" Explanations for Sequential Decision-Making
Learning "What-if" Explanations for Sequential Decision-Making
Ioana Bica
Daniel Jarrett
Alihan Huyuk
M. Schaar
OffRL
242
2
0
02 Jul 2020
Learning Post-Hoc Causal Explanations for Recommendation
Learning Post-Hoc Causal Explanations for Recommendation
Shuyuan Xu
Yunqi Li
Shuchang Liu
Zuohui Fu
Xu Chen
Zelong Li
CML
122
21
0
30 Jun 2020
Reinforcement Learning and its Connections with Neuroscience and
  Psychology
Reinforcement Learning and its Connections with Neuroscience and PsychologyNeural Networks (NN), 2020
Ajay Subramanian
Sharad Chitlangia
V. Baths
OffRL
524
35
0
25 Jun 2020
Provably Efficient Causal Reinforcement Learning with Confounded
  Observational Data
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
OffRL
176
55
0
22 Jun 2020
Counterfactually Guided Off-policy Transfer in Clinical Settings
Counterfactually Guided Off-policy Transfer in Clinical Settings
Taylor W. Killian
Marzyeh Ghassemi
Shalmali Joshi
CMLOffRLOOD
239
12
0
20 Jun 2020
Causality and Batch Reinforcement Learning: Complementary Approaches To
  Planning In Unknown Domains
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains
James Bannon
Bradford T. Windsor
Wenbo Song
Tao Li
CMLOODOffRL
155
22
0
03 Jun 2020
Causal Bayesian Optimization
Causal Bayesian Optimization
Virginia Aglietti
Xiaoyu Lu
Andrei Paleyes
Javier Gonz' alez
CML
142
59
0
24 May 2020
Counterfactual Off-Policy Training for Neural Response Generation
Counterfactual Off-Policy Training for Neural Response Generation
Qingfu Zhu
Weinan Zhang
Ting Liu
William Yang Wang
OffRLCML
293
3
0
29 Apr 2020
Importance of using appropriate baselines for evaluation of
  data-efficiency in deep reinforcement learning for Atari
Importance of using appropriate baselines for evaluation of data-efficiency in deep reinforcement learning for Atari
Kacper Kielak
OffRL
132
9
0
23 Mar 2020
Counterfactual Policy Evaluation for Decision-Making in Autonomous
  Driving
Counterfactual Policy Evaluation for Decision-Making in Autonomous Driving
Patrick Hart
Alois Knoll
CMLOffRL
183
3
0
20 Mar 2020
Value-driven Hindsight Modelling
Value-driven Hindsight ModellingNeural Information Processing Systems (NeurIPS), 2020
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
OffRL
144
12
0
19 Feb 2020
Resolving Spurious Correlations in Causal Models of Environments via
  Interventions
Resolving Spurious Correlations in Causal Models of Environments via Interventions
S. Volodin
Nevan Wichers
Jeremy Nixon
CML
219
18
0
12 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
226
36
0
07 Feb 2020
Hindsight Credit Assignment
Hindsight Credit AssignmentNeural Information Processing Systems (NeurIPS), 2019
Anna Harutyunyan
Will Dabney
Thomas Mesnard
M. G. Azar
Bilal Piot
...
H. V. Hasselt
Greg Wayne
Satinder Singh
Doina Precup
Rémi Munos
176
84
0
05 Dec 2019
Causality for Machine Learning
Causality for Machine Learning
Bernhard Schölkopf
CMLAI4CELRM
345
516
0
24 Nov 2019
Integrating Markov processes with structural causal modeling enables
  counterfactual inference in complex systems
Integrating Markov processes with structural causal modeling enables counterfactual inference in complex systemsNeural Information Processing Systems (NeurIPS), 2019
Robert Osazuwa Ness
Kaushal Paneri
O. Vitek
98
7
0
06 Nov 2019
Learning to Predict Without Looking Ahead: World Models Without Forward
  Prediction
Learning to Predict Without Looking Ahead: World Models Without Forward PredictionNeural Information Processing Systems (NeurIPS), 2019
C. Freeman
Luke Metz
David R Ha
162
36
0
29 Oct 2019
MultiVerse: Causal Reasoning using Importance Sampling in Probabilistic
  Programming
MultiVerse: Causal Reasoning using Importance Sampling in Probabilistic ProgrammingSymposium on Advances in Approximate Bayesian Inference (AABI), 2019
Yura N. Perov
L. Graham
Kostis Gourgoulias
Jonathan G. Richens
Ciarán M. Gilligan-Lee
Adam Baker
Saurabh Johri
LRM
237
18
0
17 Oct 2019
Causal Induction from Visual Observations for Goal Directed Tasks
Causal Induction from Visual Observations for Goal Directed Tasks
Sunjay Cauligi
Yuke Zhu
D. Stefan
Tamara Rezk
CMLLRM
177
67
0
03 Oct 2019
Causal Modeling for Fairness in Dynamical Systems
Causal Modeling for Fairness in Dynamical SystemsInternational Conference on Machine Learning (ICML), 2019
Elliot Creager
David Madras
T. Pitassi
R. Zemel
177
69
0
18 Sep 2019
Reward Tampering Problems and Solutions in Reinforcement Learning: A
  Causal Influence Diagram Perspective
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
372
121
0
13 Aug 2019
Direct Policy Gradients: Direct Optimization of Policies in Discrete
  Action Spaces
Direct Policy Gradients: Direct Optimization of Policies in Discrete Action SpacesNeural Information Processing Systems (NeurIPS), 2019
Guy Lorberbom
Chris J. Maddison
N. Heess
Tamir Hazan
Daniel Tarlow
187
8
0
14 Jun 2019
Options as responses: Grounding behavioural hierarchies in multi-agent
  RL
Options as responses: Grounding behavioural hierarchies in multi-agent RL
A. Vezhnevets
Yuhuai Wu
Rémi Leblond
Joel Z Leibo
AI4CE
205
18
0
04 Jun 2019
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement
  Learning
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2019
Caleb Chuck
Supawit Chockchowwat
S. Niekum
177
14
0
27 May 2019
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal
  Models
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal ModelsInternational Conference on Machine Learning (ICML), 2019
Michael Oberst
David Sontag
CMLOffRL
330
183
0
14 May 2019
Model-Based Reinforcement Learning for Atari
Model-Based Reinforcement Learning for AtariInternational Conference on Learning Representations (ICLR), 2019
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
829
942
0
01 Mar 2019
Deconfounding Reinforcement Learning in Observational Settings
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CMLOOD
256
76
0
26 Dec 2018
Previous
12