Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.06272
Cited By
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
International Conference on Learning Representations (ICLR), 2018
15 November 2018
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search"
50 / 99 papers shown
CauSight: Learning to Supersense for Visual Causal Discovery
Yize Zhang
M. Chen
Sirui Chen
Bo Peng
Y. Zhang
Tianyu Li
Chaochao Lu
CML
ReLM
LRM
145
0
0
01 Dec 2025
ExoPredicator: Learning Abstract Models of Dynamic Worlds for Robot Planning
Yichao Liang
Dat Nguyen
Cambridge Yang
Tianyang Li
J. Tenenbaum
Carl Edward Rasmussen
Adrian Weller
Zenna Tavares
Tom Silver
Kevin Ellis
192
1
0
30 Sep 2025
Goal Discovery with Causal Capacity for Efficient Reinforcement Learning
Yan Yu
Yaodong Yang
Zhengbo Lu
Chengdong Ma
Wengang Zhou
Houqiang Li
CML
133
0
0
13 Aug 2025
Abstract Counterfactuals for Language Model Agents
Edoardo Pona
Milad Kazemi
Yali Du
David Watson
Nicola Paoletti
LLMAG
270
1
0
03 Jun 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
327
2
0
06 May 2025
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
Chenran Zhao
Dianxi Shi
Mengzhu Wang
Jianqiang Xia
Huanhuan Yang
Songchang Jin
Shaowu Yang
Chunping Qiu
270
0
0
04 May 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan
Jin Cheng
Núria Armengol Urpí
Stelian Coros
442
1
0
02 Feb 2025
Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs
Chao Han
Debabrota Basu
M. Mangan
Eleni Vasilaki
Aditya Gilra
320
2
0
12 Nov 2024
Counterfactual Token Generation in Large Language Models
CLEaR (CLEaR), 2024
Ivi Chatzi
N. C. Benz
Eleni Straitouri
Stratis Tsirtsis
Manuel Gomez Rodriguez
LRM
404
10
0
25 Sep 2024
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Hao-ming Lin
Wenhao Ding
Jian Chen
Laixi Shi
Jiacheng Zhu
Yue Liu
Ding Zhao
OffRL
CML
493
3
0
15 Jul 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
261
4
0
30 Jun 2024
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning
Inwoo Hwang
Yunhyeok Kwak
Jaein Kim
Byoung-Tak Zhang
Sanghack Lee
296
6
0
05 Jun 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
191
10
0
29 May 2024
Learning Causal Dynamics Models in Object-Oriented Environments
Zhongwei Yu
Jingqing Ruan
Dengpeng Xing
233
4
0
21 May 2024
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning
Sean Vaskov
Wilko Schwarting
Chris Baker
262
2
0
19 May 2024
What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning
Zhihong Deng
Jing Jiang
Guodong Long
Chengqi Zhang
247
3
0
16 Apr 2024
Automated Discovery of Functional Actual Causes in Complex Environments
Caleb Chuck
Sankaran Vaidyanathan
Stephen Giguere
Amy Zhang
David Jensen
S. Niekum
CML
309
3
0
16 Apr 2024
Mitigating Cascading Effects in Large Adversarial Graph Environments
James Cunningham
Conrad S. Tucker
AI4CE
AAML
134
0
0
12 Apr 2024
Counterfactual Influence in Markov Decision Processes
Milad Kazemi
Jessica Lally
Ekaterina Tishchenko
Hana Chockler
Nicola Paoletti
313
2
0
13 Feb 2024
Where and How to Attack? A Causality-Inspired Recipe for Generating Counterfactual Adversarial Examples
Ruichu Cai
Yuxuan Zhu
Jie Qiao
Zefeng Liang
Furui Liu
Zhifeng Hao
CML
371
5
0
21 Dec 2023
Personalized Path Recourse for Reinforcement Learning Agents
Dat Hong
Tong Wang
331
0
0
14 Dec 2023
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs
International Conference on Machine Learning (ICML), 2023
Stelios Triantafyllou
A. Sukovic
Debmalya Mandal
Goran Radanović
389
0
0
17 Oct 2023
Offline Imitation Learning with Variational Counterfactual Reasoning
Neural Information Processing Systems (NeurIPS), 2023
Bowei He
Zexu Sun
Jinxin Liu
Shuai Zhang
Xu Chen
Chen Ma
OffRL
267
10
0
07 Oct 2023
Estimation of Counterfactual Interventions under Uncertainties
Asian Conference on Machine Learning (ACML), 2023
Juliane Weilbach
S. Gerwinn
M. Kandemir
Martin Fraenzle
197
0
0
15 Sep 2023
Bayesian Inverse Transition Learning for Offline Settings
Leo Benac
S. Parbhoo
Finale Doshi-Velez
OffRL
129
0
0
09 Aug 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
274
2
0
21 Jul 2023
Adversarial Conversational Shaping for Intelligent Agents
Piotr Tarasiewicz
Sultan Kenjeyev
Ilana Sebag
Shehab Alshehabi
GAN
173
0
0
20 Jul 2023
Causal Reinforcement Learning: A Survey
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
CML
LRM
345
32
0
04 Jul 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Neural Information Processing Systems (NeurIPS), 2023
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
354
6
0
29 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces
Neural Information Processing Systems (NeurIPS), 2023
Stratis Tsirtsis
Manuel Gomez Rodriguez
CML
OffRL
328
13
0
06 Jun 2023
Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity Model
Neural Information Processing Systems (NeurIPS), 2023
Valentyn Melnychuk
Dennis Frauen
Stefan Feuerriegel
519
13
0
02 Jun 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
C. Cunha
Wen Liu
T. French
Lin Wang
166
3
0
26 Feb 2023
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Stelios Triantafyllou
Goran Radanović
182
5
0
24 Feb 2023
A Survey on Causal Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
430
52
0
10 Feb 2023
Causal Temporal Reasoning for Markov Decision Processes
Milad Kazemi
Nicola Paoletti
LRM
AI4CE
237
2
0
16 Dec 2022
Counterfactuals for the Future
AAAI Conference on Artificial Intelligence (AAAI), 2022
Lucius E.J. Bynum
Joshua R. Loftus
Julia Stoyanovich
167
12
0
07 Dec 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
International Conference on Machine Learning (ICML), 2022
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
246
6
0
18 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
334
15
0
04 Nov 2022
Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jiao Ou
Jinchao Zhang
Yang Feng
Jie Zhou
229
14
0
30 Oct 2022
MoCoDA: Model-based Counterfactual Data Augmentation
Neural Information Processing Systems (NeurIPS), 2022
Silviu Pitis
Elliot Creager
Ajay Mandlekar
Animesh Garg
OffRL
186
49
0
20 Oct 2022
Causal Dynamics Learning for Task-Independent State Abstraction
International Conference on Machine Learning (ICML), 2022
Zizhao Wang
Xuesu Xiao
Zifan Xu
Yuke Zhu
Peter Stone
CML
198
70
0
27 Jun 2022
Adversarial Counterfactual Environment Model Learning
Neural Information Processing Systems (NeurIPS), 2023
Xiong-Hui Chen
Yang Yu
Zhenghong Zhu
Zhihua Yu
Zhen-Yu Chen
...
Yinan Wu
Hongqiu Wu
Rongjun Qin
Rui Ding
Fangsheng Huang
CML
OffRL
213
17
0
10 Jun 2022
Counterfactual Analysis in Dynamic Latent State Models
International Conference on Machine Learning (ICML), 2022
Martin Haugh
Raghav Singal
CML
264
6
0
27 May 2022
Counterfactual harm
Neural Information Processing Systems (NeurIPS), 2022
Jonathan G. Richens
R. Beard
Daniel H. Thompson
372
33
0
27 Apr 2022
On the link between conscious function and general intelligence in humans and machines
Arthur Juliani
Kai Arulkumaran
Shuntaro Sasai
Ryota Kanai
277
27
0
24 Mar 2022
Learning to reason about and to act on physical cascading events
International Conference on Machine Learning (ICML), 2022
Yuval Atzmon
E. Meirom
Shie Mannor
Gal Chechik
LRM
172
0
0
02 Feb 2022
A Validation Tool for Designing Reinforcement Learning Environments
Ruiyang Xu
Zhengxing Chen
OffRL
100
0
0
10 Dec 2021
Counterfactual Temporal Point Processes
Neural Information Processing Systems (NeurIPS), 2021
Kimia Noorbakhsh
Manuel Gomez Rodriguez
194
27
0
15 Nov 2021
Causal Multi-Agent Reinforcement Learning: Review and Open Problems
St John Grimbly
Jonathan P. Shock
Arnu Pretorius
218
23
0
12 Nov 2021
Learning Generalized Gumbel-max Causal Mechanisms
Neural Information Processing Systems (NeurIPS), 2021
Guy Lorberbom
Daniel D. Johnson
Chris J. Maddison
Daniel Tarlow
Tamir Hazan
CML
133
21
0
11 Nov 2021
1
2
Next