1506.05254
Gradient Estimation Using Stochastic Computation Graphs
17 June 2015
John Schulman
N. Heess
T. Weber
Pieter Abbeel
OffRL
Papers citing
"Gradient Estimation Using Stochastic Computation Graphs"
40 / 90 papers shown
Title
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
44
151
0
12 Jun 2019
Differentiable Algorithm Networks for Composable Robot Learning
Peter Karkus
Xiao Ma
David Hsu
L. Kaelbling
Wee Sun Lee
Tomas Lozano-Perez
14
70
0
28 May 2019
Asynchronous Coagent Networks
James E. Kostas
Chris Nota
Philip S. Thomas
GNN
17
9
0
15 Feb 2019
CommunityGAN: Community Detection with Generative Adversarial Nets
Yuting Jia
Qinqin Zhang
Weinan Zhang
Xinbing Wang
GNN
GAN
20
115
0
20 Jan 2019
Neural Joint Source-Channel Coding
Kristy Choi
Kedar Tatwawadi
Aditya Grover
Tsachy Weissman
Stefano Ermon
13
38
0
19 Nov 2018
Composing Modeling and Inference Operations with Probabilistic Program Combinators
Eli Sennesh
Adam Scibior
Hao Wu
Jan-Willem van de Meent
TPM
13
1
0
14 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
35
53
0
03 Nov 2018
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Yikang Shen
Shawn Tan
Alessandro Sordoni
Aaron Courville
32
323
0
22 Oct 2018
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
35
209
0
16 Oct 2018
Seq2Slate: Re-ranking and Slate Optimization with RNNs
Irwan Bello
Sayali Kulkarni
Sagar Jain
Craig Boutilier
Ed H. Chi
Elad Eban
Xiyang Luo
Alan Mackey
Ofer Meshi
30
91
0
04 Oct 2018
Improved Gradient-Based Optimization Over Discrete Distributions
Evgeny Andriyash
Arash Vahdat
W. Macready
16
9
0
29 Sep 2018
Learning to Generate Structured Queries from Natural Language with Indirect Supervision
Ziwei Bai
Bo Yu
Bowen Wu
Zhuoran Wang
Baoxun Wang
18
3
0
10 Sep 2018
Pathwise Derivatives Beyond the Reparameterization Trick
M. Jankowiak
F. Obermeyer
30
110
0
05 Jun 2018
Efficient Entropy for Policy Gradient with Multidimensional Action Space
Yiming Zhang
Q. Vuong
Kenny Song
Xiao-Yue Gong
Keith Ross
27
17
0
02 Jun 2018
A Stochastic Decoder for Neural Machine Translation
P. Schulz
Wilker Aziz
Trevor Cohn
BDL
30
29
0
28 May 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
25
116
0
03 Mar 2018
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
49
227
0
13 Feb 2018
Learning to Search with MCTSnets
A. Guez
T. Weber
Ioannis Antonoglou
Karen Simonyan
Oriol Vinyals
Daan Wierstra
Rémi Munos
David Silver
28
85
0
13 Feb 2018
TensorFlow Distributions
Joshua V. Dillon
I. Langmore
Dustin Tran
E. Brevdo
Srinivas Vasudevan
David A. Moore
Brian Patton
Alexander A. Alemi
Matt Hoffman
Rif A. Saurous
GP
46
346
0
28 Nov 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
53
300
0
31 Oct 2017
Learning to Compose Domain-Specific Transformations for Data Augmentation
Alexander J. Ratner
Henry R. Ehrenberg
Zeshan Hussain
Jared A. Dunnmon
Christopher Ré
30
346
0
06 Sep 2017
Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
Victor Zhong
Caiming Xiong
R. Socher
RALM
43
1,164
0
31 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
53
2,775
0
19 Aug 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
51
551
0
19 Jul 2017
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
19
7
0
16 Jun 2017
Learning Disentangled Representations with Semi-Supervised Deep Generative Models
Siddharth Narayanaswamy
Brooks Paige
Jan-Willem van de Meent
Alban Desmaison
Noah D. Goodman
Pushmeet Kohli
Frank Wood
Philip Torr
DRL
CoGe
19
359
0
01 Jun 2017
Joint Positioning and Radio Map Generation Based on Stochastic Variational Bayesian Inference for FWIPS
Caifa Zhou
Yang Gu
13
4
0
17 May 2017
Motion Prediction Under Multimodality with Conditional Stochastic Networks
Katerina Fragkiadaki
Jonathan Huang
Alexander A. Alemi
Sudheendra Vijayanarasimhan
Susanna Ricco
Rahul Sukthankar
3DH
25
25
0
05 May 2017
Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
23
304
0
28 Apr 2017
Bandit Structured Prediction for Neural Sequence-to-Sequence Learning
Julia Kreutzer
Artem Sokolov
Stefan Riezler
27
49
0
21 Apr 2017
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
29
342
0
21 Apr 2017
Learning to superoptimize programs - Workshop Version
Rudy Bunel
Alban Desmaison
M. P. Kumar
Philip Torr
Pushmeet Kohli
25
10
0
04 Dec 2016
The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables
Chris J. Maddison
A. Mnih
Yee Whye Teh
BDL
24
2,505
0
02 Nov 2016
Deep Amortized Inference for Probabilistic Programs
Daniel E. Ritchie
Paul Horsfall
Noah D. Goodman
TPM
24
81
0
18 Oct 2016
Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms
C. A. Naesseth
Francisco J. R. Ruiz
Scott W. Linderman
David M. Blei
BDL
25
107
0
18 Oct 2016
Learning from the Hindsight Plan -- Episodic MPC Improvement
Aviv Tamar
G. Thomas
Tianhao Zhang
Sergey Levine
Pieter Abbeel
32
64
0
28 Sep 2016
Asymptotically exact inference in differentiable generative models
Matthew M. Graham
Amos J. Storkey
BDL
21
33
0
25 May 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
20
40
0
21 Apr 2016
MuProp: Unbiased Backpropagation for Stochastic Neural Networks
S. Gu
Sergey Levine
Ilya Sutskever
A. Mnih
BDL
10
143
0
16 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
14
13,110
0
09 Sep 2015