Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2202.09497
Cited By
v1
v2
v3
v4
v5
v6
v7
v8 (latest)
Gradient Estimation with Discrete Stein Operators
Neural Information Processing Systems (NeurIPS), 2022
19 February 2022
Jiaxin Shi
Yuhao Zhou
Jessica Hwang
Michalis K. Titsias
Lester W. Mackey
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gradient Estimation with Discrete Stein Operators"
19 / 19 papers shown
Discrete Neural Flow Samplers with Locally Equivariant Transformer
Chinmay Pani
Ruixiang Zhang
Yingzhen Li
450
10
0
23 May 2025
Secrets of GFlowNets' Learning Behavior: A Theoretical Study
Tianshu Yu
181
1
0
04 May 2025
Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Zhe Wang
Jiaxin Shi
N. Heess
Arthur Gretton
Michalis K. Titsias
403
16
0
07 Mar 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Neural Information Processing Systems (NeurIPS), 2024
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
738
410
0
17 Jan 2025
On Divergence Measures for Training GFlowNets
Neural Information Processing Systems (NeurIPS), 2024
Tiago da Silva
Eliezer de Souza da Silva
Diego Mesquita
BDL
499
5
0
12 Oct 2024
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
F. Chen
Shaul Druckmann
Lester W. Mackey
Scott W. Linderman
803
42
0
30 Jul 2024
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Longrong Yang
Dong Shen
Chaoxiang Cai
Fan Yang
Size Li
Tingting Gao
Xi Li
MoE
497
9
0
28 Jun 2024
Adaptive Instrument Design for Indirect Experiments
International Conference on Learning Representations (ICLR), 2023
Yash Chandak
Shiv Shankar
Vasilis Syrgkanis
Emma Brunskill
250
5
0
05 Dec 2023
Differentiable Sampling of Categorical Distributions Using the CatLog-Derivative Trick
Neural Information Processing Systems (NeurIPS), 2023
Lennert De Smet
Emanuele Sansone
Pedro Zuidberg Dos Martires
231
15
0
21 Nov 2023
Sparse Backpropagation for MoE Training
Liyuan Liu
Jianfeng Gao
Weizhu Chen
MoE
231
15
0
01 Oct 2023
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation
Pau Mulet Arabí
Alec Flowers
Lukas Mauch
Fabien Cardinaux
BDL
236
0
0
07 Sep 2023
When can Regression-Adjusted Control Variates Help? Rare Events, Sobolev Embedding and Minimax Optimality
Neural Information Processing Systems (NeurIPS), 2023
Jose H. Blanchet
Haoxuan Chen
Yiping Lu
Lexing Ying
255
6
0
25 May 2023
Bayesian Numerical Integration with Neural Networks
Katharina Ott
Michael Tiemann
Philipp Hennig
F. Briol
BDL
330
5
0
22 May 2023
Bridging Discrete and Backpropagation: Straight-Through and Beyond
Neural Information Processing Systems (NeurIPS), 2023
Liyuan Liu
Chengyu Dong
Xiaodong Liu
Bin Yu
Jianfeng Gao
BDL
335
38
0
17 Apr 2023
Meta-learning Control Variates: Variance Reduction with Limited Data
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Z. Sun
Chris J. Oates
F. Briol
BDL
375
13
0
08 Mar 2023
A kernel Stein test of goodness of fit for sequential models
International Conference on Machine Learning (ICML), 2022
Jerome Baum
Heishiro Kanagawa
Arthur Gretton
519
12
0
19 Oct 2022
Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping
AAAI Conference on Artificial Intelligence (AAAI), 2022
Russell Z. Kunes
Mingzhang Yin
Max Land
Doron Haviv
Dana Peér
Simon Tavaré
BDL
277
5
0
12 Aug 2022
Generalised Bayesian Inference for Discrete Intractable Likelihood
Journal of the American Statistical Association (JASA), 2022
Takuo Matsubara
Jeremias Knoblauch
F. Briol
Chris J. Oates
414
25
0
16 Jun 2022
A Kernel Stein Test for Comparing Latent Variable Models
Heishiro Kanagawa
Wittawat Jitkrittum
Lester W. Mackey
Kenji Fukumizu
Arthur Gretton
515
20
0
01 Jul 2019
1
Page 1 of 1