ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXivPDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,874 papers shown
Title
Instance-Adaptive Video Compression: Improving Neural Codecs by Training
  on the Test Set
Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set
T. V. Rozendaal
Johann Brehmer
Yunfan Zhang
Reza Pourreza
Auke Wiggers
Taco S. Cohen
44
24
0
19 Nov 2021
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
44
24
0
19 Nov 2021
Single-pass Object-adaptive Data Undersampling and Reconstruction for
  MRI
Single-pass Object-adaptive Data Undersampling and Reconstruction for MRI
Zhishen Huang
S. Ravishankar
MedIm
65
9
0
17 Nov 2021
Towards Interpretable and Reliable Reading Comprehension: A Pipeline
  Model with Unanswerability Prediction
Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Sen Yoshida
51
7
0
17 Nov 2021
Multi-Vector Models with Textual Guidance for Fine-Grained Scientific
  Document Similarity
Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity
Sheshera Mysore
Arman Cohan
Tom Hope
42
39
0
16 Nov 2021
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
Jiangchao Yao
Shengyu Zhang
Yang Yao
Feng Wang
Jianxin Ma
...
Kun Kuang
Chao-Xiang Wu
Fei Wu
Jingren Zhou
Hongxia Yang
42
91
0
11 Nov 2021
Multi-Objective Optimization for Value-Sensitive and Sustainable Basket
  Recommendations
Multi-Objective Optimization for Value-Sensitive and Sustainable Basket Recommendations
Thomas Asikis
37
2
0
10 Nov 2021
Efficient Neural Network Training via Forward and Backward Propagation
  Sparsification
Efficient Neural Network Training via Forward and Backward Propagation Sparsification
Xiao Zhou
Weizhong Zhang
Zonghao Chen
Shizhe Diao
Tong Zhang
47
46
0
10 Nov 2021
AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On
  Analog Compute-in-Memory Accelerator
AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator
Chuteng Zhou
F. García-Redondo
Julian Büchel
I. Boybat
Xavier Timoneda Comas
S. Nandakumar
Shidhartha Das
Abu Sebastian
Manuel Le Gallo
P. Whatmough
43
16
0
10 Nov 2021
A Survey on Green Deep Learning
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
88
83
0
08 Nov 2021
MQBench: Towards Reproducible and Deployable Model Quantization
  Benchmark
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li
Mingzhu Shen
Jian Ma
Yan Ren
Mingxin Zhao
Qi Zhang
Ruihao Gong
F. Yu
Junjie Yan
MQ
37
49
0
05 Nov 2021
Is Bang-Bang Control All You Need? Solving Continuous Control with
  Bernoulli Policies
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
43
44
0
03 Nov 2021
PatchGame: Learning to Signal Mid-level Patches in Referential Games
PatchGame: Learning to Signal Mid-level Patches in Referential Games
Kamal Gupta
Gowthami Somepalli
Anubhav Gupta
Vinoj Jayasundara
Matthias Zwicker
Abhinav Shrivastava
30
3
0
02 Nov 2021
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise
  Mixed Schemes and Multiple Precisions
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions
Sung-En Chang
Yanyu Li
Mengshu Sun
Weiwen Jiang
Sijia Liu
Yanzhi Wang
Xue Lin
MQ
36
10
0
30 Oct 2021
Sparsely Changing Latent States for Prediction and Planning in Partially
  Observable Domains
Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
Christian Gumbsch
Martin Volker Butz
Georg Martius
AI4CE
31
21
0
29 Oct 2021
Learning to Ground Multi-Agent Communication with Autoencoders
Learning to Ground Multi-Agent Communication with Autoencoders
Toru Lin
Minyoung Huh
C. Stauffer
Ser-Nam Lim
Phillip Isola
AI4CE
48
52
0
28 Oct 2021
PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks
  with Probabilities over Representations
PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks with Probabilities over Representations
Louis Fortier-Dubois
Gaël Letarte
Benjamin Leblanc
Franccois Laviolette
Pascal Germain
UQCV
32
0
0
28 Oct 2021
Learning where to learn: Gradient sparsity in meta and continual
  learning
Learning where to learn: Gradient sparsity in meta and continual learning
J. Oswald
Dominic Zhao
Seijin Kobayashi
Simon Schug
Massimo Caccia
Nicolas Zucchet
João Sacramento
CLL
33
47
0
27 Oct 2021
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks
Yonggan Fu
Qixuan Yu
Yang Zhang
Shan-Hung Wu
Ouyang Xu
David D. Cox
Yingyan Lin
AAML
OOD
38
29
0
26 Oct 2021
CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator
CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator
Alek Dimitriev
Mingyuan Zhou
23
7
0
26 Oct 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
40
41
0
26 Oct 2021
Part & Whole Extraction: Towards A Deep Understanding of Quantitative
  Facts for Percentages in Text
Part & Whole Extraction: Towards A Deep Understanding of Quantitative Facts for Percentages in Text
Lei Fang
Jian-Guang Lou
30
4
0
26 Oct 2021
Multitask Adaptation by Retrospective Exploration with Learned World
  Models
Multitask Adaptation by Retrospective Exploration with Learned World Models
Artem Zholus
Aleksandr I. Panov
CLL
27
0
0
25 Oct 2021
Demystifying and Generalizing BinaryConnect
Demystifying and Generalizing BinaryConnect
Abhishek Sharma
Yaoliang Yu
Eyyub Sari
Mahdi Zolnouri
V. Nia
MQ
28
9
0
25 Oct 2021
Efficient and Robust Mixed-Integer Optimization Methods for Training
  Binarized Deep Neural Networks
Efficient and Robust Mixed-Integer Optimization Methods for Training Binarized Deep Neural Networks
Jannis Kurtz
B. Bah
MQ
26
4
0
21 Oct 2021
Wideband and Entropy-Aware Deep Soft Bit Quantization
Wideband and Entropy-Aware Deep Soft Bit Quantization
Marius Arvinte
Jonathan I. Tamir
MQ
21
0
0
18 Oct 2021
BERMo: What can BERT learn from ELMo?
BERMo: What can BERT learn from ELMo?
Sangamesh Kodge
Kaushik Roy
45
3
0
18 Oct 2021
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary
  Neural Networks
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
Yikai Wang
Yi Yang
Gang Hua
Anbang Yao
MQ
34
15
0
18 Oct 2021
Taming Visually Guided Sound Generation
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
43
123
0
17 Oct 2021
Case-based Reasoning for Better Generalization in Textual Reinforcement
  Learning
Case-based Reasoning for Better Generalization in Textual Reinforcement Learning
Mattia Atzeni
Shehzaad Dhuliawala
K. Murugesan
Mrinmaya Sachan
OOD
OffRL
38
11
0
16 Oct 2021
Hindsight Network Credit Assignment: Efficient Credit Assignment in
  Networks of Discrete Stochastic Units
Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
K. Young
31
0
0
14 Oct 2021
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Xiangyang Liu
Tianxiang Sun
Junliang He
Jiawen Wu
Lingling Wu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
ELM
33
46
0
13 Oct 2021
Model-Agnostic Meta-Attack: Towards Reliable Evaluation of Adversarial
  Robustness
Model-Agnostic Meta-Attack: Towards Reliable Evaluation of Adversarial Robustness
Xiao Yang
Yinpeng Dong
Wenzhao Xiang
Tianyu Pang
Hang Su
Jun Zhu
AAML
32
4
0
13 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained
  Optimization
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
49
63
0
13 Oct 2021
Gated Information Bottleneck for Generalization in Sequential
  Environments
Gated Information Bottleneck for Generalization in Sequential Environments
Francesco Alesiani
Shujian Yu
Xi Yu
OOD
AAML
24
13
0
12 Oct 2021
Improving Binary Neural Networks through Fully Utilizing Latent Weights
Improving Binary Neural Networks through Fully Utilizing Latent Weights
Weixiang Xu
Qiang Chen
Xiangyu He
Peisong Wang
Jian Cheng
MQ
42
6
0
12 Oct 2021
RWN: Robust Watermarking Network for Image Cropping Localization
RWN: Robust Watermarking Network for Image Cropping Localization
Qichao Ying
Xiaoxiao Hu
Xinming Zhang
Zhenxing Qian
Xinpeng Zhang
22
11
0
12 Oct 2021
A comprehensive review of Binary Neural Network
A comprehensive review of Binary Neural Network
Chunyu Yuan
S. Agaian
MQ
50
95
0
11 Oct 2021
Haar Wavelet Feature Compression for Quantized Graph Convolutional
  Networks
Haar Wavelet Feature Compression for Quantized Graph Convolutional Networks
Moshe Eliasof
Ben Bodner
Eran Treister
GNN
40
8
0
10 Oct 2021
Invertible Tone Mapping with Selectable Styles
Invertible Tone Mapping with Selectable Styles
Zhuming Zhang
Menghan Xia
Xueting Liu
Chengze Li
T. Wong
14
0
0
09 Oct 2021
Weakly Supervised Concept Map Generation through Task-Guided Graph
  Translation
Weakly Supervised Concept Map Generation through Task-Guided Graph Translation
Jiaying Lu
Xiangjue Dong
Carl Yang
37
3
0
08 Oct 2021
FRL: Federated Rank Learning
FRL: Federated Rank Learning
Hamid Mozaffari
Virat Shejwalkar
Amir Houmansadr
FedML
34
11
0
08 Oct 2021
Sparse MoEs meet Efficient Ensembles
Sparse MoEs meet Efficient Ensembles
J. Allingham
F. Wenzel
Zelda E. Mariet
Basil Mustafa
J. Puigcerver
...
Balaji Lakshminarayanan
Jasper Snoek
Dustin Tran
Carlos Riquelme Ruiz
Rodolphe Jenatton
MoE
51
21
0
07 Oct 2021
End-to-End Supermask Pruning: Learning to Prune Image Captioning Models
End-to-End Supermask Pruning: Learning to Prune Image Captioning Models
J. Tan
C. Chan
Joon Huang Chuah
VLM
72
16
0
07 Oct 2021
Adversarial Attacks on Spiking Convolutional Neural Networks for
  Event-based Vision
Adversarial Attacks on Spiking Convolutional Neural Networks for Event-based Vision
Julian Buchel
Gregor Lenz
Yalun Hu
Sadique Sheik
M. Sorbaro
AAML
48
15
0
06 Oct 2021
CBP: Backpropagation with constraint on weight precision using a
  pseudo-Lagrange multiplier method
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method
Guhyun Kim
D. Jeong
MQ
57
2
0
06 Oct 2021
Unsupervised Speech Segmentation and Variable Rate Representation
  Learning using Segmental Contrastive Predictive Coding
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
SSL
69
22
0
05 Oct 2021
A Review of the Gumbel-max Trick and its Extensions for Discrete
  Stochasticity in Machine Learning
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
48
95
0
04 Oct 2021
One Timestep is All You Need: Training Spiking Neural Networks with
  Ultra Low Latency
One Timestep is All You Need: Training Spiking Neural Networks with Ultra Low Latency
Sayeed Shafayet Chowdhury
Nitin Rathi
Kaushik Roy
46
40
0
01 Oct 2021
Towards Efficient Post-training Quantization of Pre-trained Language
  Models
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
Michael R. Lyu
MQ
82
47
0
30 Sep 2021
Previous
123...212223...363738
Next