Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,876 papers shown
Title
Robust Vector Quantized-Variational Autoencoder
Chieh-Hsin Lai
Dongmian Zou
Gilad Lerman
DRL
37
5
0
04 Feb 2022
Learning strides in convolutional neural networks
Rachid Riad
O. Teboul
David Grangier
Neil Zeghidour
44
42
0
03 Feb 2022
Robust Binary Models by Pruning Randomly-initialized Networks
Chen Liu
Ziqi Zhao
Sabine Süsstrunk
Mathieu Salzmann
TPM
AAML
MQ
39
4
0
03 Feb 2022
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization
Dianbo Liu
Alex Lamb
Xu Ji
Pascal Junior Tikeng Notsawo
Michael C. Mozer
Yoshua Bengio
Kenji Kawaguchi
24
14
0
02 Feb 2022
Unified Scaling Laws for Routed Language Models
Aidan Clark
Diego de Las Casas
Aurelia Guy
A. Mensch
Michela Paganini
...
Oriol Vinyals
Jack W. Rae
Erich Elsen
Koray Kavukcuoglu
Karen Simonyan
MoE
50
177
0
02 Feb 2022
Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction
Georgii Sergeevich Novikov
Daniel Bershatsky
Julia Gusak
Alex Shonenkov
Denis Dimitrov
Ivan Oseledets
MQ
39
17
0
01 Feb 2022
Recognition-Aware Learned Image Compression
Maxime Kawawa-Beaudan
Ryan Roggenkemper
A. Zakhor
38
5
0
01 Feb 2022
Signing the Supermask: Keep, Hide, Invert
Nils Koster
O. Grothe
Achim Rettinger
45
11
0
31 Jan 2022
Inverse design of photonic devices with strict foundry fabrication constraints
M. Schubert
A. C. H. Cheung
Ian A. D. Williamson
Aleksandra Spyra
David H. Alexander
38
51
0
31 Jan 2022
OptG: Optimizing Gradient-driven Criteria in Network Sparsity
Yuxin Zhang
Mingbao Lin
Mengzhao Chen
Rongrong Ji
52
5
0
30 Jan 2022
Scale-arbitrary Invertible Image Downscaling
Jinbo Xing
Wenbo Hu
T. Wong
65
12
0
29 Jan 2022
S$^3$NN: Time Step Reduction of Spiking Surrogate Gradients for Training Energy Efficient Single-Step Spiking Neural Networks
Kazuma Suetake
Shin-ichi Ikegawa
Ryuji Saiin
Yoshihide Sawada
37
4
0
26 Jan 2022
Neural Network Quantization with AI Model Efficiency Toolkit (AIMET)
S. Siddegowda
Marios Fournarakis
Markus Nagel
Tijmen Blankevoort
Chirag I. Patel
Abhijit Khobare
MQ
36
32
0
20 Jan 2022
Q-ViT: Fully Differentiable Quantization for Vision Transformer
Zhexin Li
Tong Yang
Peisong Wang
Jian Cheng
ViT
MQ
44
41
0
19 Jan 2022
Differentiable Rule Induction with Learned Relational Features
R. Kusters
Yusik Kim
Marine Collery
C. Marie
Shubham Gupta
29
14
0
17 Jan 2022
Improving Performance of Semantic Segmentation CycleGANs by Noise Injection into the Latent Segmentation Space
Jonas Löhdefink
Tim Fingscheidt
31
2
0
17 Jan 2022
UWC: Unit-wise Calibration Towards Rapid Network Compression
Chen Lin
Zheyang Li
Bo Peng
Haoji Hu
Wenming Tan
Ye Ren
Shiliang Pu
MQ
32
1
0
17 Jan 2022
Control of Dual-Sourcing Inventory Systems using Recurrent Neural Networks
Lucas Böttcher
Thomas Asikis
I. Fragkos
BDL
40
10
0
16 Jan 2022
ViT2Hash: Unsupervised Information-Preserving Hashing
Qinkang Gong
Liangdao Wang
Hanjiang Lai
Yan Pan
Jian Yin
16
4
0
14 Jan 2022
Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval
Shitao Xiao
Zheng Liu
Weihao Han
Jianjin Zhang
Yingxia Shao
...
Hao Sun
Denvy Deng
Liangjie Zhang
Qi Zhang
Xing Xie
40
17
0
14 Jan 2022
Making a (Counterfactual) Difference One Rationale at a Time
Michael J. Plyler
Michal Green
Min Chi
34
11
0
13 Jan 2022
Automatic Sparse Connectivity Learning for Neural Networks
Zhimin Tang
Linkai Luo
Bike Xie
Yiyu Zhu
Rujie Zhao
Lvqing Bi
Chao Lu
35
40
0
13 Jan 2022
A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow
M. Momenifar
Enmao Diao
Vahid Tarokh
A. Bragg
AI4CE
11
4
0
10 Jan 2022
Learning with Latent Structures in Natural Language Processing: A Survey
Zhaofeng Wu
BDL
DRL
34
4
0
03 Jan 2022
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
Runpei Dong
Zhanhong Tan
Mengdi Wu
Linfeng Zhang
Kaisheng Ma
MQ
52
11
0
30 Dec 2021
Automatic Mixed-Precision Quantization Search of BERT
Changsheng Zhao
Ting Hua
Yilin Shen
Qian Lou
Hongxia Jin
MQ
25
19
0
30 Dec 2021
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
Xiaonan Nie
Xupeng Miao
Shijie Cao
Lingxiao Ma
Qibin Liu
Jilong Xue
Youshan Miao
Yi Liu
Zhi-Xin Yang
Tengjiao Wang
MoMe
MoE
32
23
0
29 Dec 2021
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
36
49
0
27 Dec 2021
Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression
Zongyu Guo
Runsen Feng
Zhizheng Zhang
Xin Jin
Zhibo Chen
29
15
0
26 Dec 2021
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch
Souvik Kundu
Shikai Wang
Qirui Sun
Peter A. Beerel
Massoud Pedram
MQ
43
18
0
24 Dec 2021
Implicit Neural Video Compression
Yunfan Zhang
T. V. Rozendaal
Johann Brehmer
Markus Nagel
Taco S. Cohen
54
57
0
21 Dec 2021
A Theoretical View of Linear Backpropagation and Its Convergence
Ziang Li
Yiwen Guo
Haodi Liu
Changshui Zhang
AAML
29
3
0
21 Dec 2021
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
61
190
0
20 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
48
144
0
20 Dec 2021
Elastic-Link for Binarized Neural Network
Jie Hu
Ziheng Wu
Vince Tan
Zhilin Lu
Mengze Zeng
Enhua Wu
MQ
35
6
0
19 Dec 2021
Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network
An Tao
Yueqi Duan
He Wang
Ziyi Wu
Pengliang Ji
Haowen Sun
Jie Zhou
Jiwen Lu
51
1
0
17 Dec 2021
DNA: Dynamic Network Augmentation
Scott Mahan
T. Doster
Henry Kvinge
22
0
0
17 Dec 2021
LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network
Hochang Rhee
Y. Jang
Seyun Kim
N. Cho
35
29
0
13 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
52
0
0
10 Dec 2021
Neural Network Quantization for Efficient Inference: A Survey
Olivia Weng
MQ
33
23
0
08 Dec 2021
Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection
Jiangjiang Liu
Alexander Levine
Chun Pong Lau
Ramalingam Chellappa
Soheil Feizi
AAML
37
77
0
08 Dec 2021
DiPS: Differentiable Policy for Sketching in Recommender Systems
Aritra Ghosh
Saayan Mitra
Andrew Lan
BDL
OffRL
31
2
0
08 Dec 2021
Enhanced Exploration in Neural Feature Selection for Deep Click-Through Rate Prediction Models via Ensemble of Gating Layers
L. Guan
Xia Xiao
Ming-yue Chen
Youlong Cheng
32
1
0
07 Dec 2021
DANets: Deep Abstract Networks for Tabular Data Classification and Regression
Jintai Chen
Kuan-Yu Liao
Yao Wan
Danny Chen
Jian Wu
LMTD
56
50
0
06 Dec 2021
AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks
Huu Le
R. Høier
Che-Tsung Lin
Christopher Zach
55
17
0
06 Dec 2021
BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery
Chris Cundy
Aditya Grover
Stefano Ermon
CML
49
72
0
06 Dec 2021
Target Propagation via Regularized Inversion
Vincent Roulet
Zaïd Harchaoui
BDL
AAML
41
4
0
02 Dec 2021
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
Zechun Liu
Kwang-Ting Cheng
Dong Huang
Eric P. Xing
Zhiqiang Shen
MQ
48
104
0
29 Nov 2021
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
57
240
0
24 Nov 2021
Sharpness-aware Quantization for Deep Neural Networks
Jing Liu
Jianfei Cai
Bohan Zhuang
MQ
71
24
0
24 Nov 2021