Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1308.3432
Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"
50 / 1,874 papers shown
Title
LEAP: Learnable Pruning for Transformer-based Models
Z. Yao
Xiaoxia Wu
Linjian Ma
Sheng Shen
Kurt Keutzer
Michael W. Mahoney
Yuxiong He
30
7
0
30 May 2021
ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables
Alek Dimitriev
Mingyuan Zhou
21
12
0
28 May 2021
Learning to Extend Program Graphs to Work-in-Progress Code
Xuechen Li
Chris J. Maddison
Daniel Tarlow
26
2
0
28 May 2021
Differentiable Artificial Reverberation
Sungho Lee
Hyeong-Seok Choi
Kyogu Lee
VLM
38
25
0
28 May 2021
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
32
30
0
28 May 2021
Towards Efficient Full 8-bit Integer DNN Online Training on Resource-limited Devices without Batch Normalization
Yukuan Yang
Xiaowei Chi
Lei Deng
Tianyi Yan
Feng Gao
Guoqi Li
MQ
28
6
0
27 May 2021
Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval
Zijing Ou
Qinliang Su
Jianxing Yu
Bang Liu
Jingwen Wang
Ruihui Zhao
Changyou Chen
Yefeng Zheng
16
3
0
27 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT
VLM
54
770
0
26 May 2021
BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer
Haoping Bai
Mengsi Cao
Ping Huang
Jiulong Shan
MQ
27
34
0
19 May 2021
Parallel Attention Network with Sequence Matching for Video Grounding
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
25
40
0
18 May 2021
Dynamic Multi-Branch Layers for On-Device Neural Machine Translation
Zhixing Tan
Zeyuan Yang
Meng Zhang
Qun Liu
Maosong Sun
Yang Liu
AI4CE
24
4
0
14 May 2021
End-to-End Sequential Sampling and Reconstruction for MRI
Tianwei Yin
Zihui Wu
He Sun
Adrian Dalca
Yisong Yue
Katherine Bouman
21
19
0
13 May 2021
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening
Wenqi Shao
Hang Yu
Zhaoyang Zhang
Hang Xu
Zhenguo Li
Ping Luo
AAML
17
2
0
13 May 2021
Unsupervised Hashing with Contrastive Information Bottleneck
Zexuan Qiu
Qinliang Su
Zijing Ou
Jianxing Yu
Changyou Chen
SSL
23
84
0
13 May 2021
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation
Ahmad Rashid
Vasileios Lioutas
Mehdi Rezagholizadeh
AAML
26
36
0
12 May 2021
Discrete representations in neural models of spoken language
Bertrand Higy
Lieke Gelderloos
Afra Alishahi
Grzegorz Chrupała
29
6
0
12 May 2021
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
Yikang Shen
Chun-Fu Chen
Quanfu Fan
Ximeng Sun
Kate Saenko
A. Oliva
Rogerio Feris
41
47
0
11 May 2021
Using Deep Neural Networks to Predict and Improve the Performance of Polar Codes
Mathieu Léonardon
Vincent Gripon
14
5
0
11 May 2021
Rationalization through Concepts
Diego Antognini
Boi Faltings
FAtt
27
19
0
11 May 2021
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization, and Ultra-Low Latency Acceleration
Yao Chen
Cole Hawkins
Kaiqi Zhang
Zheng Zhang
Cong Hao
26
8
0
11 May 2021
Continual Learning via Bit-Level Information Preserving
Yujun Shi
Li-xin Yuan
Yunpeng Chen
Jiashi Feng
CLL
23
42
0
10 May 2021
Effective Sparsification of Neural Networks with Global Sparsity Constraint
Xiao Zhou
Weizhong Zhang
Hang Xu
Tong Zhang
21
62
0
03 May 2021
Stealthy Backdoors as Compression Artifacts
Yulong Tian
Fnu Suya
Fengyuan Xu
David Evans
40
22
0
30 Apr 2021
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety
Sebastian Houben
Stephanie Abrecht
Maram Akila
Andreas Bär
Felix Brockherde
...
Serin Varghese
Michael Weber
Sebastian J. Wirkert
Tim Wirtz
Matthias Woehrle
AAML
13
58
0
29 Apr 2021
3D Scene Compression through Entropy Penalized Neural Representation Functions
Thomas Bird
Johannes Ballé
Saurabh Singh
P. Chou
41
30
0
26 Apr 2021
Skip-Convolutions for Efficient Video Processing
A. Habibian
Davide Abati
Taco S. Cohen
B. Bejnordi
56
50
0
23 Apr 2021
Differentiable Model Compression via Pseudo Quantization Noise
Alexandre Défossez
Yossi Adi
Gabriel Synnaeve
DiffM
MQ
26
48
0
20 Apr 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL
MoE
23
25
0
19 Apr 2021
Lottery Jackpots Exist in Pre-trained Models
Yuxin Zhang
Mingbao Lin
Yan Wang
Rongrong Ji
Rongrong Ji
35
15
0
18 Apr 2021
"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen
Zhenyu Zhang
Xu Ouyang
Zechun Liu
Zhiqiang Shen
Zhangyang Wang
MQ
46
36
0
16 Apr 2021
Matching-oriented Product Quantization For Ad-hoc Retrieval
Shitao Xiao
Zheng Liu
Yingxia Shao
Defu Lian
Xing Xie
MQ
12
9
0
16 Apr 2021
Disentangling Representations of Text by Masking Transformers
Xiongyi Zhang
Jan-Willem van de Meent
Byron C. Wallace
DRL
19
18
0
14 Apr 2021
End-to-end Keyword Spotting using Neural Architecture Search and Quantization
David Peter
Wolfgang Roth
Franz Pernkopf
MQ
35
14
0
14 Apr 2021
ENOS: Energy-Aware Network Operator Search for Hybrid Digital and Compute-in-Memory DNN Accelerators
Shamma Nasrin
A. Shylendra
Yuti Kadakia
N. Iliev
Wilfred Gomes
Theja Tulabandhula
A. R. Trivedi
MQ
17
2
0
12 Apr 2021
Unsupervised Learning of Explainable Parse Trees for Improved Generalisation
Atul Sahay
Ayush Maheshwari
Ritesh Kumar
Ganesh Ramakrishnan
M. Hanawal
K. Arya
LRM
11
1
0
11 Apr 2021
Neural Feature Search for RGB-Infrared Person Re-Identification
Yehansen Chen
Lin Wan
Zhihang Li
Qianyan Jing
Zongyuan Sun
52
138
0
06 Apr 2021
Network Quantization with Element-wise Gradient Scaling
Junghyup Lee
Dohyung Kim
Bumsub Ham
MQ
18
117
0
02 Apr 2021
Unsupervised Multi-Index Semantic Hashing
Christian B. Hansen
Casper Hansen
J. Simonsen
Stephen Alstrup
Christina Lioma
27
8
0
26 Mar 2021
Projected Hamming Dissimilarity for Bit-Level Importance Coding in Collaborative Filtering
Christian B. Hansen
Casper Hansen
J. Simonsen
Christina Lioma
29
5
0
26 Mar 2021
Boosting Binary Masks for Multi-Domain Learning through Affine Transformations
Massimiliano Mancini
Elisa Ricci
Barbara Caputo
Samuel Rota Buló
19
7
0
25 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
60
94
0
23 Mar 2021
ReCU: Reviving the Dead Weights in Binary Neural Networks
Zihan Xu
Mingbao Lin
Jianzhuang Liu
Jie Chen
Ling Shao
Yue Gao
Yonghong Tian
Rongrong Ji
MQ
26
81
0
23 Mar 2021
Weakly Supervised Recovery of Semantic Attributes
Ameen Ali
Tomer Galanti
Evgeniy Zheltonozhskiy
Chaim Baskin
Lior Wolf
37
0
0
22 Mar 2021
Learning Optimal Fronthauling and Decentralized Edge Computation in Fog Radio Access Networks
Hoon Lee
Junbeom Kim
Seok-Hwan Park
22
12
0
21 Mar 2021
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
Jialun Peng
Dong Liu
Songcen Xu
Houqiang Li
DiffM
25
191
0
18 Mar 2021
Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network
James Diffenderfer
B. Kailkhura
MQ
37
75
0
17 Mar 2021
Learnable Companding Quantization for Accurate Low-bit Neural Networks
Kohei Yamamoto
MQ
36
64
0
12 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL
DRL
32
23
0
10 Mar 2021
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
Samik Sadhu
Di He
Che-Wei Huang
Sri Harish Reddy Mallidi
Minhua Wu
Ariya Rastrow
A. Stolcke
J. Droppo
Roland Maas
SSL
20
48
0
09 Mar 2021
BERTese: Learning to Speak to BERT
Adi Haviv
Jonathan Berant
Amir Globerson
32
123
0
09 Mar 2021
Previous
1
2
3
...
24
25
26
...
36
37
38
Next