ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXivPDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,872 papers shown
Title
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
Elias Frantar
Dan Alistarh
MQ
MoE
34
25
0
25 Oct 2023
SpikingJelly: An open-source machine learning infrastructure platform
  for spike-based intelligence
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Wei Fang
Yanqing Chen
Jianhao Ding
Zhaofei Yu
T. Masquelier
Ding Chen
Liwei Huang
Huihui Zhou
Guoqi Li
Yonghong Tian
36
206
0
25 Oct 2023
On the Interplay between Fairness and Explainability
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
27
4
0
25 Oct 2023
Graph Deep Learning for Time Series Forecasting
Graph Deep Learning for Time Series Forecasting
Andrea Cini
Ivan Marisca
Daniele Zambon
Cesare Alippi
AI4TS
AI4CE
32
14
0
24 Oct 2023
VMAF Re-implementation on PyTorch: Some Experimental Results
VMAF Re-implementation on PyTorch: Some Experimental Results
Kirill Aistov
Maxim Koroteev
41
1
0
24 Oct 2023
Towards Hybrid-grained Feature Interaction Selection for Deep Sparse
  Network
Towards Hybrid-grained Feature Interaction Selection for Deep Sparse Network
Fuyuan Lyu
Xing Tang
Dugang Liu
Chen Ma
Weihong Luo
Liang Chen
Xiuqiang He
Xue Liu
21
2
0
23 Oct 2023
Projected Stochastic Gradient Descent with Quantum Annealed Binary
  Gradients
Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients
Maximilian Krahn
Michele Sasdelli
Fengyi Yang
Vladislav Golyanik
Arno Solin
Tat-Jun Chin
Tolga Birdal
MQ
87
2
0
23 Oct 2023
SpVOS: Efficient Video Object Segmentation with Triple Sparse
  Convolution
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution
Weihao Lin
Tao Chen
Chong Yu
VOS
21
3
0
23 Oct 2023
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised
  Anomaly Detection
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection
Ruiying Lu
YuJie Wu
Long Tian
Dongsheng Wang
Bo Chen
Xiyang Liu
Ruimin Hu
33
39
0
22 Oct 2023
Calibrating Neural Simulation-Based Inference with Differentiable
  Coverage Probability
Calibrating Neural Simulation-Based Inference with Differentiable Coverage Probability
Maciej Falkiewicz
Naoya Takeishi
Imahn Shekhzadeh
Antoine Wehenkel
Arnaud Delaunoy
Gilles Louppe
Alexandros Kalousis
31
6
0
20 Oct 2023
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming
  with Feasibility Guarantee
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming with Feasibility Guarantee
Haoyu Wang
Jialin Liu
Xiaohan Chen
Xinshang Wang
Pan Li
Wotao Yin
34
3
0
20 Oct 2023
BitNet: Scaling 1-bit Transformers for Large Language Models
BitNet: Scaling 1-bit Transformers for Large Language Models
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Huaijie Wang
Lingxiao Ma
Fan Yang
Ruiping Wang
Yi Wu
Furu Wei
MQ
34
100
0
17 Oct 2023
Tracking and Mapping in Medical Computer Vision: A Review
Tracking and Mapping in Medical Computer Vision: A Review
Adam Schmidt
Omid Mohareri
S. DiMaio
Michael C. Yip
Septimiu E. Salcudean
50
34
0
17 Oct 2023
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
Wenhua Cheng
Yiyang Cai
Kaokao Lv
Haihao Shen
MQ
33
7
0
17 Oct 2023
Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence
  Classification
Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification
Junjie Dong
Mudi Jiang
Lianyu Hu
Zengyou He
25
0
0
16 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for
  Reinforcement Learning
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian Sun
Yetian Yuan
Gao Huang
63
34
0
14 Oct 2023
Sub-network Discovery and Soft-masking for Continual Learning of Mixed
  Tasks
Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks
Zixuan Ke
Bing Liu
Wenhan Xiong
Asli Celikyilmaz
Haoran Li
CLL
37
5
0
13 Oct 2023
Real-Time Neural BRDF with Spherically Distributed Primitives
Real-Time Neural BRDF with Spherically Distributed Primitives
Yishun Dou
Zhong Zheng
Qiaoqiao Jin
Bingbing Ni
Yugang Chen
Junxiang Ke
3DH
24
1
0
12 Oct 2023
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large
  Language Models
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jing Liu
Ruihao Gong
Xiuying Wei
Zhiwei Dong
Jianfei Cai
Bohan Zhuang
MQ
35
51
0
12 Oct 2023
AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE
AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE
Wei Ao
Vishnu Boddeti
AAML
33
18
0
12 Oct 2023
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration
A. Sridhar
Dhruv Shah
Catherine Glossop
Sergey Levine
42
115
0
11 Oct 2023
Generalized Neural Sorting Networks with Error-Free Differentiable Swap
  Functions
Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions
Jungtaek Kim
Jeongbeen Yoon
Minsu Cho
13
1
0
11 Oct 2023
Breaking Down Word Semantics from Pre-trained Language Models through
  Layer-wise Dimension Selection
Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection
Nayoung Choi
21
0
0
08 Oct 2023
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM
  Inference?
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Cheng Zhang
Jianyi Cheng
Ilia Shumailov
George A. Constantinides
Yiren Zhao
MQ
21
9
0
08 Oct 2023
Exploiting Activation Sparsity with Dense to Dynamic-k
  Mixture-of-Experts Conversion
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion
Filip Szatkowski
Eric Elmoznino
Younesse Kaddar
Simone Scardapane
MoE
41
5
0
06 Oct 2023
Taming Binarized Neural Networks and Mixed-Integer Programs
Taming Binarized Neural Networks and Mixed-Integer Programs
Johannes Aspman
Georgios Korpas
Jakub Marecek
AI4CE
16
7
0
05 Oct 2023
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit
  Diffusion Models
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
Yefei He
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
DiffM
MQ
24
48
0
05 Oct 2023
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
Deniz Bayazit
Negar Foroutan
Zeming Chen
Gail Weiss
Antoine Bosselut
KELM
32
14
0
04 Oct 2023
Soft Convex Quantization: Revisiting Vector Quantization with Convex
  Optimization
Soft Convex Quantization: Revisiting Vector Quantization with Convex Optimization
Tanmay Gautam
Reid Pryzant
Ziyi Yang
Chenguang Zhu
Somayeh Sojoudi
MQ
24
4
0
04 Oct 2023
NOLA: Compressing LoRA using Linear Combination of Random Basis
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani
K. Navaneet
Parsa Nooralinejad
Soheil Kolouri
Hamed Pirsiavash
40
12
0
04 Oct 2023
QuATON: Quantization Aware Training of Optical Neurons
QuATON: Quantization Aware Training of Optical Neurons
Hasindu Kariyawasam
Ramith Hettiarachchi
Quansan Yang
Alex Matlock
Takahiro Nambara
Hiroyuki Kusaka
Yuichiro Kunai
Peter T C So
Edward S Boyden
D. Wadduwage
MQ
27
1
0
04 Oct 2023
Feather: An Elegant Solution to Effective DNN Sparsification
Feather: An Elegant Solution to Effective DNN Sparsification
Athanasios Glentis Georgoulakis
George Retsinas
Petros Maragos
32
0
0
03 Oct 2023
FedL2P: Federated Learning to Personalize
FedL2P: Federated Learning to Personalize
Royson Lee
Minyoung Kim
Da Li
Xinchi Qiu
Timothy M. Hospedales
Ferenc Huszár
Nicholas D. Lane
FedML
18
0
0
03 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
32
8
0
03 Oct 2023
Equivariant Adaptation of Large Pretrained Models
Equivariant Adaptation of Large Pretrained Models
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sekouba Kaba
Sai Rajeswar
Siamak Ravanbakhsh
58
28
0
02 Oct 2023
BTR: Binary Token Representations for Efficient Retrieval Augmented
  Language Models
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Qingqing Cao
Sewon Min
Yizhong Wang
Hannaneh Hajishirzi
MQ
RALM
40
4
0
02 Oct 2023
Modularity in Deep Learning: A Survey
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
40
2
0
02 Oct 2023
Sparse Backpropagation for MoE Training
Sparse Backpropagation for MoE Training
Liyuan Liu
Jianfeng Gao
Weizhu Chen
MoE
34
9
0
01 Oct 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better
  Generalization in Reinforcement Learning
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
20
3
0
30 Sep 2023
PB-LLM: Partially Binarized Large Language Models
PB-LLM: Partially Binarized Large Language Models
Yuzhang Shang
Zhihang Yuan
Qiang Wu
Zhen Dong
MQ
31
44
0
29 Sep 2023
SHACIRA: Scalable HAsh-grid Compression for Implicit Neural
  Representations
SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations
Sharath Girish
Abhinav Shrivastava
Kamal Gupta
45
23
0
27 Sep 2023
Finite Scalar Quantization: VQ-VAE Made Simple
Finite Scalar Quantization: VQ-VAE Made Simple
Fabian Mentzer
David C. Minnen
E. Agustsson
Michael Tschannen
47
155
0
27 Sep 2023
LAPP: Layer Adaptive Progressive Pruning for Compressing CNNs from
  Scratch
LAPP: Layer Adaptive Progressive Pruning for Compressing CNNs from Scratch
P. Zhai
K. Guo
F. Liu
Xiaofen Xing
Xiangmin Xu
26
3
0
25 Sep 2023
Flow Factorized Representation Learning
Flow Factorized Representation Learning
Yue Song
Thomas Anderson Keller
N. Sebe
Max Welling
DRL
OOD
31
3
0
22 Sep 2023
SupeRBNN: Randomized Binary Neural Network Using Adiabatic
  Superconductor Josephson Devices
SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices
Zechao Li
Geng Yuan
Tomoharu Yamauchi
Zabihi Masoud
Yanyue Xie
...
Xulong Tang
Nobuyuki Yoshikawa
Devesh Tiwari
Yanzhi Wang
O. Chen
MQ
25
4
0
21 Sep 2023
BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision
Jinzhao Zhou
Yiqun Duan
Yu-Cheng Chang
Yu-Kai Wang
Chin-Teng Lin
39
6
0
21 Sep 2023
Information based explanation methods for deep learning agents -- with
  applications on large open-source chess models
Information based explanation methods for deep learning agents -- with applications on large open-source chess models
Patrik Hammersborg
Inga Strümke
15
1
0
18 Sep 2023
Mitigating Adversarial Attacks in Federated Learning with Trusted
  Execution Environments
Mitigating Adversarial Attacks in Federated Learning with Trusted Execution Environments
Simon Queyrut
V. Schiavoni
Pascal Felber
AAML
FedML
37
6
0
13 Sep 2023
Differentiable JPEG: The Devil is in the Details
Differentiable JPEG: The Devil is in the Details
Christoph Reich
Biplob K. Debnath
Deep Patel
S. Chakradhar
DiffM
26
9
0
13 Sep 2023
Optimize Weight Rounding via Signed Gradient Descent for the
  Quantization of LLMs
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Wenhua Cheng
Weiwei Zhang
Haihao Shen
Yiyang Cai
Xin He
Kaokao Lv
Yi. Liu
MQ
36
22
0
11 Sep 2023
Previous
123...91011...363738
Next