ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXivPDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,872 papers shown
Title
End-to-End Training of Neural Networks for Automotive Radar Interference
  Mitigation
End-to-End Training of Neural Networks for Automotive Radar Interference Mitigation
Christian Oswald
Máté Tóth
Paul Meissner
Franz Pernkopf
AAML
26
3
0
15 Dec 2023
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech
  Recognition with Universal Speech Models
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Shaojin Ding
David Qiu
David Rim
Yanzhang He
Oleg Rybakov
...
Tara N. Sainath
Zhonglin Han
Jian Li
Amir Yazdanbakhsh
Shivani Agrawal
MQ
34
9
0
13 Dec 2023
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs
  for Embodied AI
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI
Kai Huang
Boyuan Yang
Wei Gao
37
1
0
13 Dec 2023
IDKM: Memory Efficient Neural Network Quantization via Implicit,
  Differentiable k-Means
IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means
Sean Jaffe
Ambuj K. Singh
Francesco Bullo
MQ
22
0
0
12 Dec 2023
Expand-and-Quantize: Unsupervised Semantic Segmentation Using
  High-Dimensional Space and Product Quantization
Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization
Jiyoung Kim
Kyuhong Shim
Insu Lee
B. Shim
19
2
0
12 Dec 2023
Building Universal Foundation Models for Medical Image Analysis with
  Spatially Adaptive Networks
Building Universal Foundation Models for Medical Image Analysis with Spatially Adaptive Networks
Lingxiao Luo
Xuanzhong Chen
Bingda Tang
Xinsheng Chen
Rong Han
Chengpeng Hu
Yujiang Li
Ting Chen
MedIm
34
2
0
12 Dec 2023
MaxQ: Multi-Axis Query for N:M Sparsity Network
MaxQ: Multi-Axis Query for N:M Sparsity Network
Jingyang Xiang
Siqi Li
Junhao Chen
Zhuangzhi Chen
Tianxin Huang
Linpeng Peng
Yong-Jin Liu
18
0
0
12 Dec 2023
When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate,
  & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks
When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate, & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks
Gourav Datta
Zeyu Liu
James Diffenderfer
B. Kailkhura
P. Beerel
46
0
0
12 Dec 2023
Noise Adaptor in Spiking Neural Networks
Noise Adaptor in Spiking Neural Networks
Chen Li
Bipin Rajendran
39
0
0
08 Dec 2023
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language
  Models with 3D Parallelism
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
41
31
0
08 Dec 2023
Finding Interpretable Class-Specific Patterns through Efficient Neural
  Search
Finding Interpretable Class-Specific Patterns through Efficient Neural Search
Nils Philipp Walter
Jonas Fischer
Jilles Vreeken
20
4
0
07 Dec 2023
SmoothQuant+: Accurate and Efficient 4-bit Post-Training
  WeightQuantization for LLM
SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM
Jiayi Pan
Chengcan Wang
Kaifu Zheng
Yangguang Li
Zhenyu Wang
Bin Feng
MQ
43
7
0
06 Dec 2023
Balanced Marginal and Joint Distributional Learning via Mixture
  Cramer-Wold Distance
Balanced Marginal and Joint Distributional Learning via Mixture Cramer-Wold Distance
SeungHwan An
Sungchul Hong
Jong-June Jeon
33
0
0
06 Dec 2023
Compositional Generalization for Data-to-Text Generation
Compositional Generalization for Data-to-Text Generation
Xinnuo Xu
Ivan Titov
Mirella Lapata
32
2
0
05 Dec 2023
Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise
  Distillation
Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation
Tianhao Peng
Ge Gao
Heming Sun
Fan Zhang
David Bull
16
4
0
05 Dec 2023
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D
  Hybrid Prior
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Xusen Sun
Longhao Zhang
Hao Zhu
Peng Zhang
Bang Zhang
Xinya Ji
Kangneng Zhou
Daiheng Gao
Liefeng Bo
Xun Cao
VGen
33
24
0
04 Dec 2023
Low-Precision Mixed-Computation Models for Inference on Edge
Low-Precision Mixed-Computation Models for Inference on Edge
Seyedarmin Azizi
M. Nazemi
M. Kamal
Massoud Pedram
MQ
38
1
0
03 Dec 2023
Learning High-Order Relationships of Brain Regions
Learning High-Order Relationships of Brain Regions
Weikang Qiu
Huangrui Chu
Selena Wang
Haolan Zuo
Xiaoxiao Li
Yize Zhao
Rex Ying
37
6
0
02 Dec 2023
Harnessing Discrete Representations For Continual Reinforcement Learning
Harnessing Discrete Representations For Continual Reinforcement Learning
Edan Meyer
Adam White
Marlos C. Machado
OffRL
41
4
0
02 Dec 2023
Mixed-Precision Quantization for Federated Learning on
  Resource-Constrained Heterogeneous Devices
Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices
Huancheng Chen
H. Vikalo
FedML
MQ
23
7
0
29 Nov 2023
Implicit-explicit Integrated Representations for Multi-view Video
  Compression
Implicit-explicit Integrated Representations for Multi-view Video Compression
Chen Zhu
Guo Lu
Bing He
Rong Xie
Li-Na Song
32
3
0
29 Nov 2023
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with
  Semantic Vector-Quantized Tokenizer
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer
Jacob Zhiyuan Fang
Skyler Zheng
Vasu Sharma
Robinson Piramuthu
VLM
38
0
0
28 Nov 2023
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for
  Imbalanced Medical Classification
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification
Jiahuan Yan
Haojun Gao
Zhang Kai
Weize Liu
Danny Chen
Jian Wu
Jintai Chen
26
3
0
28 Nov 2023
Learning to Skip for Language Modeling
Learning to Skip for Language Modeling
Dewen Zeng
Nan Du
Tao Wang
Yuanzhong Xu
Tao Lei
Zhifeng Chen
Claire Cui
25
11
0
26 Nov 2023
CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning
CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning
Shivam Aggarwal
Kuluhan Binici
Tulika Mitra
VLM
14
2
0
24 Nov 2023
Compact 3D Gaussian Representation for Radiance Field
Compact 3D Gaussian Representation for Radiance Field
J. Lee
Daniel Rho
Xiangyu Sun
Jong Hwan Ko
Eunbyung Park
3DGS
52
173
0
22 Nov 2023
Differentiable Sampling of Categorical Distributions Using the
  CatLog-Derivative Trick
Differentiable Sampling of Categorical Distributions Using the CatLog-Derivative Trick
Lennert De Smet
Emanuele Sansone
Pedro Zuidberg Dos Martires
22
11
0
21 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive
  Review
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
34
8
0
20 Nov 2023
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network
  Processing
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
MQ
24
8
0
18 Nov 2023
Interpretable Reinforcement Learning for Robotics and Continuous Control
Interpretable Reinforcement Learning for Robotics and Continuous Control
Rohan R. Paleja
Letian Chen
Yaru Niu
Andrew Silva
Zhaoxin Li
...
K. Chang
H. E. Tseng
Yan Wang
S. Nageshrao
Matthew C. Gombolay
37
7
0
16 Nov 2023
Adversarially Robust Spiking Neural Networks Through Conversion
Adversarially Robust Spiking Neural Networks Through Conversion
Ozan Özdenizci
Robert Legenstein
AAML
38
8
0
15 Nov 2023
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized
  Representation
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation
Jiangzong Wang
Pengcheng Li
Xulong Zhang
Ning Cheng
Jing Xiao
32
0
0
14 Nov 2023
Explainable History Distillation by Marked Temporal Point Process
Explainable History Distillation by Marked Temporal Point Process
Sishun Liu
Ke Deng
Yan Wang
Xiuzhen Zhang
35
0
0
13 Nov 2023
Pruning random resistive memory for optimizing analogue AI
Pruning random resistive memory for optimizing analogue AI
Yi Li
Song-jian Wang
Yaping Zhao
Shaocong Wang
Woyu Zhang
...
Xiaoxin Xu
Dashan Shang
Qi Liu
Kwang-Ting Cheng
Ming Liu
23
1
0
13 Nov 2023
AccEPT: An Acceleration Scheme for Speeding Up Edge Pipeline-parallel
  Training
AccEPT: An Acceleration Scheme for Speeding Up Edge Pipeline-parallel Training
Yuhao Chen
Yuxuan Yan
Qianqian Yang
Yuanchao Shu
Shibo He
Zhiguo Shi
Jiming Chen
43
0
0
10 Nov 2023
Real-Time Neural Rasterization for Large Scenes
Real-Time Neural Rasterization for Large Scenes
Jeffrey Yunfan Liu
Yun Chen
Ze Yang
Jingkang Wang
S. Manivasagam
R. Urtasun
AI4TS
AI4CE
53
35
0
09 Nov 2023
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized
  Architectures
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures
Anastasiia Prutianova
Alexey Zaytsev
Chung-Kuei Lee
Fengyu Sun
Ivan Koryakovskiy
MQ
21
0
0
09 Nov 2023
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO
  Networks
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks
Kartik Gupta
Akshay Asthana
MQ
29
8
0
09 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length
  Generalization with Scalability
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability
Jishnu Ray Chowdhury
Cornelia Caragea
37
5
0
08 Nov 2023
AFPQ: Asymmetric Floating Point Quantization for LLMs
AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang
Sicheng Zhang
Shijie Cao
Dayou Du
Jianyu Wei
Ting Cao
Ningyi Xu
MQ
33
5
0
03 Nov 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient
  Private Inference
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference
Wenxuan Zeng
Meng Li
Haichuan Yang
Wen-jie Lu
Runsheng Wang
Ru Huang
23
6
0
03 Nov 2023
Attacking Graph Neural Networks with Bit Flips: Weisfeiler and Lehman Go
  Indifferent
Attacking Graph Neural Networks with Bit Flips: Weisfeiler and Lehman Go Indifferent
Lorenz Kummer
Samir Moustafa
Nils N. Kriege
Wilfried N. Gansterer
GNN
AAML
33
0
0
02 Nov 2023
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via
  Discrete Diffusion
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
Lunjun Zhang
Yuwen Xiong
Ze Yang
Sergio Casas
Rui Hu
R. Urtasun
47
51
0
02 Nov 2023
Fully Quantized Always-on Face Detector Considering Mobile Image Sensors
Fully Quantized Always-on Face Detector Considering Mobile Image Sensors
Haechang Lee
Wongi Jeong
Dongil Ryu
Hyunwoo Je
Albert No
Kijeong Kim
Se Young Chun
CVBM
31
0
0
02 Nov 2023
Learn to Categorize or Categorize to Learn? Self-Coding for Generalized
  Category Discovery
Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery
Sarah Rastegar
Hazel Doughty
Cees G. M. Snoek
38
15
0
30 Oct 2023
Differentiable Learning of Generalized Structured Matrices for Efficient
  Deep Neural Networks
Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks
Changwoo Lee
Hun-Seok Kim
35
3
0
29 Oct 2023
Improving Compositional Generalization Using Iterated Learning and
  Simplicial Embeddings
Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings
Yi Ren
Samuel Lavoie
Mikhail Galkin
Danica J. Sutherland
Aaron Courville
41
15
0
28 Oct 2023
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical
  Volumetric Segmentation
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation
Haoran Shen
Yifu Zhang
Wenxuan Wang
Chen Chen
Jing Liu
Shanshan Song
Jiangyun Li
MedIm
27
0
0
28 Oct 2023
Scale-Adaptive Feature Aggregation for Efficient Space-Time Video
  Super-Resolution
Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution
Zhewei Huang
Ailin Huang
Xiaotao Hu
Chen Hu
Jun Xu
Shuchang Zhou
35
7
0
26 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural
  Networks
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
35
27
0
26 Oct 2023
Previous
123...8910...363738
Next