ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXivPDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,869 papers shown
Title
Efficient Ternary Weight Embedding Model: Bridging Scalability and
  Performance
Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance
Jiayi Chen
Chen Wu
S. Zhang
Nan Li
Lefei Zhang
Qi Zhang
74
0
0
23 Nov 2024
Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Amira Guesmi
B. Ouni
Muhammad Shafique
AAML
79
0
0
22 Nov 2024
Quantization without Tears
Quantization without Tears
Minghao Fu
Hao Yu
Jie Shao
Junjie Zhou
Ke Zhu
Jianxin Wu
MQ
64
1
0
21 Nov 2024
Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI
  Conversations
Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations
Igor Fedorov
Kate Plawiak
Lemeng Wu
Tarek Elgamal
Naveen Suda
...
Bilge Soran
Zacharie Delpierre Coudert
Rachad Alao
Raghuraman Krishnamoorthi
Vikas Chandra
80
4
0
18 Nov 2024
Bi-Mamba: Towards Accurate 1-Bit State Space Models
Shengkun Tang
Liqun Ma
Yiming Li
Mingjie Sun
Zhiqiang Shen
Mamba
78
3
0
18 Nov 2024
Complexity-Aware Training of Deep Neural Networks for Optimal Structure
  Discovery
Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery
Valentin Frank Ingmar Guenter
Athanasios Sideris
CVBM
23
0
0
14 Nov 2024
GFT: Graph Foundation Model with Transferable Tree Vocabulary
GFT: Graph Foundation Model with Transferable Tree Vocabulary
Zehong Wang
Zheyuan Zhang
Nitesh V. Chawla
Chuxu Zhang
Yanfang Ye
46
10
0
09 Nov 2024
When are 1.58 bits enough? A Bottom-up Exploration of BitNet
  Quantization
When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization
Jacob Nielsen
Lukas Galke
Peter Schneider-Kamp
MQ
32
1
0
08 Nov 2024
Poor Man's Training on MCUs: A Memory-Efficient Quantized
  Back-Propagation-Free Approach
Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach
Yequan Zhao
Hai Li
Ian Young
Zheng-Wei Zhang
MQ
39
2
0
07 Nov 2024
Finding Strong Lottery Ticket Networks with Genetic Algorithms
Finding Strong Lottery Ticket Networks with Genetic Algorithms
Philipp Altmann
Julian Schonberger
Maximilian Zorn
Thomas Gabor
26
1
0
07 Nov 2024
Image Understanding Makes for A Good Tokenizer for Image Generation
Image Understanding Makes for A Good Tokenizer for Image Generation
Luting Wang
Yang Zhao
Zijian Zhang
Jiashi Feng
Si Liu
Bingyi Kang
VLM
41
4
0
07 Nov 2024
Neuromorphic Wireless Split Computing with Multi-Level Spikes
Neuromorphic Wireless Split Computing with Multi-Level Spikes
Dengyu Wu
Jiechen Chen
Bipin Rajendran
H. Vincent Poor
Osvaldo Simeone
49
1
0
07 Nov 2024
CPIG: Leveraging Consistency Policy with Intention Guidance for
  Multi-agent Exploration
CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration
Y. Fu
Yuanheng Zhu
Haoran Li
Zijie Zhao
Jiajun Chai
Dongbin Zhao
42
0
0
06 Nov 2024
The Differentiable Feasibility Pump
The Differentiable Feasibility Pump
M. Cacciola
Alexandre Forel
A. Frangioni
Andrea Lodi
36
0
0
05 Nov 2024
Addressing Representation Collapse in Vector Quantized Models with One
  Linear Layer
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Yongxin Zhu
B. Li
Yifei Xin
Linli Xu
41
10
0
04 Nov 2024
Learning Where to Edit Vision Transformers
Learning Where to Edit Vision Transformers
Yunqiao Yang
Long-Kai Huang
Shengzhuang Chen
Kede Ma
Ying Wei
KELM
40
1
0
04 Nov 2024
Bootstrapping Top-down Information for Self-modulating Slot Attention
Bootstrapping Top-down Information for Self-modulating Slot Attention
Dongwon Kim
Seoyeon Kim
Suha Kwak
OCL
ObjD
35
0
0
04 Nov 2024
Optimizing Contextual Speech Recognition Using Vector Quantization for
  Efficient Retrieval
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
49
0
0
01 Nov 2024
HoloChrome: Polychromatic Illumination for Speckle Reduction in
  Holographic Near-Eye Displays
HoloChrome: Polychromatic Illumination for Speckle Reduction in Holographic Near-Eye Displays
Florian Schiffers
Grace Kuo
N. Matsuda
Douglas Lanman
O. Cossairt
36
2
0
31 Oct 2024
ELMGS: Enhancing memory and computation scaLability through coMpression
  for 3D Gaussian Splatting
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting
Muhammad Salman Ali
Sung-Ho Bae
Enzo Tartaglione
3DGS
45
7
0
30 Oct 2024
SimSiam Naming Game: A Unified Approach for Representation Learning and
  Emergent Communication
SimSiam Naming Game: A Unified Approach for Representation Learning and Emergent Communication
Nguyen Le Hoang
T. Taniguchi
Fang Tianwei
Akira Taniguchi
36
1
0
29 Oct 2024
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
Hanyu Wang
Saksham Suri
Yixuan Ren
Hao Chen
Abhinav Shrivastava
VGen
31
10
0
28 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
40
2
0
28 Oct 2024
Vector Quantization Prompting for Continual Learning
Vector Quantization Prompting for Continual Learning
L. Jiao
Qiuxia Lai
Yu LI
Qiang Xu
VLM
CLL
41
3
0
27 Oct 2024
Content-Aware Radiance Fields: Aligning Model Complexity with Scene
  Intricacy Through Learned Bitwidth Quantization
Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization
Wei Liu
Xue Xian Zheng
Jingyi Yu
Xin Lou
MQ
34
0
0
25 Oct 2024
Spatial-Temporal Search for Spiking Neural Networks
Spatial-Temporal Search for Spiking Neural Networks
Kaiwei Che
Zhaokun Zhou
Li-xin Yuan
Jianguo Zhang
Yonghong Tian
Luziwei Leng
32
0
0
24 Oct 2024
Taipan: Efficient and Expressive State Space Language Models with
  Selective Attention
Taipan: Efficient and Expressive State Space Language Models with Selective Attention
Chien Van Nguyen
Huy Huu Nguyen
Thang M. Pham
Ruiyi Zhang
Hanieh Deilamsalehy
...
Ryan A. Rossi
Trung Bui
Viet Dac Lai
Franck Dernoncourt
Thien Huu Nguyen
Mamba
RALM
37
1
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
10
0
24 Oct 2024
Lossless KV Cache Compression to 2%
Lossless KV Cache Compression to 2%
Zhen Yang
Jizong Han
Kan Wu
Ruobing Xie
An Wang
Xingchen Sun
Zhanhui Kang
VLM
MQ
36
2
0
20 Oct 2024
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs
Zifei Xu
Sayeh Sharify
W. Yazar
T. Webb
Xin Wang
MQ
43
0
0
18 Oct 2024
A Complexity-Based Theory of Compositionality
A Complexity-Based Theory of Compositionality
Eric Elmoznino
Thomas Jiralerspong
Yoshua Bengio
Guillaume Lajoie
CoGe
64
4
0
18 Oct 2024
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
Shwai He
Tao Ge
Guoheng Sun
Bowei Tian
Xiaoyang Wang
Ang Li
MoE
54
1
0
17 Oct 2024
End-to-end Planner Training for Language Modeling
End-to-end Planner Training for Language Modeling
Nathan Cornille
Florian Mai
Jingyuan Sun
Marie-Francine Moens
28
0
0
16 Oct 2024
DISP-LLM: Dimension-Independent Structural Pruning for Large Language
  Models
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
Shangqian Gao
Chi-Heng Lin
Ting Hua
Tang Zheng
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
30
3
0
15 Oct 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
31
13
0
15 Oct 2024
Advancing Training Efficiency of Deep Spiking Neural Networks through
  Rate-based Backpropagation
Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation
Chengting Yu
Lei Liu
Gaoang Wang
Erping Li
Aili Wang
26
1
0
15 Oct 2024
A CLIP-Powered Framework for Robust and Generalizable Data Selection
A CLIP-Powered Framework for Robust and Generalizable Data Selection
Steve Yang
Peng Ye
Wanli Ouyang
Dongzhan Zhou
Furao Shen
29
1
0
15 Oct 2024
Learning to Optimize for Mixed-Integer Non-linear Programming
Learning to Optimize for Mixed-Integer Non-linear Programming
Bo Tang
Elias Boutros Khalil
Ján Drgoňa
35
2
0
14 Oct 2024
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete
  Latent Space
LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
Shunsuke Sakai
Tatushito Hasegawa
Makoto Koshino
25
1
0
14 Oct 2024
QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation
QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation
Gahyun Yoo
Jay Yoon Lee
29
0
0
14 Oct 2024
Gaussian Mixture Vector Quantization with Aggregated Categorical
  Posterior
Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior
Mingyuan Yan
Jiawei Wu
Rushi Shah
Dianbo Liu
23
0
0
14 Oct 2024
Differentiable Weightless Neural Networks
Differentiable Weightless Neural Networks
Alan T. L. Bacellar
Zachary Susskind
Mauricio Breternitz Jr.
E. John
L. John
P. Lima
F. M. G. França
30
3
0
14 Oct 2024
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation
Dingdong Yang
Yizhi Wang
Konrad Schindler
Ali Mahdavi Amiri
Hao Zhang
50
1
0
13 Oct 2024
MoIN: Mixture of Introvert Experts to Upcycle an LLM
MoIN: Mixture of Introvert Experts to Upcycle an LLM
Ajinkya Tejankar
K. Navaneet
Ujjawal Panchal
Kossar Pourahmadi
Hamed Pirsiavash
MoE
29
0
0
13 Oct 2024
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings
Di Wu
Siyuan Li
Chen Feng
Lu Cao
Yuyao Zhang
Jie Yang
Mohamad Sawan
33
0
0
13 Oct 2024
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent
  Reinforcement Learning
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning
Xinran Li
Ling Pan
Jun Zhang
20
1
0
11 Oct 2024
Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale
  Models with Structured Pruning in Resource-Limited Clients
Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients
Yan Li
Mingyi Li
Xiao Zhang
Guangwei Xu
Feng Chen
Yuan Yuan
Yifei Zou
Mengying Zhao
Jianbo Lu
Dongxiao Yu
32
0
0
11 Oct 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang
Ivana Dusparic
Yucheng Shi
Ke Zhang
Vinny Cahill
Mamba
173
0
0
11 Oct 2024
Fast Feedforward 3D Gaussian Splatting Compression
Fast Feedforward 3D Gaussian Splatting Compression
Yihang Chen
Qianyi Wu
Mengyao Li
Weiyao Lin
Mehrtash Harandi
Jianfei Cai
3DGS
48
6
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Previous
123456...363738
Next