1308.3432
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
Papers citing
"Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"
50 / 1,869 papers shown
Title
PCGS: Progressive Compression of 3D Gaussian Splatting
Yihang Chen
Mengyao Li
Qianyi Wu
Weiyao Lin
Mehrtash Harandi
Jianfei Cai
3DGS
60
0
0
11 Mar 2025
Towards Experience Replay for Class-Incremental Learning in Fully-Binary Networks
Yanis Basso-Bert
Anca Molnos
Romain Lemaire
William Guicquero
Antoine Dupret
CLL
46
0
0
10 Mar 2025
MergeQuant: Accurate 4-bit Static Quantization of Large Language Models by Channel-wise Calibration
Jinguang Wang
Yufei Guo
Haifeng Sun
Tingting Yang
Zirui Zhuang
Wanyi Ning
Yuexi Yin
Q. Qi
Jianxin Liao
MQ
MoMe
51
0
0
07 Mar 2025
SMILENet: Unleashing Extra-Large Capacity Image Steganography via a Synergistic Mosaic InvertibLE Hiding Network
Jun-Jie Huang
Zihan Chen
Tianrui Liu
Wentao Zhao
Xin Deng
Xinwang Liu
Meng Wang
Pier Luigi Dragotti
48
0
0
07 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
72
0
0
06 Mar 2025
Boosting Offline Optimizers with Surrogate Sensitivity
Manh Cuong Dao
Phi Le Nguyen
Thao Nguyen Truong
Trong Nghia Hoang
OffRL
62
4
0
06 Mar 2025
10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection
Biqiao Xin
Qianchen Mao
Bingshu Wang
Jiangbin Zheng
Yong Zhao
C. L. P. Chen
MQ
64
0
0
04 Mar 2025
Deep Robust Reversible Watermarking
Jiale Chen
Wei Wang
Chongyang Shi
Li Dong
Yuanman Li
Xiping Hu
43
0
0
04 Mar 2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Xinbing Wang
Mingqi Jiang
Z. Ma
Ziyu Zhang
Shixuan Liu
...
Zhifei Li
Xie Chen
Lei Xie
Y. Guo
Wei Xue
84
13
0
03 Mar 2025
Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding
Hang Liu
Sangli Teng
Ben Liu
Wei Zhang
Maani Ghaffari
74
3
0
03 Mar 2025
Protein Structure Tokenization: Benchmarking and New Recipe
Xinyu Yuan
Zichen Wang
Marcus Collins
Huzefa Rangwala
41
0
0
28 Feb 2025
Oscillation-Reduced MXFP4 Training for Vision Transformers
Yuxiang Chen
Haocheng Xi
Jun Zhu
Jianfei Chen
MQ
62
2
0
28 Feb 2025
Vector-Quantized Vision Foundation Models for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCL
VLM
233
0
0
27 Feb 2025
Binary Neural Networks for Large Language Model: A Survey
Liangdong Liu
Zhitong Zheng
Cong Wang
TianHuang Su
ZhenYu Yang
MQ
67
0
0
26 Feb 2025
Iterative Counterfactual Data Augmentation
Mitchell Plyler
Min Chi
70
0
0
25 Feb 2025
Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation
Shiqi Jiang
Hui Yuan
Shuai Li
R. Hamzaoui
Xu Wang
Junyan Huo
58
0
0
24 Feb 2025
Optimizing Singular Spectrum for Large Language Model Compression
Dengjie Li
Tiancheng Shen
Yao Zhou
Baisong Yang
Zhongying Liu
Masheng Yang
Guohao Li
Yibo Yang
Yujie Zhong
Ming-Hsuan Yang
68
0
0
24 Feb 2025
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
Yu Liang
Wenjie Wei
A. Belatreche
Honglin Cao
Zijian Zhou
Shuai Wang
Malu Zhang
Yuqing Yang
MQ
66
0
0
21 Feb 2025
MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Jiayu Qin
Jianchao Tan
Kaipeng Zhang
Xunliang Cai
Wei Wang
45
0
0
19 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
44
0
0
18 Feb 2025
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Jacob Nielsen
Peter Schneider-Kamp
Lukas Galke
MQ
61
1
0
17 Feb 2025
ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources
Jason Wu
Kang Yang
Lance M. Kaplan
Mani B. Srivastava
36
0
0
11 Feb 2025
Membership Inference Risks in Quantized Models: A Theoretical and Empirical Study
Eric Aubinais
Philippe Formont
Pablo Piantanida
Elisabeth Gassiat
50
0
0
10 Feb 2025
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG
Francesco Stefano Carzaniga
Gary Tom Hoppeler
Michael Hersche
Kaspar Anton Schindler
Abbas Rahimi
51
0
0
10 Feb 2025
PrismAvatar: Real-time animated 3D neural head avatars on edge devices
Prashant Raina
Felix Taubner
Mathieu Tuli
Eu Wern Teh
Kevin Ferreira
3DH
69
1
0
10 Feb 2025
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning
Zhekai Du
Yinjie Min
Jingjing Li
Ke Lu
Changliang Zou
Liuhua Peng
Tingjin Chu
Mingming Gong
186
1
0
05 Feb 2025
Compact Rule-Based Classifier Learning via Gradient Descent
Javier Fumanal-Idocin
Raquel Fernandez-Peralta
Javier Andreu-Perez
62
0
0
03 Feb 2025
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro
Steven Abreu
Jonathan Timcheck
Philipp Stratmann
Andreas Wild
S. Shrestha
72
0
0
03 Feb 2025
Nearly Lossless Adaptive Bit Switching
Haiduo Huang
Zhenhua Liu
Tian Xia
Wenzhe Zhao
Pengju Ren
MQ
63
0
0
03 Feb 2025
Choose Your Model Size: Any Compression by a Single Gradient Descent
Martin Genzel
Patrick Putzky
Pengfei Zhao
Shri Kiran Srinivasan
Mattes Mollenhauer
Robert Seidel
Stefan Dietzel
Thomas Wollmann
41
0
0
03 Feb 2025
Optimizing Large Language Model Training Using FP4 Quantization
Ruizhe Wang
Yeyun Gong
Xiao Liu
Guoshuai Zhao
Ziyue Yang
Baining Guo
Zhengjun Zha
Peng Cheng
MQ
67
5
0
28 Jan 2025
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
79
0
0
28 Jan 2025
BILLNET: A Binarized Conv3D-LSTM Network with Logic-gated residual architecture for hardware-efficient video inference
Van Thien Nguyen
William Guicquero
Gilles Sicard
3DV
MQ
79
2
0
24 Jan 2025
Channel-wise Parallelizable Spiking Neuron with Multiplication-free Dynamics and Large Temporal Receptive Fields
Peng Xue
Wei Fang
Zhengyu Ma
Zihan Huang
Zhaokun Zhou
Yonghong Tian
T. Masquelier
Huihui Zhou
54
0
0
24 Jan 2025
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
38
0
0
22 Jan 2025
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Bingyi Liu
Jian Teng
Hongfei Xue
Enshu Wang
Chuanhui Zhu
Pu Wang
Libing Wu
85
0
0
21 Jan 2025
Sparse Binary Representation Learning for Knowledge Tracing
Yahya Badran
Christine Preisach
47
0
0
20 Jan 2025
MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
75
1
0
17 Jan 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
41
1
0
10 Jan 2025
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
D. Meger
85
0
0
03 Jan 2025
Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification
Changchang Sun
Ren Wang
Yihua Zhang
Jinghan Jia
Jiancheng Liu
Gaowen Liu
Sijia Liu
Yan Yan
AAML
MU
95
0
0
21 Dec 2024
Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart
Chengting Yu
Shu Yang
Fengzhao Zhang
Hanzhi Ma
Aili Wang
Er-ping Li
MQ
81
2
0
20 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Hongyu Chen
Zihan Wang
Xianrui Li
Xingchen Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
114
7
0
14 Dec 2024
Video Seal: Open and Efficient Video Watermarking
Pierre Fernandez
Hady ElSahar
I. Zeki Yalniz
Alexandre Mourachko
VLM
89
5
0
12 Dec 2024
ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models
Xubing Ye
Yukang Gan
Yixiao Ge
Xiao Zhang
Yansong Tang
101
7
0
30 Nov 2024
Deep End-to-end Adaptive k-Space Sampling, Reconstruction, and Registration for Dynamic MRI
George Yiasemis
J. Sonke
Jonas Teuwen
76
0
0
27 Nov 2024
Complexity Experts are Task-Discriminative Learners for Any Image Restoration
Eduard Zamfir
Zongwei Wu
Nancy Mehta
Yuedong Tan
Danda Pani Paudel
Yulun Zhang
Radu Timofte
MoE
205
1
0
27 Nov 2024
Noise Adaptor: Enhancing Low-Latency Spiking Neural Networks through Noise-Injected Low-Bit ANN Conversion
Chen Li
Bipin Rajendran
72
0
0
26 Nov 2024
Tree Transformers are an Ineffective Model of Syntactic Constituency
Michael Ginn
72
0
0
25 Nov 2024
Representation Collapsing Problems in Vector Quantization
Wenhao Zhao
Qiran Zou
Rushi Shah
Dianbo Liu
74
1
0
25 Nov 2024