Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.05279
Cited By
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
16 March 2016
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks"
50 / 564 papers shown
Title
PROM: Prioritize Reduction of Multiplications Over Lower Bit-Widths for Efficient CNNs
Lukas Meiner
Jens Mehnert
A. P. Condurache
MQ
42
0
0
06 May 2025
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks
Quynh Nguyen Phuong Vu
Luciano S. Martinez-Rau
Yuxuan Zhang
Nho-Duc Tran
Bengt Oelmann
Michele Magno
Sebastian Bader
CLL
38
0
0
05 May 2025
Practical Boolean Backpropagation
Simon Golbert
25
0
0
01 May 2025
Optimizing Deep Neural Networks using Safety-Guided Self Compression
Mohammad Zbeeb
Mariam Salman
Mohammad Bazzi
Ammar Mohanna
28
0
0
01 May 2025
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
34
0
0
23 Apr 2025
Breaking the Limits of Quantization-Aware Defenses: QADT-R for Robustness Against Patch-Based Adversarial Attacks in QNNs
Amira Guesmi
B. Ouni
Muhammad Shafique
MQ
AAML
36
0
0
10 Mar 2025
10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection
Biqiao Xin
Qianchen Mao
Bingshu Wang
Jiangbin Zheng
Yong Zhao
C. L. P. Chen
MQ
64
0
0
04 Mar 2025
Cauchy-Schwarz Regularizers
Sueda Taner
Ziyi Wang
Christoph Studer
41
0
0
03 Mar 2025
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
Yu Liang
Wenjie Wei
A. Belatreche
Honglin Cao
Zijian Zhou
Shuai Wang
Malu Zhang
Y. Yang
MQ
63
0
0
21 Feb 2025
On Space Folds of ReLU Neural Networks
Michal Lewandowski
Hamid Eghbalzadeh
Bernhard Heinzl
Raphael Pisoni
Bernhard A.Moser
MLT
75
1
0
17 Feb 2025
Progressive Binarization with Semi-Structured Pruning for LLMs
X. Yan
Tianao Zhang
Zhiteng Li
Yulun Zhang
MQ
54
0
0
03 Feb 2025
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
Zhen Li
Yupeng Su
Runming Yang
C. Xie
Z. Wang
Zhongwei Xie
Ngai Wong
Hongxia Yang
MQ
LRM
44
3
0
06 Jan 2025
Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation
Tse-Wei Chen
Wei Tao
Dongyue Zhao
Kazuhiro Mima
Tadayuki Ito
Kinya Osa
Masami Kato
MQ
31
0
0
03 Jan 2025
Threshold Neuron: A Brain-inspired Artificial Neuron for Efficient On-device Inference
Zihao Zheng
Yuanchun Li
Jiayu Chen
Peng Zhou
Xiang Chen
Yunxin Liu
75
0
0
18 Dec 2024
Your Data Is Not Perfect: Towards Cross-Domain Out-of-Distribution Detection in Class-Imbalanced Data
Xiang Fang
Arvind Easwaran
B. Genest
Ponnuthurai Nagaratnam Suganthan
83
14
0
09 Dec 2024
Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Amira Guesmi
B. Ouni
Muhammad Shafique
AAML
74
0
0
22 Nov 2024
AdaShadow: Responsive Test-time Model Adaptation in Non-stationary Mobile Environments
Cheng Fang
Sicong Liu
Zimu Zhou
Bin Guo
Jiaqi Tang
Ke Ma
Zhiwen Yu
TTA
31
1
0
10 Oct 2024
Mixture Compressor for Mixture-of-Experts LLMs Gains More
Wei Huang
Yue Liao
Jianhui Liu
Ruifei He
Haoru Tan
Shiming Zhang
Hongsheng Li
Si Liu
Xiaojuan Qi
MoE
39
3
0
08 Oct 2024
ARB-LLM: Alternating Refined Binarizations for Large Language Models
Zhiteng Li
X. Yan
Tianao Zhang
Haotong Qin
Dong Xie
Jiang Tian
Zhongchao Shi
Linghe Kong
Yulun Zhang
Xiaokang Yang
MQ
29
2
0
04 Oct 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
48
1
0
29 Jul 2024
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang
Ruihao Gong
Xianglong Liu
Jing Liu
Yuhang Li
Jiwen Lu
Dacheng Tao
DiffM
MQ
49
0
0
28 Jul 2024
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
Dinh Q. Phung
Gustavo Carneiro
Thanh-Toan Do
MQ
40
0
0
20 Jul 2024
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
38
1
0
15 Jul 2024
xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems
Georg Rutishauser
Joan Mihali
Moritz Scherer
Luca Benini
26
1
0
29 May 2024
BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network
Zongkai Zhang
Zidong Xu
Wenming Yang
Qingmin Liao
Jing-Hao Xue
MQ
3DV
46
1
0
27 May 2024
Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization
Jianting Yang
Srecko Ðurasinovic
Jean B. Lasserre
Victor Magron
Jun Zhao
AAML
39
1
0
27 May 2024
BOLD: Boolean Logic Deep Learning
Van Minh Nguyen
Cristian Ocampo
Aymen Askri
Louis Leconte
Ba-Hien Tran
AI4CE
37
0
0
25 May 2024
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
45
2
0
23 May 2024
Designed Dithering Sign Activation for Binary Neural Networks
Brayan Monroy
Juan Estupiñán
T. Gelvez-Barrera
Jorge Bacca
Henry Arguello
MQ
35
1
0
03 May 2024
Binarized Low-light Raw Video Enhancement
Gengchen Zhang
Yulun Zhang
Xin Yuan
Ying Fu
MQ
32
3
0
29 Mar 2024
PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off
Sachit Kuhar
Yash Jain
Alexey Tumanov
MQ
54
0
0
04 Dec 2023
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
Yushi Huang
Ruihao Gong
Jing Liu
Tianlong Chen
Xianglong Liu
DiffM
MQ
22
37
0
27 Nov 2023
Automated Heterogeneous Low-Bit Quantization of Multi-Model Deep Learning Inference Pipeline
Jayeeta Mondal
Swarnava Dey
Arijit Mukherjee
MQ
23
1
0
10 Nov 2023
Adaptive Compression-Aware Split Learning and Inference for Enhanced Network Efficiency
Akrit Mudvari
Antero Vainio
Iason Ofeidis
Sasu Tarkoma
Leandros Tassiulas
24
3
0
09 Nov 2023
NOLA: Compressing LoRA using Linear Combination of Random Basis
Soroush Abbasi Koohpayegani
K. Navaneet
Parsa Nooralinejad
Soheil Kolouri
Hamed Pirsiavash
40
12
0
04 Oct 2023
eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models
Minsik Cho
Keivan Alizadeh Vahid
Qichen Fu
Saurabh N. Adya
C. C. D. Mundo
Mohammad Rastegari
Devang Naik
Peter Zatloukal
MQ
21
6
0
02 Sep 2023
Maestro: Uncovering Low-Rank Structures via Trainable Decomposition
Samuel Horváth
Stefanos Laskaridis
Shashank Rajput
Hongyi Wang
BDL
32
4
0
28 Aug 2023
Quantized Feature Distillation for Network Quantization
Kevin Zhu
Yin He
Jianxin Wu
MQ
29
9
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Muhammad Shafique
K. Pekmestzi
Dimitrios Soudris
29
3
0
20 Jul 2023
Injecting Logical Constraints into Neural Networks via Straight-Through Estimators
Zhun Yang
Joohyung Lee
Chi-youn Park
22
18
0
10 Jul 2023
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
22
0
0
04 Jul 2023
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen
Shipeng Bai
Tianxin Huang
Mengmeng Wang
Guanzhong Tian
Y. Liu
MQ
34
18
0
02 Jul 2023
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
Ahmed F. AbouElhamayed
Angela Cui
Javier Fernandez-Marques
Nicholas D. Lane
Mohamed S. Abdelfattah
MQ
23
4
0
25 May 2023
Evaluation Metrics for DNNs Compression
Abanoub Ghobrial
S. Budgett
Dieter Balemans
Hamid Asgari
Philippe Reiter
Kerstin Eder
27
1
0
18 May 2023
Photonic Advantage of Optical Encoders
Luocheng Huang
Quentin A. A. Tanguy
Johannes E. Froch
Saswata Mukherjee
K. Böhringer
A. Majumdar
29
21
0
02 May 2023
CORSD: Class-Oriented Relational Self Distillation
Muzhou Yu
S. Tan
Kailu Wu
Runpei Dong
Linfeng Zhang
Kaisheng Ma
24
0
0
28 Apr 2023
Big-Little Adaptive Neural Networks on Low-Power Near-Subthreshold Processors
Zichao Shen
Neil Howard
J. Núñez-Yáñez
18
2
0
19 Apr 2023
A priori compression of convolutional neural networks for wave simulators
Hamza Boukraichi
N. Akkari
F. Casenave
David Ryckelynck
18
2
0
11 Apr 2023
Training Neural Networks for Execution on Approximate Hardware
Tianmu Li
Shurui Li
Puneet Gupta
27
1
0
08 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
27
0
0
07 Apr 2023
1
2
3
4
...
10
11
12
Next