Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.01396
Cited By
v1
v2 (latest)
DeepReDuce: ReLU Reduction for Fast Private Inference
International Conference on Machine Learning (ICML), 2021
2 March 2021
N. Jha
Zahra Ghodsi
S. Garg
Brandon Reagen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DeepReDuce: ReLU Reduction for Fast Private Inference"
50 / 54 papers shown
Title
CrypTorch: PyTorch-based Auto-tuning Compiler for Machine Learning with Multi-party Computation
Jinyu Liu
Gang Tan
Kiwan Maeng
68
0
0
24 Nov 2025
Coordinate Descent for Network Linearization
Vlad Rakhlin
Amir Jevnisek
S. Avidan
80
0
0
14 Nov 2025
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan
Husheng Han
Shangyi Shi
Tenghui Hua
Hang Lu
Xiaowei Li
Jianan Mu
Xing Hu
GNN
602
0
0
12 Jun 2025
Flash: A Hybrid Private Inference Protocol for Deep CNNs with High Accuracy and Low Latency on CPU
H. Roh
Jinsu Yeo
Yeongil Ko
Gu-Yeon Wei
David Brooks
Woo-Seok Choi
371
2
0
20 Jan 2025
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models
N. Jha
Brandon Reagen
OffRL
AI4CE
363
3
0
12 Oct 2024
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
International Conference on Computer Aided Design (ICCAD), 2024
Tianshi Xu
Shuzhang Zhong
Wenxuan Zeng
Runsheng Wang
Meng Li
MQ
183
3
0
12 Oct 2024
DCT-CryptoNets: Scaling Private Inference in the Frequency Domain
International Conference on Learning Representations (ICLR), 2024
Arjun Roy
Kaushik Roy
954
3
0
27 Aug 2024
MPC-Minimized Secure LLM Inference
Deevashwer Rathee
Dacheng Li
Ion Stoica
Hao Zhang
Raluca A. Popa
264
8
0
07 Aug 2024
The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
Victor Quétu
Zhu Liao
Enzo Tartaglione
328
4
0
27 Apr 2024
EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization
Wenxuan Zeng
Tianshi Xu
Meng Li
Runsheng Wang
MQ
215
0
0
15 Apr 2024
Accurate Low-Degree Polynomial Approximation of Non-polynomial Operators for Fast Private Inference in Homomorphic Encryption
Conference on Machine Learning and Systems (MLSys), 2024
Jianming Tong
Jing Dang
Anupam Golder
Callie Hao
A. Raychowdhury
Tushar Krishna
240
8
0
04 Apr 2024
xMLP: Revolutionizing Private Inference with Exclusive Square Activation
Jiajie Li
Jinjun Xiong
183
1
0
12 Mar 2024
Privacy-Preserving Diffusion Model Using Homomorphic Encryption
Yaojian Chen
Qiben Yan
209
9
0
09 Mar 2024
Neural Networks with (Low-Precision) Polynomial Approximations: New Insights and Techniques for Accuracy Improvement
Chi Zhang
Jingjing Fan
Man Ho Au
Siu-Ming Yiu
206
1
0
17 Feb 2024
Linearizing Models for Efficient yet Robust Private Inference
Sreetama Sarkar
Souvik Kundu
Peter A. Beerel
AAML
139
0
0
08 Feb 2024
Disparate Impact on Group Accuracy of Linearization for Private Inference
International Conference on Machine Learning (ICML), 2024
Saswat Das
Marco Romanelli
Ferdinando Fioretto
FedML
207
4
0
06 Feb 2024
HEQuant: Marrying Homomorphic Encryption and Quantization for Communication-Efficient Private Inference
Tianshi Xu
Meng Li
Runsheng Wang
230
2
0
29 Jan 2024
Regularized PolyKervNets: Optimizing Expressiveness and Efficiency for Private Inference in Deep Neural Networks
Toluwani Aremu
163
0
0
23 Dec 2023
LayerCollapse: Adaptive compression of neural networks
Soheil Zibakhsh Shabgahi
Mohammad Soheil Shariff
F. Koushanfar
AI4CE
202
1
0
29 Nov 2023
CompactTag: Minimizing Computation Overheads in Actively-Secure MPC for Deep Neural Networks
Yongqin Wang
Pratik Sarkar
Nishat Koti
A. Patra
Murali Annavaram
229
2
0
08 Nov 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference
Neural Information Processing Systems (NeurIPS), 2023
Wenxuan Zeng
Meng Li
Haichuan Yang
Wen-jie Lu
Runsheng Wang
Ru Huang
189
13
0
03 Nov 2023
Optimized Layerwise Approximation for Efficient Private Inference on Fully Homomorphic Encryption
Junghyun Lee
Eunsang Lee
Young-Sik Kim
Yongwoo Lee
Joon-Woo Lee
Yongjune Kim
Jong-Seon No
251
3
0
16 Oct 2023
AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE
IACR Cryptology ePrint Archive (IACR ePrint), 2023
Wei Ao
Vishnu Boddeti
AAML
171
33
0
12 Oct 2023
PriViT: Vision Transformers for Fast Private Inference
Naren Dhyani
Jianqiao Mo
Minsu Cho
Ameya Joshi
Siddharth Garg
Brandon Reagen
Chinmay Hegde
148
8
0
06 Oct 2023
LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference
Neural Information Processing Systems (NeurIPS), 2023
Hongwu Peng
Ran Ran
Yukui Luo
Jiahui Zhao
Shaoyi Huang
...
Tong Geng
Chenghong Wang
Xiaolin Xu
Wujie Wen
Caiwen Ding
318
45
0
25 Sep 2023
Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference
Kiwan Maeng
G. E. Suh
181
4
0
09 Sep 2023
AutoReP: Automatic ReLU Replacement for Fast Private Network Inference
IEEE International Conference on Computer Vision (ICCV), 2023
Hongwu Peng
Shaoyi Huang
Tong Zhou
Yukui Luo
Chenghong Wang
...
Tony Geng
Kaleel Mahmood
Wujie Wen
Xiaolin Xu
Caiwen Ding
OffRL
277
43
0
20 Aug 2023
Privacy Preserving In-memory Computing Engine
Haoran Geng
Jianqiao Mo
D. Reis
Jonathan Takeshita
Taeho Jung
Brandon Reagen
Michael Niemier
Xiyang Hu
235
1
0
04 Aug 2023
Towards Fast and Scalable Private Inference
ACM International Conference on Computing Frontiers (CF), 2023
Jianqiao Mo
Karthik Garimella
Negar Neda
Austin Ebel
Brandon Reagen
150
5
0
09 Jul 2023
PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment
Design Automation Conference (DAC), 2023
Hongwu Peng
Shangli Zhou
Yukui Luo
Nuo Xu
Shijin Duan
...
Chenghong Wang
Tong Geng
Wujie Wen
Xiaolin Xu
Caiwen Ding
189
9
0
27 Jun 2023
NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants
Design Automation Conference (DAC), 2023
Zhongzhi Yu
Y. Fu
Jiayi Yuan
Haoran You
Yingyan Lin
189
2
0
23 Jun 2023
Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions
USENIX Security Symposium (USENIX Security), 2023
Abdulrahman Diaa
L. Fenaux
Thomas Humphries
Marian Dietz
Faezeh Ebrahimianghazani
...
Nils Lukas
Rasoul Akhavan Mahdavi
Simon Oya
Ehsan Amjadian
Florian Kerschbaum
PICV
214
11
0
14 Jun 2023
Training Large Scale Polynomial CNNs for E2E Inference over Homomorphic Encryption
Moran Baruch
Nir Drucker
Gilad Ezov
Yoav Goldberg
Eyal Kushnir
Jenny Lerner
Omri Soceanu
Itamar Zimerman
275
7
0
26 Apr 2023
Making Models Shallow Again: Jointly Learning to Reduce Non-Linearity and Depth for Latency-Efficient Private Inference
Souvik Kundu
Yuke Zhang
Dake Chen
Peter A. Beerel
3DV
156
16
0
26 Apr 2023
DeepReShape: Redesigning Neural Networks for Efficient Private Inference
N. Jha
Brandon Reagen
346
15
0
20 Apr 2023
Securing Neural Networks with Knapsack Optimization
Yakir Gorski
Amir Jevnisek
S. Avidan
AAML
116
1
0
20 Apr 2023
RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference
Hongwu Peng
Shangli Zhou
Yukui Luo
Nuo Xu
Shijin Duan
...
Chenghong Wang
Tong Geng
Wujie Wen
Xiaolin Xu
Caiwen Ding
208
18
0
05 Feb 2023
Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference
International Conference on Learning Representations (ICLR), 2023
Souvik Kundu
Shun Lu
Yuke Zhang
Jacqueline Liu
Peter A. Beerel
146
36
0
23 Jan 2023
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
IEEE International Conference on Computer Vision (ICCV), 2022
Wenyuan Zeng
Meng Li
Wenjie Xiong
Tong Tong
Wen-jie Lu
Jin Tan
Runsheng Wang
Ru Huang
313
33
0
25 Nov 2022
Private and Reliable Neural Network Inference
Conference on Computer and Communications Security (CCS), 2022
Nikola Jovanović
Marc Fischer
Samuel Steffen
Martin Vechev
185
18
0
27 Oct 2022
Scaling up Trustless DNN Inference with Zero-Knowledge Proofs
Daniel Kang
Tatsunori Hashimoto
Ion Stoica
Yi Sun
LRM
157
59
0
17 Oct 2022
MPC-Pipe: an Efficient Pipeline Scheme for Secure Multi-party Machine Learning Inference
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
Yongqin Wang
Rachit Rajat
Murali Annavaram
165
6
0
27 Sep 2022
CryptoGCN: Fast and Scalable Homomorphically Encrypted Graph Convolutional Network Inference
Neural Information Processing Systems (NeurIPS), 2022
Ran Ran
Nuo Xu
Wei Wang
Quan Gang
Jieming Yin
Wujie Wen
GNN
215
33
0
24 Sep 2022
PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference
Hongwu Peng
Shangli Zhou
Yukui Luo
Shijin Duan
Nuo Xu
...
Tong Geng
Ang Li
Wujie Wen
Xiaolin Xu
Caiwen Ding
153
4
0
20 Sep 2022
Efficient ML Models for Practical Secure Inference
Vinod Ganesan
Anwesh Bhattacharya
Pratyush Kumar
Divya Gupta
Rahul Sharma
Nishanth Chandran
MedIm
269
5
0
26 Aug 2022
Characterizing and Optimizing End-to-End Systems for Private Inference
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
Karthik Garimella
Zahra Ghodsi
N. Jha
S. Garg
Brandon Reagen
200
29
0
14 Jul 2022
Tabula: Efficiently Computing Nonlinear Activation Functions for Secure Neural Network Inference
Maximilian Lam
Michael Mitzenmacher
Vijay Janapa Reddi
Gu-Yeon Wei
David Brooks
197
4
0
05 Mar 2022
Selective Network Linearization for Efficient Private Inference
International Conference on Machine Learning (ICML), 2022
Minsu Cho
Ameya Joshi
S. Garg
Brandon Reagen
Chinmay Hegde
221
50
0
04 Feb 2022
AESPA: Accuracy Preserving Low-degree Polynomial Activation for Fast Private Inference
J. Park
M. Kim
Wonkyung Jung
Jung Ho Ahn
LLMSV
185
41
0
18 Jan 2022
CryptoNite: Revealing the Pitfalls of End-to-End Private Inference at Scale
Karthik Garimella
N. Jha
Zahra Ghodsi
S. Garg
Brandon Reagen
239
4
0
04 Nov 2021
1
2
Next