Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.08679
Cited By
Neural Network Compression Framework for fast model inference
20 February 2020
Alexander Kozlov
Ivan Lazarevich
Vasily Shamporov
N. Lyalyushkin
Yury Gorbachev
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Network Compression Framework for fast model inference"
7 / 7 papers shown
Title
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
44
5
0
31 May 2024
QFT: Post-training quantization via fast joint finetuning of all degrees of freedom
Alexander Finkelstein
Ella Fuchs
Idan Tal
Mark Grobman
Niv Vosco
Eldad Meller
MQ
24
6
0
05 Dec 2022
CheckINN: Wide Range Neural Network Verification in Imandra (Extended)
Remi Desmartin
Grant Passmore
Ekaterina Komendantskaya
M. Daggitt
26
5
0
21 Jul 2022
Anomalib: A Deep Learning Library for Anomaly Detection
S. Akçay
Dick Ameln
Ashwin Vaidya
B. Lakshmanan
Nilesh A. Ahuja
Ergin Utku Genc
30
108
0
16 Feb 2022
Enabling NAS with Automated Super-Network Generation
J. P. Muñoz
N. Lyalyushkin
Yash Akhauri
A. Senina
Alexander Kozlov
Nilesh Jain
22
17
0
20 Dec 2021
Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices
Md Mohaimenuzzaman
Christoph Bergmeir
I. West
B. Meyer
12
41
0
05 Mar 2021
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1