2007.00864
Cited By
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
2 July 2020
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
Papers citing
"Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights"
14 / 14 papers shown
Title
Combining Neural Architecture Search and Automatic Code Optimization: A Survey
Inas Bachiri
Hadjer Benmeziane
Smail Niar
Riyadh Baghdadi
Hamza Ouarnoughi
Abdelkrime Aries
40
0
0
07 Aug 2024
STen: Productive and Efficient Sparsity in PyTorch
Andrei Ivanov
Nikoli Dryden
Tal Ben-Nun
Saleh Ashkboos
Torsten Hoefler
30
4
0
15 Apr 2023
Training Large Language Models Efficiently with Sparsity and Dataflow
V. Srinivasan
Darshan Gandhi
Urmish Thakker
R. Prabhakar
MoE
28
6
0
11 Apr 2023
Spatial Mixture-of-Experts
Nikoli Dryden
Torsten Hoefler
MoE
24
9
0
24 Nov 2022
Photonic Reconfigurable Accelerators for Efficient Inference of CNNs with Mixed-Sized Tensors
Sairam Sri Vatsavai
Ishan G. Thakkar
19
9
0
12 Jul 2022
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
19
117
0
29 May 2022
Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems
Shail Dave
Alberto Marchisio
Muhammad Abdullah Hanif
Amira Guesmi
Aviral Shrivastava
Ihsen Alouani
Muhammad Shafique
23
13
0
18 Apr 2022
Computing Graph Neural Networks: A Survey from Algorithms to Accelerators
S. Abadal
Akshay Jain
Robert Guirado
Jorge López-Alonso
Eduard Alarcón
GNN
25
225
0
30 Sep 2020
Benchmarking TinyML Systems: Challenges and Direction
Colby R. Banbury
Vijay Janapa Reddi
Max Lam
William Fu
A. Fazel
...
Jae-sun Seo
Jeff Sieracki
Urmish Thakker
Marian Verhelst
Poonam Yadav
98
228
0
10 Mar 2020
A Survey on Deep Learning in Medical Image Analysis
G. Litjens
Thijs Kooi
B. Bejnordi
A. Setio
F. Ciompi
Mohsen Ghafoorian
Jeroen van der Laak
Bram van Ginneken
C. I. Sánchez
OOD
280
10,613
0
19 Feb 2017
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
294
10,216
0
16 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks
Martin Engelcke
Dushyant Rao
Dominic Zeng Wang
Chi Hay Tong
Ingmar Posner
3DPC
192
521
0
21 Sep 2016
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Vijay Badrinarayanan
Alex Kendall
R. Cipolla
SSeg
440
15,637
0
02 Nov 2015