Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10159
Cited By
Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors
15 June 2020
C. Coelho
Aki Kuusela
Shane Li
Zhuang Hao
T. Aarrestad
Vladimir Loncar
J. Ngadiuba
M. Pierini
Adrian Alan Pol
S. Summers
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors"
50 / 53 papers shown
Title
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
48
0
0
04 Apr 2025
Real-time Anomaly Detection at the L1 Trigger of CMS Experiment
Abhijith Gandrakota
62
0
0
29 Nov 2024
Low Latency Transformer Inference on FPGAs for Physics Applications with hls4ml
Zhixing Jiang
Dennis Yin
Yihui Chen
Elham E Khoda
Scott Hauck
Shih-Chieh Hsu
E. Govorkova
Philip C. Harris
Vladimir Loncar
Eric A. Moreno
AI4CE
25
1
0
08 Sep 2024
Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence
Shubha R. Kharel
Prashansa Mukim
Piotr Maj
Grzegorz W. Deptuch
Shinjae Yoo
Yihui Ren
Soumyajit Mandal
36
0
0
18 Jul 2024
Reliable edge machine learning hardware for scientific applications
Tommaso Baldi
Javier Campos
B. Hawks
J. Ngadiuba
Nhan Tran
...
Michael W. Mahoney
Vladimir Loncar
Philip C. Harris
Joshua C. Agar
Shuyu Qin
35
0
0
27 Jun 2024
PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs
Binglei Lou
Richard Rademacher
David Boland
Philip H. W. Leong
31
4
0
07 Jun 2024
Investigating Resource-efficient Neutron/Gamma Classification ML Models Targeting eFPGAs
Jyothisraj Johnson
B. Boxer
Tarun Prakash
Carl Grace
Peter Sorensen
Mani Tripathi
25
1
0
19 Apr 2024
PikeLPN: Mitigating Overlooked Inefficiencies of Low-Precision Neural Networks
Marina Neseem
Conor McCullough
Randy Hsin
Chas Leichner
Shan Li
...
Andrew G. Howard
Lukasz Lew
Sherief Reda
Ville Rautio
Daniele Moro
MQ
42
0
0
29 Mar 2024
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions
Marta Andronic
G. Constantinides
19
5
0
29 Feb 2024
Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale
Dan Zhao
S. Samsi
Joseph McDonald
Baolin Li
David Bestor
Michael Jones
Devesh Tiwari
V. Gadepally
32
17
0
25 Feb 2024
A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data Transmission
Wenjun Huang
Arghavan Rezvani
Hanning Chen
Yang Ni
Sanggeon Yun
Sungheon Jeong
Mohsen Imani
11
7
0
03 Feb 2024
Ultrafast jet classification on FPGAs for the HL-LHC
Patrick Odagiu
Zhiqiang Que
Javier Mauricio Duarte
J. Haller
Gregor Kasieczka
...
Arpita Seksaria
S. Summers
A. Sznajder
A. Tapper
Thea Klæboe Årrestad
24
3
0
02 Feb 2024
Exploring Prime Number Classification: Achieving High Recall Rate and Rapid Convergence with Sparse Encoding
Serin Lee
S. Kim
11
0
0
30 Jan 2024
Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge
Yao Lu
Hiram Rayo Torres Rodriguez
Sebastian Vogel
Nick Van De Waterlaat
P. Jancura
MQ
17
1
0
22 Jan 2024
SymbolNet: Neural Symbolic Regression with Adaptive Dynamic Pruning for Compression
Ho Fung Tsoi
Vladimir Loncar
S. Dasu
Philip C. Harris
29
3
0
18 Jan 2024
Neural Architecture Codesign for Fast Bragg Peak Analysis
Luke McDermott
Jason Weitz
Dmitri Demler
Daniel Cummings
N. Tran
Javier Mauricio Duarte
MQ
17
0
0
10 Dec 2023
LinguaLinked: A Distributed Large Language Model Inference System for Mobile Devices
Junchen Zhao
Yurun Song
Simeng Liu
Ian G. Harris
S. Jyothi
16
5
0
01 Dec 2023
Automated Heterogeneous Low-Bit Quantization of Multi-Model Deep Learning Inference Pipeline
Jayeeta Mondal
Swarnava Dey
Arijit Mukherjee
MQ
13
1
0
10 Nov 2023
PolyLUT: Learning Piecewise Polynomials for Ultra-Low Latency FPGA LUT-based Inference
Marta Andronic
G. Constantinides
20
17
0
05 Sep 2023
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks
Benjamin Ramhorst
Vladimir Loncar
G. Constantinides
17
4
0
09 Aug 2023
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning Acceleration
Zhiqiang Que
Shuo Liu
Markus Rognlien
Ce Guo
J. G. Coutinho
Wayne Luk
10
4
0
14 Jun 2023
Differentiable Earth Mover's Distance for Data Compression at the High-Luminosity LHC
Rohan Shenoy
Javier Mauricio Duarte
C. Herwig
J. Hirschauer
D. Noonan
M. Pierini
Nhan Tran
C. Suarez
23
1
0
07 Jun 2023
Symbolic Regression on FPGAs for Fast Machine Learning Inference
Ho Fung Tsoi
Adrian Alan Pol
Vladimir Loncar
E. Govorkova
M. Cranmer
S. Dasu
P. Elmer
Philip C. Harris
I. Ojalvo
M. Pierini
11
7
0
06 May 2023
Improving Robustness Against Adversarial Attacks with Deeply Quantized Neural Networks
Ferheen Ayaz
Idris Zakariyya
José Cano
S. Keoh
Jeremy Singer
D. Pau
Mounia Kharbouche-Harrari
19
5
0
25 Apr 2023
Within-Camera Multilayer Perceptron DVS Denoising
A. Rios-Navarro
Shi Guo
G. Abarajithan
K. Vijayakumar
A. Linares-Barranco
T. Aarrestad
Ryan Kastner
T. Delbruck
21
8
0
15 Apr 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
Javier Campos
Zhen Dong
Javier Mauricio Duarte
A. Gholami
Michael W. Mahoney
Jovan Mitrevski
Nhan Tran
MQ
24
3
0
13 Apr 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
17
2
0
15 Feb 2023
BOMP-NAS: Bayesian Optimization Mixed Precision NAS
David van Son
F. D. Putter
Sebastian Vogel
Henk Corporaal
MQ
17
3
0
27 Jan 2023
PELICAN: Permutation Equivariant and Lorentz Invariant or Covariant Aggregator Network for Particle Physics
A. Bogatskiy
Timothy Hoffman
David W. Miller
Jan T. Offermann
16
30
0
01 Nov 2022
FIT: A Metric for Model Sensitivity
Ben Zandonati
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
16
8
0
16 Oct 2022
A Closer Look at Hardware-Friendly Weight Quantization
Sungmin Bae
Piotr Zielinski
S. Chatterjee
MQ
16
0
0
07 Oct 2022
LL-GNN: Low Latency Graph Neural Networks on FPGAs for High Energy Physics
Zhiqiang Que
Hongxiang Fan
Marcus Loo
He Li
Michaela Blott
M. Pierini
A. Tapper
Wayne Luk
GNN
29
12
0
28 Sep 2022
Reducing Computational Complexity of Neural Networks in Optical Channel Equalization: From Concepts to Implementation
Pedro J. Freire
A. Napoli
D. A. Ron
B. Spinnler
M. Anderson
W. Schairer
T. Bex
N. Costa
S. Turitsyn
Jaroslaw E. Prilepsky
25
28
0
26 Aug 2022
Neural network accelerator for quantum control
David Xu
A. B. Özgüler
G. D. Guglielmo
Nhan Tran
G. Perdue
L. Carloni
F. Fahim
21
7
0
04 Aug 2022
AI Augmented Edge and Fog Computing: Trends and Challenges
Shreshth Tuli
Fatemeh Mirhakimi
Samodha Pallewatta
Syed Zawad
G. Casale
B. Javadi
Feng Yan
Rajkumar Buyya
N. Jennings
19
56
0
01 Aug 2022
FastML Science Benchmarks: Accelerating Real-Time Scientific Edge Machine Learning
Javier Mauricio Duarte
Nhan Tran
B. Hawks
C. Herwig
J. Muhizi
Shvetank Prakash
Vijay Janapa Reddi
25
11
0
16 Jul 2022
QONNX: Representing Arbitrary-Precision Quantized Neural Networks
Alessandro Pappalardo
Yaman Umuroglu
Michaela Blott
Jovan Mitrevski
B. Hawks
...
J. Muhizi
Matthew Trahms
Shih-Chieh Hsu
Scott Hauck
Javier Mauricio Duarte
MQ
11
18
0
15 Jun 2022
Memory-Oriented Design-Space Exploration of Edge-AI Hardware for XR Applications
V. Parmar
Syed Shakib Sarwar
Ziyun Li
H. Lee
B. D. Salvo
Manan Suri
22
1
0
08 Jun 2022
FELARE: Fair Scheduling of Machine Learning Tasks on Heterogeneous Edge Systems
Ali Mokhtari
Md. Abir Hossen
Pooyan Jamshidi
M. Salehi
24
9
0
31 May 2022
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
19
117
0
29 May 2022
Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml
Nicolò Ghielmetti
Vladimir Loncar
M. Pierini
Marcel Roed
S. Summers
...
Christoffer Petersson
H. Linander
J. Ngadiuba
Kelvin Lin
Philip C. Harris
SSeg
30
20
0
16 May 2022
DropTrack -- automatic droplet tracking using deep learning for microfluidic applications
M. Durve
A. Tiribocchi
F. Bonaccorso
A. Montessori
M. Lauricella
Michał Bogdan
J. Guzowski
S. Succi
VOT
17
17
0
05 May 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
9
3
0
30 Mar 2022
Graph Neural Networks in Particle Physics: Implementations, Innovations, and Challenges
S. Thais
P. Calafiura
G. Chachamis
G. Dezoort
Javier Mauricio Duarte
S. Ganguly
Michael Kagan
D. Murnane
Mark S. Neubauer
K. Terao
PINN
AI4CE
20
30
0
23 Mar 2022
Energy-Efficient Respiratory Anomaly Detection in Premature Newborn Infants
A. Paul
Md. Abu Saleh Tajin
Anup Das
W. Mongan
K. Dandekar
27
11
0
21 Feb 2022
Graph Neural Networks for Charged Particle Tracking on FPGAs
Abdelrahman Elabd
Vesal Razavimaleki
Shih-Yu Huang
Javier Mauricio Duarte
M. Atkinson
...
Bo-Cheng Lai
Mark S. Neubauer
I. Ojalvo
S. Thais
Matthew Trahms
GNN
30
35
0
03 Dec 2021
Online-compatible Unsupervised Non-resonant Anomaly Detection
Vinicius Mikuni
Benjamin Nachman
David Shih
14
35
0
11 Nov 2021
Applications and Techniques for Fast Machine Learning in Science
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
6
71
0
25 Oct 2021
Nanosecond machine learning event classification with boosted decision trees in FPGA for high energy physics
Tae Min Hong
B. Carlson
Brandon Eubanks
Stephen Racz
Stephen Roche
J. Stelzer
Daniel C. Stumpp
10
23
0
07 Apr 2021
Charged particle tracking via edge-classifying interaction networks
G. Dezoort
S. Thais
Javier Mauricio Duarte
Vesal Razavimaleki
M. Atkinson
I. Ojalvo
Mark S. Neubauer
P. Elmer
19
46
0
30 Mar 2021
1
2
Next