FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge

9 April 2019

Jinjun Xiong

Papers citing "FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge"

50 / 68 papers shown

Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous ModelsProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025

472

13 Mar 2025

Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and CompressionComputer Vision and Pattern Recognition (CVPR), 2025

335

23 Feb 2025

Reducing Inference Energy Consumption Using Dual Complementary CNNsFuture generations computer systems (FGCS), 2024

217

02 Dec 2024

FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios

Ming Hu

Anran Li

191

17 Jul 2024

New Solutions on LLM Acceleration, Optimization, and Application

Deming Chen

288

16 Jun 2024

CiMNet: Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware

228

19 Feb 2024

Enabling Resource-efficient AIoT System with Cross-level Optimization: A surveyIEEE Communications Surveys and Tutorials (COMST), 2023

300

27 Sep 2023

Computation-efficient Deep Learning for Computer Vision: A Survey

Yulin Wang

Gao Huang

313

27 Aug 2023

CARMA: Context-Aware Runtime Reconfiguration for Energy-Efficient Sensor FusionInternational Symposium on Low Power Electronics and Design (ISLPED), 2023

Mohammad Abdullah Al Faruque

Sitao Huang

200

27 Jun 2023

MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning AccelerationInternational Conference on Field-Programmable Logic and Applications (FPL), 2023

Ce Guo

152

14 Jun 2023

RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on EdgeIEEE Internet of Things Journal (IEEE IoT J.), 2023

Adithya Krishna

Srikanth Rohit Nudurupati

Chandana D G

Pritesh Dwivedi

André van Schaik

M. Mehendale

Chetan Singh Thakur

116

10 Jun 2023

A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

135

16 Mar 2023

TinyAD: Memory-efficient anomaly detection for time series data in Industrial IoTIEEE Transactions on Industrial Informatics (IEEE TII), 2023

Yuting Sun

Tong Chen

Quoc Viet Hung Nguyen

Hongzhi Yin

210

07 Mar 2023

Tiny Classifier Circuits: Evolving Accelerators for Tabular Data

Konstantinos Iordanou

197

28 Feb 2023

Enabling Hard Constraints in Differentiable Neural Network and Accelerator Co-ExplorationDesign Automation Conference (DAC), 2022

142

23 Jan 2023

Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration

Masood S. Mortazavi

Tiancheng Qin

Ning Yan

285

03 Nov 2022

Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge DevicesAsia and South Pacific Design Automation Conference (ASP-DAC), 2022

267

16 Oct 2022

PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference

Hongwu Peng

...

Caiwen Ding

177

20 Sep 2022

HiKonv: Maximizing the Throughput of Quantized Convolution With Novel Bit-wise Management and Computation

Jinjun Xiong

108

22 Jul 2022

EASNet: Searching Elastic and Accurate Network Architecture for Stereo MatchingEuropean Conference on Computer Vision (ECCV), 2022

158

20 Jul 2022

Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level SynthesisIdeal (IDEAL), 2022

Mang Yu

Sitao Huang

Deming Chen

133

03 Jul 2022

Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators

Indhumathi Kandaswamy

...

156

10 Jun 2022

Compilation and Optimizations for Efficient Machine Learning on Embedded Systems

228

06 Jun 2022

The Larger The Fairer? Small Neural Networks Can Achieve Fairness for Edge DevicesDesign Automation Conference (DAC), 2022

222

23 Feb 2022

EF-Train: Enable Efficient On-device CNN Training on FPGA Through Data Reshaping for Online Adaptation or Personalization

147

18 Feb 2022

HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and ComputationAsia and South Pacific Design Automation Conference (ASP-DAC), 2021

Jinjun Xiong

116

28 Dec 2021

Algorithm and Hardware Co-design for Reconfigurable CNN Accelerator

169

24 Nov 2021

EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search

156

24 Nov 2021

RT-RCG: Neural Network and Accelerator Search Towards Effective and Real-time ECG Reconstruction from Intracardiac ElectrogramsACM Journal on Emerging Technologies in Computing Systems (JETC), 2021

158

04 Nov 2021

Machine Learning for the Control and Monitoring of Electric Machine Drives: Advances and TrendsIEEE Open Journal of Industry Applications (JOIA), 2021

Shen Zhang

Oliver Wallscheid

Mario Porrmann

161

11 Oct 2021

SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference

Jude Haris

Perry Gibson

José Cano

Nicolas Bohm Agostini

David Kaeli

193

01 Oct 2021

G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency

181

18 Sep 2021

A High Throughput Parallel Hash Table on FPGA using XOR-based MemoryIEEE Conference on High Performance Extreme Computing (HPEC), 2020

230

07 Aug 2021

Generic Neural Architecture Search via RegressionNeural Information Processing Systems (NeurIPS), 2021

Jinjun Xiong

209

04 Aug 2021

WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAsIEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021

119

09 Jul 2021

How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom ArchitecturesIEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021

Stylianos I. Venieris

Ioannis Panopoulos

Ilias Leontiadis

I. Venieris

249

21 Jun 2021

A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement LearningDesign Automation Conference (DAC), 2021

170

11 Jun 2021

NAAS: Neural Accelerator Architecture SearchDesign Automation Conference (DAC), 2021

Chengyue Wu

Mengtian Yang

Song Han

156

27 May 2021

A Full-Stack Search Technique for Domain Optimized Deep Learning AcceleratorsInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021

235

26 May 2021

3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization, and Ultra-Low Latency AccelerationACM Great Lakes Symposium on VLSI (GLSVLSI), 2021

158

11 May 2021

HAO: Hardware-aware neural Architecture Optimization for Efficient InferenceIEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2021

Zhen Dong

179

26 Apr 2021

Enabling Cross-Domain Communication: How to Bridge the Gap between AI and HW Engineers

08 Apr 2021

Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-designIEEE design & test (DT), 2021

Cong Hao

Jordan Dotzel

Jinjun Xiong

Luca Benini

Zhiru Zhang

Deming Chen

208

25 Mar 2021

HSCoNAS: Hardware-Software Co-Design of Efficient DNNs via Neural Architecture SearchDesign, Automation and Test in Europe (DATE), 2021

134

11 Mar 2021

A Comprehensive Survey on Hardware-Aware Neural Architecture Search

232

131

22 Jan 2021

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional ActivationsSymposium on Field Programmable Gate Arrays (FPGA), 2020

239

103

22 Dec 2020

When Machine Learning Meets Quantum Computers: A Case StudyAsia and South Pacific Design Automation Conference (ASP-DAC), 2020

Weiwen Jiang

Jinjun Xiong

Yiyu Shi

206

18 Dec 2020

DNA: Differentiable Network-Accelerator Co-Search

325

28 Oct 2020

DANCE: Differentiable Accelerator/Network Co-ExplorationDesign Automation Conference (DAC), 2020

280

14 Sep 2020

ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement LearningMicro (MICRO), 2020

Sheng-Chun Kao

Geonhwa Jeong

T. Krishna

301

108

04 Sep 2020