ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.04421
  4. Cited By
FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence
  on the Edge

FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge

9 April 2019
Cong Hao
Xiaofan Zhang
Yuhong Li
Sitao Huang
Jinjun Xiong
K. Rupnow
Wen-mei W. Hwu
Deming Chen
ArXiv (abs)PDFHTML

Papers citing "FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge"

50 / 68 papers shown
Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous ModelsProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2025
Y. Cai
Ziqi Zhang
Ding Li
Yao Guo
Xiangqun Chen
472
0
0
13 Mar 2025
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and CompressionComputer Vision and Pattern Recognition (CVPR), 2025
Xiaoyi Qu
David Aponte
Colby R. Banbury
Daniel P. Robinson
Tianyu Ding
K. Koishida
Ilya Zharkov
Tianyi Chen
MQ
335
7
0
23 Feb 2025
Reducing Inference Energy Consumption Using Dual Complementary CNNs
Reducing Inference Energy Consumption Using Dual Complementary CNNsFuture generations computer systems (FGCS), 2024
Michail Kinnas
John Violos
I. Kompatsiaris
Symeon Papadopoulos
217
6
0
02 Dec 2024
FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible
  Pruning in Uncertain Scenarios
FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios
Zekai Chen
Chentao Jia
Ming Hu
Xiaofei Xie
Anran Li
Xiao He
191
10
0
17 Jul 2024
New Solutions on LLM Acceleration, Optimization, and Application
New Solutions on LLM Acceleration, Optimization, and Application
Yingbing Huang
Lily Jiaxin Wan
Hanchen Ye
Manvi Jha
Jinghua Wang
Yuhong Li
Xiaofan Zhang
Deming Chen
288
22
0
16 Jun 2024
CiMNet: Towards Joint Optimization for DNN Architecture and
  Configuration for Compute-In-Memory Hardware
CiMNet: Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Souvik Kundu
Anthony Sarah
Vinay Joshi
O. J. Omer
S. Subramoney
228
1
0
19 Feb 2024
Enabling Resource-efficient AIoT System with Cross-level Optimization: A
  survey
Enabling Resource-efficient AIoT System with Cross-level Optimization: A surveyIEEE Communications Surveys and Tutorials (COMST), 2023
Sicong Liu
Bin Guo
Cheng Fang
Ziqi Wang
Shiyan Luo
Zimu Zhou
Zhiwen Yu
AI4CE
300
38
0
27 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
313
33
0
27 Aug 2023
CARMA: Context-Aware Runtime Reconfiguration for Energy-Efficient Sensor
  Fusion
CARMA: Context-Aware Runtime Reconfiguration for Energy-Efficient Sensor FusionInternational Symposium on Low Power Electronics and Design (ISLPED), 2023
Yifan Zhang
Arnav V. Malawade
Xiaofang Zhang
Yuhui Li
DongHwan Seong
Mohammad Abdullah Al Faruque
Sitao Huang
200
4
0
27 Jun 2023
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep
  Learning Acceleration
MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning AccelerationInternational Conference on Field-Programmable Logic and Applications (FPL), 2023
Zhiqiang Que
Shuo Liu
Markus Rognlien
Ce Guo
Jose G. F. Coutinho
Wayne Luk
152
8
0
14 Jun 2023
RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on
  Edge
RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on EdgeIEEE Internet of Things Journal (IEEE IoT J.), 2023
Adithya Krishna
Srikanth Rohit Nudurupati
Chandana D G
Pritesh Dwivedi
André van Schaik
M. Mehendale
Chetan Singh Thakur
116
23
0
10 Jun 2023
A High-Performance Accelerator for Super-Resolution Processing on
  Embedded GPU
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
135
12
0
16 Mar 2023
TinyAD: Memory-efficient anomaly detection for time series data in
  Industrial IoT
TinyAD: Memory-efficient anomaly detection for time series data in Industrial IoTIEEE Transactions on Industrial Informatics (IEEE TII), 2023
Yuting Sun
Tong Chen
Quoc Viet Hung Nguyen
Hongzhi Yin
210
22
0
07 Mar 2023
Tiny Classifier Circuits: Evolving Accelerators for Tabular Data
Tiny Classifier Circuits: Evolving Accelerators for Tabular Data
Konstantinos Iordanou
Timothy Atkinson
Emre Ozer
Jedrzej Kufel
J. Biggs
Gavin Brown
M. Luján
197
1
0
28 Feb 2023
Enabling Hard Constraints in Differentiable Neural Network and
  Accelerator Co-Exploration
Enabling Hard Constraints in Differentiable Neural Network and Accelerator Co-ExplorationDesign Automation Conference (DAC), 2022
Deokki Hong
Kanghyun Choi
Hyeyoon Lee
Joonsang Yu
Noseong Park
Youngsok Kim
Jinho Lee
142
4
0
23 Jan 2023
Theta-Resonance: A Single-Step Reinforcement Learning Method for Design
  Space Exploration
Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration
Masood S. Mortazavi
Tiancheng Qin
Ning Yan
285
4
0
03 Nov 2022
Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge
  Devices
Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge DevicesAsia and South Pacific Design Automation Conference (ASP-DAC), 2022
Yimeng Zhang
A. Kamath
Qiucheng Wu
Zhiwen Fan
Wuyang Chen
Zinan Lin
Shiyu Chang
Sijia Liu
Cong Hao
267
6
0
16 Oct 2022
PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference
Hongwu Peng
Shangli Zhou
Yukui Luo
Shijin Duan
Nuo Xu
...
Tong Geng
Ang Li
Wujie Wen
Xiaolin Xu
Caiwen Ding
177
5
0
20 Sep 2022
HiKonv: Maximizing the Throughput of Quantized Convolution With Novel
  Bit-wise Management and Computation
HiKonv: Maximizing the Throughput of Quantized Convolution With Novel Bit-wise Management and Computation
Yao Chen
Junhao Pan
Xinheng Liu
Jinjun Xiong
Deming Chen
MQ
108
0
0
22 Jul 2022
EASNet: Searching Elastic and Accurate Network Architecture for Stereo
  Matching
EASNet: Searching Elastic and Accurate Network Architecture for Stereo MatchingEuropean Conference on Computer Vision (ECCV), 2022
Qiang-qiang Wang
Shaoshuai Shi
Kaiyong Zhao
Xiaowen Chu
158
6
0
20 Jul 2022
Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space
  Exploration Tool for FPGA High-Level Synthesis
Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level SynthesisIdeal (IDEAL), 2022
Mang Yu
Sitao Huang
Deming Chen
133
12
0
03 Jul 2022
Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware
  Accelerators
Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators
Indhumathi Kandaswamy
Saurabh Farkya
Z. Daniels
G. V. D. Wal
Aswin Raghavan
...
Jun Hu
M. Lomnitz
M. Isnardi
David C. Zhang
M. Piacentino
BDL
156
6
0
10 Jun 2022
Compilation and Optimizations for Efficient Machine Learning on Embedded
  Systems
Compilation and Optimizations for Efficient Machine Learning on Embedded Systems
Xiaofan Zhang
Yao Chen
Cong Hao
Sitao Huang
Yuhong Li
Deming Chen
228
2
0
06 Jun 2022
The Larger The Fairer? Small Neural Networks Can Achieve Fairness for
  Edge Devices
The Larger The Fairer? Small Neural Networks Can Achieve Fairness for Edge DevicesDesign Automation Conference (DAC), 2022
Yi Sheng
Junhuan Yang
Yawen Wu
Kevin Mao
Yiyu Shi
Jingtong Hu
Weiwen Jiang
Lei Yang
222
31
0
23 Feb 2022
EF-Train: Enable Efficient On-device CNN Training on FPGA Through Data
  Reshaping for Online Adaptation or Personalization
EF-Train: Enable Efficient On-device CNN Training on FPGA Through Data Reshaping for Online Adaptation or Personalization
Yue Tang
Xinyi Zhang
Peipei Zhou
Jingtong Hu
147
25
0
18 Feb 2022
HiKonv: High Throughput Quantized Convolution With Novel Bit-wise
  Management and Computation
HiKonv: High Throughput Quantized Convolution With Novel Bit-wise Management and ComputationAsia and South Pacific Design Automation Conference (ASP-DAC), 2021
Xinheng Liu
Yao Chen
Prakhar Ganesh
Junhao Pan
Jinjun Xiong
Deming Chen
MQ
116
17
0
28 Dec 2021
Algorithm and Hardware Co-design for Reconfigurable CNN Accelerator
Algorithm and Hardware Co-design for Reconfigurable CNN Accelerator
Hongxiang Fan
Martin Ferianc
Zhiqiang Que
He Li
Shuanglong Liu
Xinyu Niu
Wayne Luk
3DV
169
13
0
24 Nov 2021
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture
  Search
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search
Qian Jiang
Xiaofan Zhang
Deming Chen
Minh Do
Raymond A. Yeh
156
8
0
24 Nov 2021
RT-RCG: Neural Network and Accelerator Search Towards Effective and
  Real-time ECG Reconstruction from Intracardiac Electrograms
RT-RCG: Neural Network and Accelerator Search Towards Effective and Real-time ECG Reconstruction from Intracardiac ElectrogramsACM Journal on Emerging Technologies in Computing Systems (JETC), 2021
Yongan Zhang
Anton Banta
Yonggan Fu
M. John
A. Post
M. Razavi
Joseph R. Cavallaro
B. Aazhang
Yingyan Lin
158
4
0
04 Nov 2021
Machine Learning for the Control and Monitoring of Electric Machine
  Drives: Advances and Trends
Machine Learning for the Control and Monitoring of Electric Machine Drives: Advances and TrendsIEEE Open Journal of Industry Applications (JOIA), 2021
Shen Zhang
Oliver Wallscheid
Mario Porrmann
161
62
0
11 Oct 2021
SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN
  Accelerators for Edge Inference
SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference
Jude Haris
Perry Gibson
José Cano
Nicolas Bohm Agostini
David Kaeli
193
24
0
01 Oct 2021
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and
  Efficiency
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency
Yongan Zhang
Haoran You
Yonggan Fu
Tong Geng
Ang Li
Yingyan Lin
GNN
181
32
0
18 Sep 2021
A High Throughput Parallel Hash Table on FPGA using XOR-based Memory
A High Throughput Parallel Hash Table on FPGA using XOR-based MemoryIEEE Conference on High Performance Extreme Computing (HPEC), 2020
Ruizhi Zhang
Sasindu Wijeratne
Yang Yang
S. Kuppannagari
Viktor Prasanna
230
5
0
07 Aug 2021
Generic Neural Architecture Search via Regression
Generic Neural Architecture Search via RegressionNeural Information Processing Systems (NeurIPS), 2021
Yuhong Li
Cong Hao
Pan Li
Jinjun Xiong
Deming Chen
209
34
0
04 Aug 2021
WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient
  Convolutional Neural Network Acceleration on FPGAs
WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAsIEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021
Xinheng Liu
Yao Chen
Cong Hao
Ashutosh Dhar
Deming Chen
119
21
0
09 Jul 2021
How to Reach Real-Time AI on Consumer Devices? Solutions for
  Programmable and Custom Architectures
How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom ArchitecturesIEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021
Stylianos I. Venieris
Ioannis Panopoulos
Ilias Leontiadis
I. Venieris
249
7
0
21 Jun 2021
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement LearningDesign Automation Conference (DAC), 2021
Yonggan Fu
Yongan Zhang
Chaojian Li
Zhongzhi Yu
Yingyan Lin
170
6
0
11 Jun 2021
NAAS: Neural Accelerator Architecture Search
NAAS: Neural Accelerator Architecture SearchDesign Automation Conference (DAC), 2021
Chengyue Wu
Mengtian Yang
Song Han
156
70
0
27 May 2021
A Full-Stack Search Technique for Domain Optimized Deep Learning
  Accelerators
A Full-Stack Search Technique for Domain Optimized Deep Learning AcceleratorsInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
235
54
0
26 May 2021
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization,
  and Ultra-Low Latency Acceleration
3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low BitwidthQuantization, and Ultra-Low Latency AccelerationACM Great Lakes Symposium on VLSI (GLSVLSI), 2021
Yao Chen
Cole Hawkins
Kaiqi Zhang
Zheng Zhang
Cong Hao
158
9
0
11 May 2021
HAO: Hardware-aware neural Architecture Optimization for Efficient
  Inference
HAO: Hardware-aware neural Architecture Optimization for Efficient InferenceIEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2021
Zhen Dong
Yizhao Gao
Qijing Huang
J. Wawrzynek
Hayden Kwok-Hay So
Kurt Keutzer
179
41
0
26 Apr 2021
Enabling Cross-Domain Communication: How to Bridge the Gap between AI
  and HW Engineers
Enabling Cross-Domain Communication: How to Bridge the Gap between AI and HW Engineers
M. Klaiber
Axel Acosta
I. Feldner
Falk Rehm
97
1
0
08 Apr 2021
Enabling Design Methodologies and Future Trends for Edge AI:
  Specialization and Co-design
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-designIEEE design & test (DT), 2021
Cong Hao
Jordan Dotzel
Jinjun Xiong
Luca Benini
Zhiru Zhang
Deming Chen
208
41
0
25 Mar 2021
HSCoNAS: Hardware-Software Co-Design of Efficient DNNs via Neural
  Architecture Search
HSCoNAS: Hardware-Software Co-Design of Efficient DNNs via Neural Architecture SearchDesign, Automation and Test in Europe (DATE), 2021
Xiangzhong Luo
Di Liu
Shuo Huai
Weichen Liu
134
9
0
11 Mar 2021
A Comprehensive Survey on Hardware-Aware Neural Architecture Search
A Comprehensive Survey on Hardware-Aware Neural Architecture Search
Hadjer Benmeziane
Kaoutar El Maghraoui
Hamza Ouarnoughi
Smail Niar
Martin Wistuba
Naigang Wang
232
131
0
22 Jan 2021
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with
  Fractional Activations
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional ActivationsSymposium on Field Programmable Gate Arrays (FPGA), 2020
Yichi Zhang
Junhao Pan
Xinheng Liu
Hongzheng Chen
Deming Chen
Zhiru Zhang
MQ
239
103
0
22 Dec 2020
When Machine Learning Meets Quantum Computers: A Case Study
When Machine Learning Meets Quantum Computers: A Case StudyAsia and South Pacific Design Automation Conference (ASP-DAC), 2020
Weiwen Jiang
Jinjun Xiong
Yiyu Shi
206
29
0
18 Dec 2020
DNA: Differentiable Network-Accelerator Co-Search
DNA: Differentiable Network-Accelerator Co-Search
Yongan Zhang
Y. Fu
Weiwen Jiang
Chaojian Li
Haoran You
Meng Li
Vikas Chandra
Yingyan Lin
325
18
0
28 Oct 2020
DANCE: Differentiable Accelerator/Network Co-Exploration
DANCE: Differentiable Accelerator/Network Co-ExplorationDesign Automation Conference (DAC), 2020
Kanghyun Choi
Deokki Hong
Hojae Yoon
Joonsang Yu
Youngsok Kim
Jinho Lee
280
49
0
14 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators
  using Reinforcement Learning
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement LearningMicro (MICRO), 2020
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
301
108
0
04 Sep 2020
12
Next
Page 1 of 2