Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.12842
Cited By
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
26 May 2021
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators"
11 / 11 papers shown
Title
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving
Wenqi Jiang
Suvinay Subramanian
Cat Graves
Gustavo Alonso
Amir Yazdanbakhsh
Vidushi Dadu
49
6
0
18 Mar 2025
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
R. Prabhakar
R. Sivaramakrishnan
Darshan Gandhi
Yun Du
Mingran Wang
...
Urmish Thakker
Dawei Huang
Sumti Jairath
Kevin J. Brown
K. Olukotun
MoE
39
12
0
13 May 2024
Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning Training
Muhammad Adnan
Amar Phanishayee
Janardhan Kulkarni
Prashant J. Nair
Divyat Mahajan
29
0
0
23 Apr 2024
Training Large Language Models Efficiently with Sparsity and Dataflow
V. Srinivasan
Darshan Gandhi
Urmish Thakker
R. Prabhakar
MoE
30
6
0
11 Apr 2023
AutoML for neuromorphic computing and application-driven co-design: asynchronous, massively parallel optimization of spiking architectures
A. Yanguas-Gil
Sandeep Madireddy
14
3
0
26 Feb 2023
Multi-Agent Reinforcement Learning for Microprocessor Design Space Exploration
Srivatsan Krishnan
Natasha Jaques
Shayegan Omidshafiei
Dan Zhang
Izzeddin Gur
Vijay Janapa Reddi
Aleksandra Faust
24
2
0
29 Nov 2022
Open Source Vizier: Distributed Infrastructure and API for Reliable and Flexible Blackbox Optimization
Xingyou Song
Sagi Perel
Chansoo Lee
Greg Kochanski
Daniel Golovin
29
26
0
27 Jul 2022
Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems
Shail Dave
Alberto Marchisio
Muhammad Abdullah Hanif
Amira Guesmi
Aviral Shrivastava
Ihsen Alouani
Muhammad Shafique
28
13
0
18 Apr 2022
Carbon Emissions and Large Neural Network Training
David A. Patterson
Joseph E. Gonzalez
Quoc V. Le
Chen Liang
Lluís-Miquel Munguía
D. Rothchild
David R. So
Maud Texier
J. Dean
AI4CE
244
643
0
21 Apr 2021
Rethinking Co-design of Neural Architectures and Hardware Accelerators
Yanqi Zhou
Xuanyi Dong
Berkin Akin
Mingxing Tan
Daiyi Peng
Tianjian Meng
Amir Yazdanbakhsh
Da Huang
Ravi Narayanaswami
James Laudon
49
26
0
17 Feb 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,567
0
17 Apr 2017
1