v1v2 (latest)

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks

21 August 2018

Papers citing "Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks"

48 / 48 papers shown

Title
PrETi: Predicting Execution Time in Early Stage with LLVM and Machine Learning Risheng Xu Philipp Sieweck Hermann von Hasseln Dirk Nowotka 65 0 0 17 Mar 2025
Interference-Aware Edge Runtime Prediction with Conformal Matrix Completion Tianshu Huang Arjun Ramesh Emily Ruppel Nuno Pereira Anthony G. Rowe Carlee Joe-Wong 70 0 0 09 Mar 2025
Data-efficient Performance Modeling via Pre-training Chunting Liu Riyadh Baghdadi 182 1 0 24 Jan 2025
Specification Generation for Neural Networks in Systems Isha Chaudhary Shuyi Lin Cheng Tan Gagandeep Singh 143 0 0 04 Dec 2024
Performance Debugging through Microarchitectural Sensitivity and Causality Analysis Alban Dutilleul Hugo Pompougnac Nicolas Derumigny Gabriel Rodriguez Valentin Trophime Christophe Guillon Fabrice Rastello 113 0 0 03 Dec 2024
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation Mufei Li Viraj Shitole Eli Chien Changhai Man Zhaodong Wang Srinivas Sridharan Ying Zhang Tushar Krishna P. Li 83 2 0 04 Nov 2024
A Benchmark on Directed Graph Representation Learning in Hardware Designs Haoyu Wang Yinan Huang Nan Wu Pan Li OOD 126 1 0 09 Oct 2024
Tao: Re-Thinking DL-based Microarchitecture Simulation Santosh Pandey Amir Yazdanbakhsh Hang Liu AI4CE 77 3 0 16 Apr 2024
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers Massinissa Merouani Khaled Afif Boudaoud Iheb Nassim Aouadj Nassim Tchoulak Islam Kara Bernou Hamza Benyamina F. B. Tayeb K. Benatchba Hugh Leather Riyadh Baghdadi 135 3 0 18 Mar 2024
OmniPred: Language Models as Universal Regressors Xingyou Song Oscar Li Chansoo Lee Bangding Yang Daiyi Peng Sagi Perel Yutian Chen 115 16 0 22 Feb 2024
Learning Generalizable Program and Architecture Representations for Performance Modeling Lingda Li T. Flynn A. Hoisie 57 2 0 25 Oct 2023
AdaMEC: Towards a Context-Adaptive and Dynamically-Combinable DNN Deployment Framework for Mobile Edge Computing Bowen Pang Sicong Liu Hongli Wang Bin Guo Yuzhan Wang Hao Wang Zhenli Sheng Zhongyi Wang Zhiwen Yu 67 3 0 25 Oct 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs P. Phothilimthana Sami Abu-El-Haija Kaidi Cao Bahare Fatemi Mike Burrows Charith Mendis Bryan Perozzi GNN AI4TS 123 20 0 25 Aug 2023
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design Srivatsan Krishnan Amir Yazdanbaksh Shvetank Prakash Jason J. Jabbour Ikechukwu Uchendu ... Behzad Boroujerdian Daniel Richins Devashree Tripathy Aleksandra Faust Vijay Janapa Reddi 116 14 0 15 Jun 2023
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation Gaurav Verma Siddhisanket Raskar Zhenda Xie A. Malik M. Emani Barbara M. Chapman 65 2 0 11 Apr 2023
ML-driven Hardware Cost Model for MLIR Dibyendu Das Sandya Mannarswamy 76 0 0 14 Feb 2023
COMET: Neural Cost Model Explanation Framework Isha Chaudhary Alex Renda Charith Mendis Gagandeep Singh 81 2 0 14 Feb 2023
Application Performance Modeling via Tensor Completion Edward Hutter Edgar Solomonik 53 3 0 18 Oct 2022
GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation O. Sýkora P. Phothilimthana Charith Mendis Amir Yazdanbakhsh GNN 108 21 0 08 Oct 2022
Optimizing DNN Compilation for Distributed Training with Joint OP and Tensor Fusion Xiaodong Yi Shiwei Zhang Lansong Diao Chuan Wu Zhen Zheng Shiqing Fan Siyu Wang Jun Yang W. Lin 67 4 0 26 Sep 2022
Learning to Learn to Predict Performance Regressions in Production at Meta M. Beller Hongyu Li V. Nair V. Murali Imad Ahmad Jürgen Cito Drew Carlson Gareth Ari Aye Wes Dyer 63 5 0 08 Aug 2022
Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation Haoyu Wang Nan Wu Hang Yang Cong Hao Pan Li 104 32 0 13 Jul 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training Xin Wang Yasheng Wang Yao Wan Jiawei Wang Pingyi Zhou Li Li Hao Wu Jin Liu 75 36 0 04 May 2022
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads Jens Domke Emil Vatai Balazs Gerofi Yuetsu Kodama Mohamed Wahib ... Miquel Pericas Lingqi Zhang Peng Chen Aleksandr Drozd Satoshi Matsuoka 36 1 0 05 Apr 2022
RL4ReAl: Reinforcement Learning for Register Allocation S. VenkataKeerthy Siddhartha Jain Anilava Kundu Rohit Aggarwal Albert Cohen Ramakrishna Upadrasta OffRL 109 6 0 05 Apr 2022
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques H. Abdelkhalik Shammi Aktar Yehia Arafa Atanu Barai Gopinath Chennupati ... N. Panda Nirmal Prajapati Nazmul Haque Turja S. Eidenbenz Abdel-Hameed A. Badawy 21 2 0 16 Feb 2022
Profile Guided Optimization without Profiles: A Machine Learning Approach Nadav Rotem Chris Cummins OffRL 73 8 0 24 Dec 2021
Programming with Neural Surrogates of Programs Alex Renda Yi Ding Michael Carbin 36 4 0 12 Dec 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research Chris Cummins Bram Wasti Jiadong Guo Brandon Cui Jason Ansel ... Jia-Wei Liu O. Teytaud Benoit Steiner Yuandong Tian Hugh Leather 74 76 0 17 Sep 2021
Program-to-Circuit: Exploiting GNNs for Program Representation and Circuit Translation Nan Wu Huake He Yuan Xie Pan Li Cong Hao GNN 81 3 0 13 Sep 2021
Using Graph Neural Networks to model the performance of Deep Neural Networks Shikhar Singh Benoit Steiner James Hegarty Hugh Leather GNN 38 3 0 27 Aug 2021
Latent Execution for Neural Program Synthesis Xinyun Chen Basel Alomair Yuandong Tian NAI 116 53 0 29 Jun 2021
SimNet: Accurate and High-Performance Computer Architecture Simulation using Deep Learning Lingda Li Santosh Pandey T. Flynn Hang Liu Noel Wheeler A. Hoisie 38 8 0 12 May 2021
A Deep Learning Based Cost Model for Automatic Code Optimization Riyadh Baghdadi Massinissa Merouani Mohamed-Hicham Leghettas K. Abdous T. Arbaoui K. Benatchba Saman P. Amarasinghe 81 71 0 11 Apr 2021
Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space Search Kartik Hegde Po-An Tsai Sitao Huang Vikas Chandra A. Parashar Christopher W. Fletcher 72 97 0 02 Mar 2021
An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks K. Seshadri Berkin Akin James Laudon Ravi Narayanaswami Amir Yazdanbakhsh 105 121 0 20 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems Nan Wu Yuan Xie AI4TS AI4CE 108 152 0 16 Feb 2021
A Runtime-Based Computational Performance Predictor for Deep Neural Network Training Geoffrey X. Yu Yubo Gao P. Golikov Gennady Pekhimenko 3DH 69 68 0 31 Jan 2021
The Tribes of Machine Learning and the Realm of Computer Architecture Ayaz Akram Jason Lowe-Power AI4CE 40 2 0 07 Dec 2020
DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates Alex Renda Yishen Chen Charith Mendis Michael Carbin 59 36 0 08 Oct 2020
A Learned Performance Model for Tensor Processing Units Samuel J. Kaufman P. Phothilimthana Yanqi Zhou Charith Mendis Sudip Roy Amit Sabne Mike Burrows 76 8 0 03 Aug 2020
Contrastive Code Representation Learning Paras Jain Ajay Jain Tianjun Zhang Pieter Abbeel Joseph E. Gonzalez Ion Stoica SSL DRL 132 151 0 09 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights Shail Dave Riyadh Baghdadi Tony Nowatzki Sasikanth Avancha Aviral Shrivastava Baoxin Li 112 85 0 02 Jul 2020
ProTuner: Tuning Programs with Monte Carlo Tree Search Ameer Haj-Ali Hasan Genç Qijing Huang William S. Moses J. Wawrzynek Krste Asanović Ion Stoica 78 25 0 27 May 2020
Towards High Performance, Portability, and Productivity: Lightweight Augmented Neural Networks for Performance Prediction Ajitesh Srivastava Naifeng Zhang Rajgopal Kannan Viktor Prasanna 20 2 0 17 Mar 2020
Proposition dún modèle pour lóptimisation automatique de boucles dans le compilateur Tiramisu : cas dóptimisation de déroulage Asma Balamane Zina Taklit 10 0 0 29 Jul 2019
Learning Execution through Neural Code Fusion Zhan Shi Kevin Swersky Daniel Tarlow Parthasarathy Ranganathan Milad Hashemi GNN 116 29 0 17 Jun 2019
goSLP: Globally Optimized Superword Level Parallelism Framework Charith Mendis Saman P. Amarasinghe 46 38 0 23 Apr 2018