ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.07412
  4. Cited By
Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation
  using Deep Neural Networks
v1v2 (latest)

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks

21 August 2018
Charith Mendis
Alex Renda
Saman P. Amarasinghe
Michael Carbin
ArXiv (abs)PDFHTML

Papers citing "Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks"

48 / 48 papers shown
Title
PrETi: Predicting Execution Time in Early Stage with LLVM and Machine Learning
PrETi: Predicting Execution Time in Early Stage with LLVM and Machine Learning
Risheng Xu
Philipp Sieweck
Hermann von Hasseln
Dirk Nowotka
65
0
0
17 Mar 2025
Interference-Aware Edge Runtime Prediction with Conformal Matrix Completion
Tianshu Huang
Arjun Ramesh
Emily Ruppel
Nuno Pereira
Anthony G. Rowe
Carlee Joe-Wong
70
0
0
09 Mar 2025
Data-efficient Performance Modeling via Pre-training
Data-efficient Performance Modeling via Pre-training
Chunting Liu
Riyadh Baghdadi
182
1
0
24 Jan 2025
Specification Generation for Neural Networks in Systems
Specification Generation for Neural Networks in Systems
Isha Chaudhary
Shuyi Lin
Cheng Tan
Gagandeep Singh
143
0
0
04 Dec 2024
Performance Debugging through Microarchitectural Sensitivity and
  Causality Analysis
Performance Debugging through Microarchitectural Sensitivity and Causality Analysis
Alban Dutilleul
Hugo Pompougnac
Nicolas Derumigny
Gabriel Rodriguez
Valentin Trophime
Christophe Guillon
Fabrice Rastello
113
0
0
03 Dec 2024
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
Mufei Li
Viraj Shitole
Eli Chien
Changhai Man
Zhaodong Wang
Srinivas Sridharan
Ying Zhang
Tushar Krishna
P. Li
83
2
0
04 Nov 2024
A Benchmark on Directed Graph Representation Learning in Hardware
  Designs
A Benchmark on Directed Graph Representation Learning in Hardware Designs
Haoyu Wang
Yinan Huang
Nan Wu
Pan Li
OOD
126
1
0
09 Oct 2024
Tao: Re-Thinking DL-based Microarchitecture Simulation
Tao: Re-Thinking DL-based Microarchitecture Simulation
Santosh Pandey
Amir Yazdanbakhsh
Hang Liu
AI4CE
77
3
0
16 Apr 2024
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
Massinissa Merouani
Khaled Afif Boudaoud
Iheb Nassim Aouadj
Nassim Tchoulak
Islam Kara Bernou
Hamza Benyamina
F. B. Tayeb
K. Benatchba
Hugh Leather
Riyadh Baghdadi
135
3
0
18 Mar 2024
OmniPred: Language Models as Universal Regressors
OmniPred: Language Models as Universal Regressors
Xingyou Song
Oscar Li
Chansoo Lee
Bangding Yang
Daiyi Peng
Sagi Perel
Yutian Chen
115
16
0
22 Feb 2024
Learning Generalizable Program and Architecture Representations for
  Performance Modeling
Learning Generalizable Program and Architecture Representations for Performance Modeling
Lingda Li
T. Flynn
A. Hoisie
57
2
0
25 Oct 2023
AdaMEC: Towards a Context-Adaptive and Dynamically-Combinable DNN
  Deployment Framework for Mobile Edge Computing
AdaMEC: Towards a Context-Adaptive and Dynamically-Combinable DNN Deployment Framework for Mobile Edge Computing
Bowen Pang
Sicong Liu
Hongli Wang
Bin Guo
Yuzhan Wang
Hao Wang
Zhenli Sheng
Zhongyi Wang
Zhiwen Yu
67
3
0
25 Oct 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor
  Computational Graphs
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
P. Phothilimthana
Sami Abu-El-Haija
Kaidi Cao
Bahare Fatemi
Mike Burrows
Charith Mendis
Bryan Perozzi
GNNAI4TS
123
20
0
25 Aug 2023
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted
  Architecture Design
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design
Srivatsan Krishnan
Amir Yazdanbaksh
Shvetank Prakash
Jason J. Jabbour
Ikechukwu Uchendu
...
Behzad Boroujerdian
Daniel Richins
Devashree Tripathy
Aleksandra Faust
Vijay Janapa Reddi
116
14
0
15 Jun 2023
Transfer Learning Across Heterogeneous Features For Efficient Tensor
  Program Generation
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Gaurav Verma
Siddhisanket Raskar
Zhenda Xie
A. Malik
M. Emani
Barbara M. Chapman
65
2
0
11 Apr 2023
ML-driven Hardware Cost Model for MLIR
ML-driven Hardware Cost Model for MLIR
Dibyendu Das
Sandya Mannarswamy
76
0
0
14 Feb 2023
COMET: Neural Cost Model Explanation Framework
COMET: Neural Cost Model Explanation Framework
Isha Chaudhary
Alex Renda
Charith Mendis
Gagandeep Singh
81
2
0
14 Feb 2023
Application Performance Modeling via Tensor Completion
Application Performance Modeling via Tensor Completion
Edward Hutter
Edgar Solomonik
53
3
0
18 Oct 2022
GRANITE: A Graph Neural Network Model for Basic Block Throughput
  Estimation
GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
O. Sýkora
P. Phothilimthana
Charith Mendis
Amir Yazdanbakhsh
GNN
108
21
0
08 Oct 2022
Optimizing DNN Compilation for Distributed Training with Joint OP and
  Tensor Fusion
Optimizing DNN Compilation for Distributed Training with Joint OP and Tensor Fusion
Xiaodong Yi
Shiwei Zhang
Lansong Diao
Chuan Wu
Zhen Zheng
Shiqing Fan
Siyu Wang
Jun Yang
W. Lin
67
4
0
26 Sep 2022
Learning to Learn to Predict Performance Regressions in Production at
  Meta
Learning to Learn to Predict Performance Regressions in Production at Meta
M. Beller
Hongyu Li
V. Nair
V. Murali
Imad Ahmad
Jürgen Cito
Drew Carlson
Gareth Ari Aye
Wes Dyer
63
5
0
08 Aug 2022
Unsupervised Learning for Combinatorial Optimization with Principled
  Objective Relaxation
Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation
Haoyu Wang
Nan Wu
Hang Yang
Cong Hao
Pan Li
104
32
0
13 Jul 2022
CODE-MVP: Learning to Represent Source Code from Multiple Views with
  Contrastive Pre-Training
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang
Yasheng Wang
Yao Wan
Jiawei Wang
Pingyi Zhou
Li Li
Hao Wu
Jin Liu
75
36
0
04 May 2022
At the Locus of Performance: Quantifying the Effects of Copious
  3D-Stacked Cache on HPC Workloads
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads
Jens Domke
Emil Vatai
Balazs Gerofi
Yuetsu Kodama
Mohamed Wahib
...
Miquel Pericas
Lingqi Zhang
Peng Chen
Aleksandr Drozd
Satoshi Matsuoka
36
1
0
05 Apr 2022
RL4ReAl: Reinforcement Learning for Register Allocation
RL4ReAl: Reinforcement Learning for Register Allocation
S. VenkataKeerthy
Siddhartha Jain
Anilava Kundu
Rohit Aggarwal
Albert Cohen
Ramakrishna Upadrasta
OffRL
109
6
0
05 Apr 2022
BB-ML: Basic Block Performance Prediction using Machine Learning
  Techniques
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques
H. Abdelkhalik
Shammi Aktar
Yehia Arafa
Atanu Barai
Gopinath Chennupati
...
N. Panda
Nirmal Prajapati
Nazmul Haque Turja
S. Eidenbenz
Abdel-Hameed A. Badawy
21
2
0
16 Feb 2022
Profile Guided Optimization without Profiles: A Machine Learning
  Approach
Profile Guided Optimization without Profiles: A Machine Learning Approach
Nadav Rotem
Chris Cummins
OffRL
73
8
0
24 Dec 2021
Programming with Neural Surrogates of Programs
Programming with Neural Surrogates of Programs
Alex Renda
Yi Ding
Michael Carbin
36
4
0
12 Dec 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for
  AI Research
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
74
76
0
17 Sep 2021
Program-to-Circuit: Exploiting GNNs for Program Representation and
  Circuit Translation
Program-to-Circuit: Exploiting GNNs for Program Representation and Circuit Translation
Nan Wu
Huake He
Yuan Xie
Pan Li
Cong Hao
GNN
81
3
0
13 Sep 2021
Using Graph Neural Networks to model the performance of Deep Neural
  Networks
Using Graph Neural Networks to model the performance of Deep Neural Networks
Shikhar Singh
Benoit Steiner
James Hegarty
Hugh Leather
GNN
38
3
0
27 Aug 2021
Latent Execution for Neural Program Synthesis
Latent Execution for Neural Program Synthesis
Xinyun Chen
Basel Alomair
Yuandong Tian
NAI
116
53
0
29 Jun 2021
SimNet: Accurate and High-Performance Computer Architecture Simulation
  using Deep Learning
SimNet: Accurate and High-Performance Computer Architecture Simulation using Deep Learning
Lingda Li
Santosh Pandey
T. Flynn
Hang Liu
Noel Wheeler
A. Hoisie
38
8
0
12 May 2021
A Deep Learning Based Cost Model for Automatic Code Optimization
A Deep Learning Based Cost Model for Automatic Code Optimization
Riyadh Baghdadi
Massinissa Merouani
Mohamed-Hicham Leghettas
K. Abdous
T. Arbaoui
K. Benatchba
Saman P. Amarasinghe
81
71
0
11 Apr 2021
Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space
  Search
Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space Search
Kartik Hegde
Po-An Tsai
Sitao Huang
Vikas Chandra
A. Parashar
Christopher W. Fletcher
72
97
0
02 Mar 2021
An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks
An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks
K. Seshadri
Berkin Akin
James Laudon
Ravi Narayanaswami
Amir Yazdanbakhsh
105
121
0
20 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TSAI4CE
108
152
0
16 Feb 2021
A Runtime-Based Computational Performance Predictor for Deep Neural
  Network Training
A Runtime-Based Computational Performance Predictor for Deep Neural Network Training
Geoffrey X. Yu
Yubo Gao
P. Golikov
Gennady Pekhimenko
3DH
69
68
0
31 Jan 2021
The Tribes of Machine Learning and the Realm of Computer Architecture
The Tribes of Machine Learning and the Realm of Computer Architecture
Ayaz Akram
Jason Lowe-Power
AI4CE
40
2
0
07 Dec 2020
DiffTune: Optimizing CPU Simulator Parameters with Learned
  Differentiable Surrogates
DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Alex Renda
Yishen Chen
Charith Mendis
Michael Carbin
59
36
0
08 Oct 2020
A Learned Performance Model for Tensor Processing Units
A Learned Performance Model for Tensor Processing Units
Samuel J. Kaufman
P. Phothilimthana
Yanqi Zhou
Charith Mendis
Sudip Roy
Amit Sabne
Mike Burrows
76
8
0
03 Aug 2020
Contrastive Code Representation Learning
Contrastive Code Representation Learning
Paras Jain
Ajay Jain
Tianjun Zhang
Pieter Abbeel
Joseph E. Gonzalez
Ion Stoica
SSLDRL
132
151
0
09 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
112
85
0
02 Jul 2020
ProTuner: Tuning Programs with Monte Carlo Tree Search
ProTuner: Tuning Programs with Monte Carlo Tree Search
Ameer Haj-Ali
Hasan Genç
Qijing Huang
William S. Moses
J. Wawrzynek
Krste Asanović
Ion Stoica
78
25
0
27 May 2020
Towards High Performance, Portability, and Productivity: Lightweight
  Augmented Neural Networks for Performance Prediction
Towards High Performance, Portability, and Productivity: Lightweight Augmented Neural Networks for Performance Prediction
Ajitesh Srivastava
Naifeng Zhang
Rajgopal Kannan
Viktor Prasanna
20
2
0
17 Mar 2020
Proposition dún modèle pour lóptimisation automatique de boucles
  dans le compilateur Tiramisu : cas dóptimisation de déroulage
Proposition dún modèle pour lóptimisation automatique de boucles dans le compilateur Tiramisu : cas dóptimisation de déroulage
Asma Balamane
Zina Taklit
10
0
0
29 Jul 2019
Learning Execution through Neural Code Fusion
Learning Execution through Neural Code Fusion
Zhan Shi
Kevin Swersky
Daniel Tarlow
Parthasarathy Ranganathan
Milad Hashemi
GNN
116
29
0
17 Jun 2019
goSLP: Globally Optimized Superword Level Parallelism Framework
goSLP: Globally Optimized Superword Level Parallelism Framework
Charith Mendis
Saman P. Amarasinghe
46
38
0
23 Apr 2018
1