v1v2 (latest)

MLIR: A Compiler Infrastructure for the End of Moore's Law

25 February 2020

Papers citing "MLIR: A Compiler Infrastructure for the End of Moore's Law"

50 / 75 papers shown

SkyEgg: Joint Implementation Selection and Scheduling for Hardware Synthesis using E-graphsInternational Conference on Information Photonics (ICIP), 2024

Youwei Xiao

Yuyang Zou

Yun Liang

19 Nov 2025

TurkEmbed4Retrieval: Turkish Embedding Model for Retrieval Task

135

10 Nov 2025

Resource Estimation of CGGI and CKKS scheme workloads on FracTLcore Computing Fabric

Denis Ovichinnikov

Hemant Kavadia

Satya Keerti Chand Kudupudi

...

15 Oct 2025

LightCode: Compiling LLM Inference for Photonic-Electronic Systems

Ryan Tomich

Zhizhen Zhong

Dirk Englund

115

19 Sep 2025

GraphMend: Code Transformations for Fixing Graph Breaks in PyTorch 2

Savini Kashmira

Jayanaka L. Dantanarayana

Thamirawaran Sathiyalogeswaran

201

17 Sep 2025

Astra: A Multi-Agent System for GPU Kernel Performance Optimization

240

09 Sep 2025

Accelerating GenAI Workloads by Enabling RISC-V Microkernel Support in IREE

07 Jul 2025

DiTOX: Fault Detection and Localization in the ONNX Optimizer

Nikolaos Louloudakis

Ajitha Rajan

598

03 May 2025

Rulebook: bringing co-routines to reinforcement learning environments

Massimo Fioravanti

Samuele Pasini

Giovanni Agosta

218

28 Apr 2025

Morphing-based Compression for Data-centric ML Pipelines

Sebastian Baunsgaard

Matthias Boehm

258

15 Apr 2025

DSP-MLIR: A MLIR Dialect for Digital Signal ProcessingACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), 2024

Abhinav Kumar

Atharva Khedkar

Aviral Shrivastava

104

20 Aug 2024

vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs

161

01 May 2024

UniSparse: An Intermediate Language for General Sparse Format Customization

Zijian Ding

188

09 Mar 2024

Architectural Neural Backdoors from First PrinciplesIEEE Symposium on Security and Privacy (S&P), 2024

268

10 Feb 2024

PolyTOPS: Reconfigurable and Flexible Polyhedral SchedulerIEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2024

...

Artur Cesar Araujo Alves

119

12 Jan 2024

HElium: A Language and Compiler for Fully Homomorphic Encryption with Support for Proxy Re-Encryption

101

21 Dec 2023

Zero Bubble Pipeline Parallelism

283

30 Nov 2023

XLB: A differentiable massively parallel lattice Boltzmann library in PythonComputer Physics Communications (CPC), 2023

Mohammadmehdi Ataei

H. Salehipour

AI4CE

457

27 Nov 2023

Neuromorphic Intermediate Representation: A Unified Instruction Set for Interoperable Brain-Inspired ComputingNature Communications (Nat. Commun.), 2023

...

342

24 Nov 2023

CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs

407

16 Nov 2023

Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVMACM Transactions on Mathematical Software (TOMS), 2023

Enrique S. Quintana-Ortí

224

31 Oct 2023

Tackling the Matrix Multiplication Micro-kernel Generation with ExoIEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2023

249

26 Oct 2023

GEVO-ML: Optimizing Machine Learning Code with Evolutionary Computation

273

16 Oct 2023

SimplePIM: A Software Framework for Productive and Efficient Processing-in-MemoryInternational Conference on Parallel Architectures and Compilation Techniques (PACT), 2023

207

03 Oct 2023

A Portable Framework for Accelerating Stencil Computations on Modern Node Architectures

117

09 Sep 2023

On the Tool Manipulation Capability of Open-source Large Language Models

371

110

25 May 2023

ACRoBat: Optimizing Auto-batching of Dynamic Deep Learning at Compile TimeConference on Machine Learning and Systems (MLSys), 2023

235

17 May 2023

Experiences in Building a Composable and Functional API for Runtime SPIR-V Code Generation

J. Fumero

György Réthy

Athanasios Stratikopoulos

N. Foutris

Christos Kotselidis

117

16 May 2023

Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by Adaptive Group-Scheduling for Micro-Batches

153

03 Mar 2023

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform

Ziji Shi

Zhen Zheng

Chuan Wu

W. Lin

AI4CE

211

16 Feb 2023

OpenHLS: High-Level Synthesis for Low-Latency Deep Neural Networks for Experimental Science

354

13 Feb 2023

CMLCompiler: A Unified Compiler for Classical Machine LearningInternational Conference on Supercomputing (ICS), 2023

311

31 Jan 2023

oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning CompilationIEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2023

Zhennan Qin

...

276

03 Jan 2023

Python FPGA Programming with Data-Centric Multi-Level Design

Johannes de Fine Licht

Carl-Johannes Johnsen

Torsten Hoefler

278

28 Dec 2022

On Physics-Informed Neural Networks for Quantum ComputersFrontiers in Applied Mathematics and Statistics (FAMS), 2022

Stefano Markidis

PINN

296

28 Sep 2022

Optimizing DNN Compilation for Distributed Training with Joint OP and Tensor FusionIEEE Transactions on Parallel and Distributed Systems (TPDS), 2022

Zhen Zheng

170

26 Sep 2022

Programming Autonomous MachinesInternational Conference on Embedded Software (EMSOFT), 2022

131

06 Sep 2022

GraphQ IR: Unifying the Semantic Parsing of Graph Query Languages with One Intermediate RepresentationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Juanzi Li

215

24 May 2022

Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML SystemsIEEE VLSI Test Symposium (VTS), 2022

Shail Dave

Alberto Marchisio

Muhammad Abdullah Hanif

288

18 Apr 2022

Query Processing on Tensor Computation RuntimesProceedings of the VLDB Endowment (PVLDB), 2022

Jesús Camacho-Rodríguez

Konstantinos Karanasos

Matteo Interlandi

475

03 Mar 2022

Memory Planning for Deep Neural Networks

Maksim Levental

210

23 Feb 2022

Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation

Yinlin Deng

164

21 Feb 2022

Implementing Spiking Neural Networks on Neuromorphic Architectures: A Review

202

17 Feb 2022

HECO: Fully Homomorphic Encryption CompilerUSENIX Security Symposium (USENIX Security), 2022

424

03 Feb 2022

Compiler-Driven Simulation of Reconfigurable Hardware AcceleratorsInternational Symposium on High-Performance Computer Architecture (HPCA), 2022

191

01 Feb 2022

Lifting C Semantics for Dataflow OptimizationInternational Conference on Supercomputing (ICS), 2021

A. Calotoiu

Tal Ben-Nun

Grzegorz Kwa'sniewski

Johannes de Fine Licht

Timo Schneider

Philipp Schaad

Torsten Hoefler

342

22 Dec 2021

Torch.fx: Practical Program Capture and Transformation for Deep Learning in Python

235

15 Dec 2021

A Highly Configurable Hardware/Software Stack for DNN Inference Acceleration

250

29 Nov 2021

A Data-Centric Optimization Framework for Machine LearningInternational Conference on Supercomputing (ICS), 2021

329

20 Oct 2021

DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator FusionACM Transactions on Architecture and Code Optimization (TACO) (TACO), 2020

327

208

30 Aug 2021