ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05371
  4. Cited By
Input-Aware Auto-Tuning of Compute-Bound HPC Kernels

Input-Aware Auto-Tuning of Compute-Bound HPC Kernels

15 February 2018
Philippe Tillet
David D. Cox
ArXiv (abs)PDFHTML

Papers citing "Input-Aware Auto-Tuning of Compute-Bound HPC Kernels"

4 / 4 papers shown
Title
Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix
  Multiplication on the GPU
Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
Muhammad Osama
D. Merrill
C. Cecka
M. Garland
John Douglas Owens
61
28
0
09 Jan 2023
Using hardware performance counters to speed up autotuning convergence
  on GPUs
Using hardware performance counters to speed up autotuning convergence on GPUs
Jiri Filipovic
Jana Hozzová
A. Nezarat
Jaroslav Olha
Filip Petrovic
36
12
0
10 Feb 2021
SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning
  Inference
SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference
Ziheng Wang
86
68
0
26 Aug 2020
A model-driven approach for a new generation of adaptive libraries
A model-driven approach for a new generation of adaptive libraries
Marco Cianfriglia
Damiano Perri
C. Nugteren
Anton Lokhmotov
G. Fursin
74
14
0
19 Jun 2018
1