Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and
Analytical Model-driven Tuning MethodologiesSymposium on Computer Architecture and High Performance Computing (CAHPC), 2023 |
Towards a learning-based performance modeling for accelerating Deep
Neural NetworksCommunication Systems and Applications (CSA), 2019 |
C-for-Metal: High Performance SIMD Programming on Intel GPUsIEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2021 |
Performance portability through machine learning guided kernel selection
in SYCL librariesParallel Computing (PC), 2020 |
Towards automated kernel selection in machine learning systems: A SYCL
case studyIEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPS), 2020 |
A Benchmark Set of Highly-efficient CUDA and OpenCL Kernels and its
Dynamic Autotuning with Kernel Tuning ToolkitFuture generations computer systems (FGCS), 2019 |