Bridging the Architecture Gap: Abstracting Performance-Relevant
Properties of Modern Server ProcessorsSupercomputing Frontiers and Innovations (SuperFri), 2019 |
Massively Parallel Algorithms for the Lattice Boltzmann Method on
Non-uniform GridsSIAM Journal on Scientific Computing (SISC), 2015 |
Multicore-optimized wavefront diamond blocking for optimizing stencil
updatesSIAM Journal on Scientific Computing (SISC), 2014 |
Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann
CFD simulationsConcurrency and Computation (CCPE), 2013 |