ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.12952
13
0

Accelerating Compact Fractals with Tensor Core GPUs

25 October 2021
Felipe A. Quezada
C. Navarro
ArXiv (abs)PDFHTML
Abstract

This work presents a GPU thread mapping approach that allows doing fast parallel stencil-like computations on discrete fractals using their compact representation. The intuition behind is to employ two GPU tensor-core accelerated thread maps, λ(ω)\lambda(\omega)λ(ω) and ν(ω)\nu(\omega)ν(ω), which act as threadspace-to-dataspace and dataspace-to-threadspace functions, respectively. By combining these maps, threads can access compact space and interact with their neighbors. The cost of the maps is O(log⁡log⁡(n))\mathcal{O}(\log \log(n))O(loglog(n)) time, with nnn being the side of a n×nn \times nn×n embedding for the fractal in its expanded form. The technique works on any fractal that belongs to the Non-overlapping-Bounding-Boxes (NBB) class of discrete fractals, and can be extended to three dimensions as well. Results using an A100 GPU on the Sierpinski Triangle as a case study show up to ∼11×\sim11\times∼11× of speedup and a memory usage reduction of 234×234\times234× with respect to a Bounding Box approach. These results show that the proposed compact approach can allow the scientific community to tackle larger problems that did not fit in GPU memory before, and run even faster than a bounding box approach.

View on arXiv
Comments on this paper