ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.06085
14
0

Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities

9 May 2025
Hiari Pizzini Cavagna
Daniele Cesarini
Andrea Bartolini
ArXivPDFHTML
Abstract

The increasing demand for generative AI as Large Language Models (LLMs) services has driven the need for specialized hardware architectures that optimize computational efficiency and energy consumption. This paper evaluates the performance of the Tenstorrent Grayskull e75 RISC-V accelerator for basic linear algebra kernels at reduced numerical precision, a fundamental operation in LLM computations. We present a detailed characterization of Grayskull's execution model, gridsize, matrix dimensions, data formats, and numerical precision impact computational efficiency. Furthermore, we compare Grayskull's performance against state-of-the-art architectures with tensor acceleration, including Intel Sapphire Rapids processors and two NVIDIA GPUs (V100 and A100). Whilst NVIDIA GPUs dominate raw performance, Grayskull demonstrates a competitive trade-off between power consumption and computational throughput, reaching a peak of 1.55 TFLOPs/Watt with BF16.

View on arXiv
@article{cavagna2025_2505.06085,
  title={ Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities },
  author={ Hiari Pizzini Cavagna and Daniele Cesarini and Andrea Bartolini },
  journal={arXiv preprint arXiv:2505.06085},
  year={ 2025 }
}
Comments on this paper