ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.06896
  4. Cited By
Reinit++: Evaluating the Performance of Global-Restart Recovery Methods
  For MPI Fault Tolerance

Reinit++: Evaluating the Performance of Global-Restart Recovery Methods For MPI Fault Tolerance

13 February 2021
Giorgis Georgakoudis
Luanzheng Guo
Ignacio Laguna
    LRM
ArXiv (abs)PDFHTML

Papers citing "Reinit++: Evaluating the Performance of Global-Restart Recovery Methods For MPI Fault Tolerance"

4 / 4 papers shown
Title
Checkpoint-Restart Libraries Must Become More Fault Tolerant
Checkpoint-Restart Libraries Must Become More Fault Tolerant
Anthony Skjellum
Derek Schafer
34
0
0
20 Dec 2021
MATCH: An MPI Fault Tolerance Benchmark Suite
MATCH: An MPI Fault Tolerance Benchmark Suite
Luanzheng Guo
Giorgis Georgakoudis
K. Parasyris
Ignacio Laguna
Dong Li
27
7
0
13 Feb 2021
Towards Distributed Software Resilience in Asynchronous Many-Task
  Programming Models
Towards Distributed Software Resilience in Asynchronous Many-Task Programming Models
Nikunj Gupta
J. Mayo
Adrian S. Lemoine
Hartmut Kaiser
17
2
0
19 Oct 2020
PARIS: Predicting Application Resilience Using Machine Learning
PARIS: Predicting Application Resilience Using Machine Learning
Luanzheng Guo
Dong Li
Ignacio Laguna
AI4CE
24
25
0
07 Dec 2018
1