Please use this identifier to cite or link to this item: https://hdl.handle.net/11147/14188
Title: Regional soft error vulnerability and error propagation analysis for GPGPU applications
Authors: Öz, I.
Karadaş, Ö.F.
Keywords: Fault injection
GPGPU programs
Soft error reliability
Fault tolerance
Program processors
Radiation hardening
Reliability analysis
Error propagation analysis
Fault tolerance techniques
General-purpose computations
Graphics programs
Performance and reliabilities
Reliability level
Reliable execution
Vulnerability analysis
Error correction
Publisher: Springer
Abstract: The wide use of GPUs for general-purpose computations as well as graphics programs makes soft errors a critical concern. Evaluating the soft error vulnerability of GPGPU programs and employing efficient fault tolerance techniques for more reliable execution become more important. Protecting only the most error-sensitive program regions maintains an acceptable reliability level by eliminating the large performance overheads due to redundant operations. Therefore, fine-grained regional soft error vulnerability analysis is crucial for the systems targeting both performance and reliability. In this work, we present a regional fault injection framework and perform a detailed error propagation analysis to evaluate the soft error vulnerability of GPGPU applications. We evaluate both intra-kernel and inter-kernel vulnerabilities for a set of programs and quantify the severity of the data corruptions by considering metrics other than SDC rates. Our experimental study demonstrates that the code regions inside GPGPU programs exhibit different characteristics in terms of soft error vulnerability and the soft errors corrupting the variables propagate into the program output in several ways. We present the potential impact of our analysis by discussing the usage scenarios after we compile our observations acquired from our empirical work. © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
URI: https://doi.org/10.1007/s11227-021-04026-6
https://hdl.handle.net/11147/14188
ISSN: 0920-8542
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Show full item record



CORE Recommender

SCOPUSTM   
Citations

4
checked on Apr 5, 2024

Page view(s)

30
checked on Apr 22, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.