NOTE! This site uses cookies and similar technologies.

If you not change browser settings, you agree to it. Learn more

I understand
Tuesday, 18 July 2017 06:40

MeRLiN: Exploiting Dynamic Instruction Behavior for Fast and Accurate Microarchitecture Level Reliability Assessment

Rate this item
(0 votes)

Manolis Kaliorakis, Dimitris Gizopoulos, Ramon Canal, and Antonio Gonzalez. 2017. In Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA '17). ACM, New York, NY, USA, 241-254. DOI: https://doi.org/10.1145/3079856.3080225

DOI

Abstract

Abstract: Early reliability assessment of hardware structures using microarchitecture level simulators can effectively guide major error protection decisions in microprocessor design. Statistical fault injection on microarchitectural structures modeled in performance simulators is an accurate method to measure their Architectural Vulnerability Factor (AVF) but requires excessively long campaigns to obtain high statistical significance. We propose MeRLiN1, a methodology to boost microarchitecture level injection-based reliability assessment by several orders of magnitude and keep the accuracy of the assessment unaffected even for large injection campaigns with very high statistical significance. The core of MeRLiN is the grouping of faults of an initial list in equivalent classes. All faults in the same group target equivalent vulnerable intervals of program execution ending up to the same static instruction that reads the faulty entries. Faults in the same group occur in different times and entries of a structure and it is extremely likely that they all have the same effect in program execution; thus, fault injection is performed only on a few representatives from each group. We evaluate MeRLiN for different sizes of the physical register file, the store queue and the first level data cache of a contemporary microarchitecture running MiBench and SPEC CPU2006 benchmarks. For all our experiments, MeRLiN is from 2 to 3 orders of magnitude faster than an extremely high statistical significant injection campaign, reporting the same reliability measurements with negligible loss of accuracy. Finally, we theoretically analyze MeRLiN's statistical behavior to further justify its accuracy.

Details

  • BIBTEX:
@inproceedings{Kaliorakis:2017:MED:3079856.3080225,
 author = {Kaliorakis, Manolis and Gizopoulos, Dimitris and Canal, Ramon and Gonzalez, Antonio},
 title = {MeRLiN: Exploiting Dynamic Instruction Behavior for Fast and Accurate Microarchitecture Level Reliability Assessment},
 booktitle = {Proceedings of the 44th Annual International Symposium on Computer Architecture},
 series = {ISCA '17},
 year = {2017},
 isbn = {978-1-4503-4892-8},
 location = {Toronto, ON, Canada},
 pages = {241--254},
 numpages = {14},
 url = {https://doi.acm.org/10.1145/3079856.3080225},
 doi = {10.1145/3079856.3080225},
 acmid = {3080225},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {Microarchitecture level reliability estimation, architectural vulnerability factor, fault injection, transient faults},
} 
  • 10.1145/3079856.3080225
  • KEYWORDS: Microarchitecture level reliability estimation, architectural vulnerability factor, fault injection, transient faults
Read 2319 times Last modified on Tuesday, 18 July 2017 06:44

Add comment


Security code
Refresh