NVlabs / nvbitfi

Architecture-level Fault Injection Tool for GPU Application Resilience Evaluation

Expected runtime definition #12

Closed sergicuen closed 2 years ago

sergicuen commented 2 years ago

Hi all, I am trying to estimate the Expected_runtime used to define the Timeout fault category. The Expected_runtime measured during the normal_execution of the application is usually much shorter than the runtime measured by the tool when injecting faults (I guess the nvbitfi instrumentation causes the delay).

E.g.: Inj_count=1, App=mEle_Sz256_Blk32, Mode=inst_value, Group=7, EM=0, Time=83.747101, Outcome: Masked: other reasons

So using the normal_execution time in the apps list of params.py produces a lot of Timeouts. The other option is to use the maximum Time obtained in a DUMMY campaign, but in that case the results contain no Timeouts at all. What is the right way to estimate the Expected runtime? Thank you in advance.
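
For reference, the value in question is the per-app expected runtime set in the apps dictionary of params.py. A minimal sketch of such an entry (the field layout and values here are illustrative, not the exact upstream format):

```python
import os

# Illustrative sketch of an apps entry in scripts/params.py;
# the exact field order in the upstream file may differ.
NVBITFI_HOME = os.environ.get('NVBITFI_HOME', '.')

apps = {
    'mEle_Sz256_Blk32': [
        NVBITFI_HOME + '/test-apps/mEle',  # workload directory
        'mEle_Sz256_Blk32',                # binary name
        NVBITFI_HOME + '/test-apps/mEle',  # run directory
        5,                                 # expected runtime in seconds (the value in question)
        '',                                # additional command-line arguments
    ],
}
```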

sivahari commented 2 years ago

The application runtime with the error injection tool will depend on the kernel being instrumented. The best approach is the one you suggested: profile a dummy error-injection campaign (using https://github.com/NVlabs/nvbitfi/blob/master/injector/Makefile#L16). The scripts currently assume that the runtime with instrumentation (of one kernel in the application) will be 10x the uninstrumented runtime; see https://github.com/NVlabs/nvbitfi/blob/master/scripts/params.py#L31. If you use the profiled runtime from an injection campaign instead, you may want to lower this threshold to 2x (i.e., a hang is detected if the application runs 2x longer than anticipated). A sketch of the resulting check follows.
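
In other words, the hang check boils down to comparing the elapsed time against the expected runtime multiplied by that threshold. A minimal sketch of the idea (the constant and helper below are illustrative names, not the actual nvbitfi code):

```python
import subprocess

# Illustrative multiplier: ~10x when the expected runtime is the
# uninstrumented baseline, ~2x when it was profiled from a dummy
# injection campaign (i.e., with instrumentation already included).
TIMEOUT_THRESHOLD = 10

def run_with_hang_detection(cmd, expected_runtime_s):
    """Run one injection; classify it as a Timeout if it overruns the budget."""
    budget_s = expected_runtime_s * TIMEOUT_THRESHOLD
    try:
        subprocess.run(cmd, shell=True, timeout=budget_s, check=False)
        return 'completed'
    except subprocess.TimeoutExpired:
        return 'Timeout'  # treated as a hang
```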