Open pshipton opened 4 weeks ago
@dmitripivkine fyi
There is no stored results unfortunately, so nothing to investigate yet.
Ya it's weird. The job reports the following, but there is nothing.
Attempting to upload compressed file.Upload appears to have been successful.
You could try more grinder, and also on the same machine where it failed.
grinders 10 jobs on the same machine/130 jobs all machines all passed.
Running more grinders. However I do not expect this problem to be reproduced. It is very unlikely that there is a regression, this code has not been touched for years. Memory corruption or machine threads sync problem is possible.
Another set of grinders (100 jobs across all machines and 100 jobs on the same machine) has passed. Looks like it is hard to reproduce, we are going to wait when is failed again in testing.
Looking to the code closely there is no logical possibility for GC threads to discover unmarked object and request work packet for it (reason for triggering the assertion). However new object can be added by non-GC thread by calling Write Barrier. This code is executed under Exclusive access (STW), so there is no valid way for Write Barrier to be called. This scenario can be investigated - is there possibility for mutator thread without VMAccess or Compilation thread to call Write Barrier. Lost threads synchronization and memory corruption still be potential scenarios as well.
Internal build [AIX] 80 Load_Level_2.harmony.5mins.Mode301 -
-Xgcpolicy:metronome -Xnocompressedrefs
Note 32-bit aix72p9-1050x grinder - passed