ROCm / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
BSD 3-Clause "New" or "Revised" License
17 stars 14 forks source link

Add a wrapper to skip flaky tests and un-skip some MLP unit tests #86

Closed hubertlu-tw closed 2 years ago

hubertlu-tw commented 2 years ago

Un-skipped tests:

Skipped flaky tests:

hubertlu-tw commented 2 years ago

rocm-pytorch-master failed synced_batchnorm/test_batchnorm1d.py in run_rocm_distributed.sh. The error message is as follows:

Running syncbn batchnorm1d tests
Traceback (most recent call last):
  File "synced_batchnorm/test_batchnorm1d.py", line 16, in <module>
    assert(output.allclose(output_ref))
AssertionError
Build step 'Execute shell' marked build as failure

This test may be marked as flaky test in the future.