dask / distributed

A distributed task scheduler for Dask
https://distributed.dask.org
BSD 3-Clause "New" or "Revised" License
1.55k stars 712 forks source link

[P2P] Log event during failure #8663

Closed hendrikmakait closed 2 weeks ago

hendrikmakait commented 3 weeks ago

Adds a log event if a P2P operation fails while it's active.

github-actions[bot] commented 3 weeks ago

Unit Test Results

_See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests._

    29 files  ± 0      29 suites  ±0   11h 7m 20s :stopwatch: + 11m 27s  4 057 tests + 4   3 957 :white_check_mark: + 3     97 :zzz: ±0  3 :x: +1  55 883 runs  +56  53 716 :white_check_mark: +51  2 163 :zzz: +4  4 :x: +1 

For more details on these failures, see this check.

Results for commit f378b0ce. ± Comparison against base commit 9fae5dac.

This pull request removes 1 and adds 5 tests. Note that renamed tests count towards both. ``` distributed.shuffle.tests.test_shuffle ‑ test_error_receive ``` ``` distributed.http.scheduler.tests.test_scheduler_http ‑ test_prometheus_collect_task_groups distributed.shuffle.tests.test_shuffle ‑ test_error_receive[P2PConsistencyError] distributed.shuffle.tests.test_shuffle ‑ test_error_receive[ValueError] distributed.shuffle.tests.test_shuffle ‑ test_restarting_does_not_log_p2p_failed distributed.tests.test_worker ‑ test_suppress_compute_failure_for_cancelled_tasks ```

:recycle: This comment has been updated with latest results.