beacon-biosignals / K8sClusterManagers.jl

A Julia cluster manager for Kubernetes
Other
31 stars 5 forks source link

Report abnormal worker terminations #52

Closed omus closed 3 years ago

omus commented 3 years ago

Fixes https://github.com/beacon-biosignals/K8sClusterManagers.jl/issues/15. If a worker fails for any reason we now will report the termination reason specified by Kubernetes. This should be useful for cluster users by providing them a basic reason as to why their worker was terminated. Specifically this feature was added to inform users of workers being terminated due to running into memory limits (OOM).

codecov[bot] commented 3 years ago

Codecov Report

Merging #52 (0dd0d25) into main (ab200f8) will decrease coverage by 0.82%. The diff coverage is 62.50%.

Impacted file tree graph

@@            Coverage Diff             @@
##             main      #52      +/-   ##
==========================================
- Coverage   68.45%   67.63%   -0.83%     
==========================================
  Files           4        4              
  Lines         149      173      +24     
==========================================
+ Hits          102      117      +15     
- Misses         47       56       +9     
Impacted Files Coverage Δ
src/native_driver.jl 7.01% <0.00%> (-1.32%) :arrow_down:
src/pod.jl 96.87% <100.00%> (+0.57%) :arrow_up:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update ab200f8...0dd0d25. Read the comment docs.