VIDA-NYU / alpha-automl

Alpha-AutoML is a Python library for automatically generating end-to-end machine learning pipelines.
https://alpha-automl.readthedocs.io
Apache License 2.0

add cull for unused/inactive server #87

Closed. EdenWuyifan closed this issue 1 year ago.

EdenWuyifan commented 1 year ago

Okay, thank you! I will do that.

On Wed, Nov 29, 2023 at 15:02, Roque Lopez [rlopez@nyu.edu](mailto:rlopez@nyu.edu) wrote:

Hi Eden,

As far as I know, members of the D3M project are not using these containers at that rate. My impression is that these are zombie containers: users started them, but they were never shut down properly. Feel free to get rid of them, and please update the configs to limit the container lifetime to 24 hours. Thanks!

Regards,

On Wed, Nov 29, 2023 at 2:25 PM Eden Wu [yfw215@nyu.edu](mailto:yfw215@nyu.edu) wrote:

Hi Roque,

Rémi emailed me about this high volume container usage issue on our k8s cluster. Do you have any idea what this task is for? <Screenshot from 2023-11-29 14-21-51.png>

Eden

---------- Forwarded message ---------
From: Rémi Rampin [remi.rampin@nyu.edu](mailto:remi.rampin@nyu.edu)
Date: Wed, Nov 29, 2023 at 14:10
Subject: HSRN Kubernetes: High number of volumes in 'alphad3m'
To: Eden Wu [yfw215@nyu.edu](mailto:yfw215@nyu.edu)
Cc: [hsrn-staff@nyu.edu](mailto:hsrn-staff@nyu.edu)

Hi Eden,

There is a high number of PersistentVolumeClaims accumulating in your namespace 'alphad3m'. They follow the format 'claim-'. As I'm writing this email, there are 48 volumes total.

Are all those volumes required? Could there be some configuration issue causing those volumes to accumulate?

Thanks

-- Rémi Rampin, Senior Research Network Engineer, High Speed Research Network, NYU Research and Instructional Technology

-- Yifan (Eden) Wu Master of Computer Science New York University

+1 6467061025 | eden.wu@nyu.edu

-- Roque Lopez


roquelopez commented 1 year ago

It looks good to me, thanks Eden! Please, set it to a 1-hour timeout.
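For reference, the culling rule discussed in this thread (a 1-hour idle timeout plus the 24-hour lifetime cap mentioned in the email) can be sketched in a few lines of Python. This is only an illustration of the selection logic, not the configuration actually applied to the cluster: the `Server` record, its field names, and the `servers_to_cull` function are hypothetical, and a real deployment would read start and activity times from whatever system manages the containers.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List


@dataclass
class Server:
    # Hypothetical record for one user server; field names are illustrative only.
    name: str
    started: datetime
    last_activity: datetime


def servers_to_cull(
    servers: List[Server],
    now: datetime,
    idle_timeout: timedelta = timedelta(hours=1),   # timeout requested in this issue
    max_age: timedelta = timedelta(hours=24),       # lifetime cap from the email thread
) -> List[str]:
    """Return the names of servers that are idle too long or older than max_age."""
    to_cull = []
    for server in servers:
        idle_for = now - server.last_activity
        age = now - server.started
        # Cull when either limit is reached; an active but very old server
        # is still removed by the lifetime cap.
        if idle_for >= idle_timeout or age >= max_age:
            to_cull.append(server.name)
    return to_cull
```

Passing `now` explicitly rather than reading the clock inside the function keeps the rule easy to test. Note that removing a server does not necessarily remove its PersistentVolumeClaim, so the accumulated 'claim-' volumes may still need a separate cleanup.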