Project-HAMi / HAMi

Heterogeneous AI Computing Virtualization Middleware
http://project-hami.io/
Apache License 2.0
649 stars 147 forks source link

Optimize scheduler logic and remove nodelock #450

Open lengrongfu opened 3 weeks ago

lengrongfu commented 3 weeks ago

Currently, there are some problems with using nodelock that cannot be solved.

Current hami scheduler use nodelock to implement serial bind pod, and release the nodelock only after device-plugin allocate device succeeds, but

archlitchi commented 3 weeks ago

yes, it seems ascend-for-volcano use a FIFO strategy with 'bind-time timestamp' instead of using nodelock, and works properly, we can try that as well