Closed ThomasBlock closed 2 weeks ago
update: it fixed itself.. ?
time="2024-06-17 10:51:04.277" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 6 GiB, freeStorage: 226 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952"
time="2024-06-17 10:57:04.278" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 31.63 GiB, freeStorage: 226.51 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952"
update: it fixed itself.. ?
time="2024-06-17 10:51:04.277" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 6 GiB, freeStorage: 226 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952" time="2024-06-17 10:57:04.278" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 31.63 GiB, freeStorage: 226.51 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952"
update: it fixed itself.. ?
time="2024-06-17 10:51:04.277" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 6 GiB, freeStorage: 226 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952" time="2024-06-17 10:57:04.278" level=info msg="collect hardware resource, freeCpu:24, freeMemory: 31.63 GiB, freeStorage: 226.51 GiB, freeGpu: map[NVIDIA 4070 SUPER:1]" func=reportClusterResourceForDocker file="ubi.go:952"
The bug of resource acquisition has been fixed
The ECP engine is being repaired because of the instability of rpc, which leads to the failure of task sending. Currently, the 32G task and aleo task are not enabled, but will be enabled in the future.
ok thank you. i can confirm that jobs are working now