Open HaloKim opened 1 year ago
@HaloKim this is interesting!
I have a question regarding the testing methodology: did you run both the docker notebook and the KF Notebook in the same node on your onpremise cluster?
Want to rule out that it could be caused by the node itself and understand if it's an issue related specifically to Charmed Kubeflow.
@HaloKim this is interesting!
I have a question regarding the testing methodology: did you run both the docker notebook and the KF Notebook in the same node on your onpremise cluster?
Want to rule out that it could be caused by the node itself and understand if it's an issue related specifically to Charmed Kubeflow.
Sorry, I was thinking wrong. Higher is better, but when I checked again, I was using "torch.backends.cuda.matmul.allow_tf32=False" in docker. However, it doesn't seem like a big problem, but in KF jupyter, "torch.backends.cuda.matmul.allow_tf32" has little difference in speed whether it is True or False.
Thank you for your reply.
Hello, I am running charmed kubeflow onpremise.
I have question.
There is a big difference in gpu speed between docker and kubeflow jupyter, but I don't know the cause.
I run this code,
Kubeflow jupyter output
Local Docker output
Server env
No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 22.04.1 LTS Release: 22.04 Codename: jammy
Driver Version: 515.86.01 CUDA Version: 11.7 cuda_11.8.r11.8 cudnn 8.4.1
Client Version: v1.24.13-2+cd9733de84ad4b Kustomize Version: v4.5.4 Server Version: v1.24.13-2+cd9733de84ad4b
charmed kubeflow 1.7