Open tedcar opened 4 days ago
Hi, @tedcar, thanks for your suggestions! We will further analyze these issues to improve the user experience. For loading behavior, the script will download files from HF the first time you run it, so it might take some time. On subsequent runs, it will read the files directly from the local storage if they are available, so it won't need to download them again.
Appreciate that. This is still cutting-edge technology and I appreciate you guys making this open source
I'm experiencing inconsistent behavior while using an NVIDIA L4 GPU. The main issues are:
SafeTensors Loading Problem:
Installation Method Issues:
Environment Details:
VRAM Usage Concerns:
As someone who isn't particularly technical, I'd appreciate if there was a built-in feature to control VRAM allocation for better performance. Could you please investigate these issues, particularly the inconsistent loading behavior and VRAM utilization?