Open STEMBytes opened 8 months ago
Hi, thanks for reporting the issue! Can you describe the exact error you observed?
You bet, here is the full error. I get the same message when trying to add to the 180B version from petals.dev. Below is part of the output; I removed PII such as my IP address from the full response.
[INFO] Model weights are loaded in bfloat16, quantized to nf4 format
Mar 18 23:21:44.649 [INFO] Server will fill your GPU memory with 60 transformer blocks. If you want to leave some free GPU memory, please specify a lesser --num_blocks manually
Mar 18 23:21:44.650 [INFO] Attention cache for all blocks will consume up to 1.88 GiB
Mar 18 23:21:44.650 [INFO] Loading throughput info
Mar 18 23:21:44.650 [INFO] Measuring network and compute throughput. This takes about a minute and will be cached for future runs
Traceback (most recent call last):
File "
I am using CUDA 11.7 (NVIDIA-SMI 515.65.01, driver version 515.65.01) and torch 2.0.1 on Ubuntu 22.04.
Let me know if you need any further details. If I revert to the prior version it loads fine. I have tried it on three machines with the same behavior.
@STEMBytes Thank you for writing a detailed report.
I'll investigate Falcon support as soon as I'm done with the current sprint (ETA tomorrow night). I will keep you posted here once I better understand how to fix that mask issue, ETA before this Friday AoE.
I am sorry, I got fatally tangled in ICML duties, and they are taking longer than expected. I am still working my way through the todo list to eventually repair this. I will fix Falcon as soon as I can.
No worries at all. I understand that the ICML work went well. Thanks for the follow-up, and let me know how I can help.
Hello, I believe the latest bump, "Bump transformers and accelerate versions (#554)", appears to have broken Falcon support. Falcon is an essential model given its open license (40B and smaller). I have verified that Falcon runs with an older version of Petals but fails with errors on the current Petals build.
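In case it helps narrow this down, here is a minimal sketch for recording the installed package versions in the working and failing environments and listing what changed between them. The helper names `collect_versions` and `diff_versions` are hypothetical, and the package list assumes the PyPI distribution names `petals`, `transformers`, `accelerate`, and `torch`; adjust it if your install differs.

```python
from importlib import metadata


def collect_versions(packages):
    """Return {distribution name: installed version, or None if missing}."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def diff_versions(working, failing):
    """Return {name: (working version, failing version)} where they differ."""
    names = sorted(set(working) | set(failing))
    return {
        name: (working.get(name), failing.get(name))
        for name in names
        if working.get(name) != failing.get(name)
    }


if __name__ == "__main__":
    # Assumed distribution names; not confirmed by anything in this thread.
    packages = ["petals", "transformers", "accelerate", "torch"]
    for name, version in collect_versions(packages).items():
        print(f"{name}: {version or 'not installed'}")
```

Running the script once in each environment and passing the two resulting dictionaries to `diff_versions` would show which packages differ between the build that loads Falcon and the one that does not.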