efeslab / Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
259 stars 21 forks source link

LLM model load hanging problem #18

Closed jimmy-adams closed 2 months ago

jimmy-adams commented 2 months ago

Hello, When i follow the guidance and try reproduce the result, i encounter the problem shown in the screening below.

1719457102248
happierpig commented 2 months ago

Hi @jimmy-adams ,

Could you please provide more information like the dependent library env and the GPU types? so that we can reproduce this issue. Thanks!

jimmy-adams commented 2 months ago

Hi @jimmy-adams ,

Could you please provide more information like the dependent library env and the GPU types? so that we can reproduce this issue. Thanks!

Dear author, Thank you for your reply. I think i missed some dependency lib in my computer, now it is working.