one-some / KoboldAI-united

GNU Affero General Public License v3.0

Models don't cleanly unload #48

Open one-some opened 9 months ago

one-some commented 9 months ago

Currently we unload models by manually overwriting the tensors found on the GPU with empty data. This causes GPU-side memory access errors with 4-bit models. Ideally we would get rid of this and unload models like normal people.
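
For reference, a minimal sketch of what a "normal" unload could look like in PyTorch: drop every strong reference to the model, run the garbage collector, then ask the CUDA caching allocator to release its unused blocks. `ModelHolder`, `DummyModel`, and `demo` are hypothetical names made up for illustration, not anything in this repo, and the sketch assumes nothing else (generation caches, globals, hooks) still holds a reference to the model.

```python
import gc
import weakref

try:
    import torch  # optional; only used to release cached CUDA memory
except ImportError:
    torch = None


class ModelHolder:
    """Hypothetical stand-in for whatever object owns the loaded model."""

    def __init__(self, model):
        self.model = model

    def unload(self):
        # Drop the strong reference instead of overwriting tensor data.
        self.model = None
        # Collect reference cycles that might still keep tensors alive.
        gc.collect()
        # Hand cached, now-unused blocks back to the driver if CUDA is present.
        if torch is not None and torch.cuda.is_available():
            torch.cuda.empty_cache()


class DummyModel:
    """Placeholder for a real torch.nn.Module."""


def demo():
    holder = ModelHolder(DummyModel())
    ref = weakref.ref(holder.model)  # lets us observe when the model is freed
    holder.unload()
    return ref() is None  # True once no strong references remain
```

If the weak reference is still alive after `unload()`, something else is holding the model, and that reference is what needs tracking down. Whether this alone is enough for 4-bit models is untested here.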