tomchang25 / whisper-auto-transcribe

Auto transcribe tool based on whisper
MIT License
205 stars 14 forks source link

Very High RAM usage #39

Closed Milincho closed 1 year ago

Milincho commented 1 year ago

It used up 32GB of RAM up until the computer totally froze:

image

tomchang25 commented 1 year ago

What is the size of the file you are using, and do you enable a Vocal Extractor?

Milincho commented 1 year ago

It's 3.69GB.

image

I don't know how to enable/disable Vocal Extractor, so it depends if it is enabled by default or not. This is the CLI command I use for all my files:

python cli.py "File.ext" --output "File.English.srt" -lang ja --task translate --model large

It eats all the 32GB of RAM until Windows freezes, and then it goes all the way down. I also use a RTX 4080 with 16GB VRAM, but the problem is with RAM.

Milincho commented 1 year ago

With new 0.3.2b1 version:

At this point: image It used 28GB of RAM: image

At the end of the process: image

It was using 44GB of RAM: image

This is totally absurd. No other program ever uses that much RAM for anything.

Milincho commented 1 year ago

And it failed anyway: image

And it didn't clean up the temp files:

image

tomchang25 commented 1 year ago

And it didn't clean up the temp files:

The temporary file will be stored in the Windows temp directory and will be automatically deleted based on your Windows settings.

It was using 44GB of RAM, This is totally absurd. No other program ever uses that much RAM for anything.

The extremely high RAM usage in big files is caused by VE(Demucs), and there is nothing I can do about it. You can check the detailed instructions how to disable VE by running python .\cli.py -h.

Milincho commented 1 year ago

And it didn't clean up the temp files:

The temporary file will be stored in the Windows temp directory and will be automatically deleted based on your Windows settings.

This is not a Windows issue. Windows does not automatically delete the files in the Temp folder. It is not the Recycle Bin. It is up to each program to delete its own files. Again, having a queue of 100 jobs will fill up the disk with many HUNDREDS of GB in temp files that Windows don't know anything about. It is up to Whisper-auto-transcribe to delete those temp files immediately after each job is completed OR it fails.

It was using 44GB of RAM, This is totally absurd. No other program ever uses that much RAM for anything.

The extremely high RAM usage in big files is caused by VE(Demucs), and there is nothing I can do about it. You can check the detailed instructions how to disable VE by running python .\cli.py -h.

Then there is something wrong with that part and should be reported to the authors.

And cli.py -h doesn't say anything specific about how to disable VE... although I assume is forcing not to use the stable_whisper model?

image