IAHispano / Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.
https://applio.org
MIT License
1.29k stars 217 forks source link

Suggestion : Cache Dataset in VRAM suggestion (for high end GPUs) #447

Closed tomakorea closed 1 month ago

tomakorea commented 1 month ago

Because the batch size should be kept low at 4 or 8, the high end consumer GPUs with 16 or 24gb of VRAM are under-utilized. It could be great to find a usage for the large amount of Vram that is left-over, like main RVC branch using it for caching dataset. From what I understand, pushing the batch size at 16 or 24 (even for large datasets of 1 hour) isn't recommended, so it could be nice to find usage of this Vram.

Desktop Details:

aitronssesin commented 1 month ago

Added https://github.com/IAHispano/Applio/commit/5e302727f2b844e33503afb6cdba3a1945c76527