JHGuitarFreak / UQM-MegaMod

A fork of The Ur-Quan Masters + HD-mod that remasters the HD graphics with a veritable smorgasbord of extra features, options, QoL improvements, and much more...
https://uqm-mods.sourceforge.net
GNU General Public License v2.0

Suggestion: Make upsampled/upscaled voicepack mod with AudioSR or similar software #244

Open dedbbs1 opened 5 months ago

dedbbs1 commented 5 months ago

I was wondering whether it would be possible for you, me, or someone better equipped to make mods to upscale/upsample the voices to 48 kHz quality with AudioSR (an Nvidia GPU may be required) - https://github.com/haoheliu/versatile_audio_super_resolution - or similar software, using AI and/or other means. I'd also like to know what making the mod would entail. Would simply replacing the original 11 kHz OGG with a 48 kHz OGG work? What about replacing an 11 kHz OGG with a 48 kHz FLAC? Would any other files need to be modified, or would it just be a "cut and paste" job? And is there interest in a modded audio pack that does this?

I pieced together the basics of how to use AudioSR. Starting from this file: https://drive.google.com/file/d/1bKv4SrZQ4ukuZEAI9nj3jJUUTzsiHPyN/view?usp=sharing I was able to make this using the speech model: https://drive.google.com/file/d/16YrDs0NvIL-1SiVuxxiIuGCFHJ2XSowf/view?usp=sharing. It's not perfect by any means, but to me it sounds much clearer and crisper, and less like it was recorded over a landline telephone, if that makes any sense. I'm sure someone with more audio editing experience could make it sound even better by tweaking each file individually, but to me the results are impressive.

If you - JHGuitarFreak - or someone else wants to take it from where I've started and run with it, that's fine. I could probably do it myself, but I bet someone else could do it much better and much faster than I could. Sorry if I'm posting this in the wrong place; I just didn't know where else to put this suggestion.
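
To make the "cut and paste" idea above a bit more concrete, here is a rough Python sketch of how a replacement pack could be laid out. It only plans the work: it pairs every original voice file with an identically named output path, and builds (but does not run) an ffmpeg command to encode an upscaled WAV as 48 kHz Ogg Vorbis. The function names and directory layout are made up for illustration, it assumes the upscaler writes WAV files, and whether the game actually accepts 48 kHz files is exactly the open question, so treat this as a starting point rather than a working recipe.

```python
from pathlib import Path


def plan_voice_pack(src_root, out_root, suffix=".ogg"):
    """Pair each voice file under src_root with the same relative path under out_root.

    Keeping names and folders identical is what would make the result a
    drop-in replacement (assumption: the game looks files up by path only).
    """
    src_root, out_root = Path(src_root), Path(out_root)
    return [
        (src, out_root / src.relative_to(src_root))
        for src in sorted(src_root.rglob(f"*{suffix}"))
    ]


def encode_command(upscaled_wav, dst_ogg, rate=48000):
    """Build an ffmpeg command that encodes a WAV as Ogg Vorbis.

    The command is returned as a list and is NOT executed here.
    """
    return [
        "ffmpeg", "-y",
        "-i", str(upscaled_wav),
        "-ar", str(rate),        # output sample rate in Hz
        "-c:a", "libvorbis",     # Ogg Vorbis encoder
        str(dst_ogg),
    ]
```

Each pair returned by `plan_voice_pack` gives the original file to feed to the upscaler and the path the finished file should end up at; the list from `encode_command` could then be passed to `subprocess.run` once the upscaled WAV exists.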