Fine-Tuning on multispeaker multilingual noisy dataset

Hi, Thanks for making your work publicly available.

I want to ask a few things:

Can I get average quality audios from the noisy audio dataset?
Even in a single audio file (in my dataset) there is more than one speaker speaking more than one language (English + Hindi) in a noisy environment, so will it be able to produce audios even from this type of dataset.
Which one of your pre-trained models can I use for this task?

Any help or suggestions is highly appreciated. Thank you!

jik876 / hifi-gan