ylacombe / finetune-hf-vits

Finetune VITS and MMS using HuggingFace's tools
MIT License
123 stars 30 forks

You have to change the code a little as well. You have to comment out where speaker_id is being referenced in the code. I commented out 3 lines and it works fine for me without further modification; #42

Open gangagyatso4364 opened 1 month ago

gangagyatso4364 commented 1 month ago
You have to change the code a little as well. You have to comment out every place where speaker_id is referenced in the code. I commented out 3 lines and it works fine for me without further modification:

1-

    model_outputs = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
        labels=batch["labels"],
        labels_attention_mask=batch["labels_attention_mask"],
        # speaker_id=batch["speaker_id"],
        return_dict=True,
        monotonic_alignment_function=maximum_path,
    )

2-

    model_outputs_train = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
        labels=batch["labels"],
        labels_attention_mask=batch["labels_attention_mask"],
        # speaker_id=batch["speaker_id"],
        return_dict=True,
        monotonic_alignment_function=maximum_path,
    )

3-

    model_outputs_train = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
        labels=batch["labels"],
        labels_attention_mask=batch["labels_attention_mask"],
        # speaker_id=batch["speaker_id"],
        return_dict=True,
        monotonic_alignment_function=maximum_path,
    )
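
If you would rather not edit three call sites by hand, one option is to build the keyword arguments in a single helper and include speaker_id only when the batch actually has one. This is only a sketch, not code from the repository: `build_model_kwargs` is a hypothetical name, it assumes the batch is a plain dict, and the strings below stand in for real tensors.

```python
def build_model_kwargs(batch, monotonic_alignment_function):
    """Collect the arguments for the model call, adding speaker_id only if present."""
    kwargs = {
        "input_ids": batch["input_ids"],
        "attention_mask": batch["attention_mask"],
        "labels": batch["labels"],
        "labels_attention_mask": batch["labels_attention_mask"],
        "return_dict": True,
        "monotonic_alignment_function": monotonic_alignment_function,
    }
    # Single-speaker batches have no speaker_id, so the key is simply left out.
    speaker_id = batch.get("speaker_id")
    if speaker_id is not None:
        kwargs["speaker_id"] = speaker_id
    return kwargs


# Placeholder batches (hypothetical values standing in for tensors).
single_speaker_batch = {
    "input_ids": "ids",
    "attention_mask": "mask",
    "labels": "labels",
    "labels_attention_mask": "labels_mask",
}
multi_speaker_batch = dict(single_speaker_batch, speaker_id=3)

single_kwargs = build_model_kwargs(single_speaker_batch, None)
multi_kwargs = build_model_kwargs(multi_speaker_batch, None)
```

Each call would then look like `model(**build_model_kwargs(batch, maximum_path))`, so the same script should work with and without speaker IDs.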

Originally posted by @muhammadsaadgondal in https://github.com/ylacombe/finetune-hf-vits/issues/35#issuecomment-2282717976

gangagyatso4364 commented 1 month ago

Can you share the config file for training multiple speakers?

C0deXG commented 3 weeks ago

Can you share the config file for training multiple speakers?

Just remove every speaker_id or other speaker-related entry from the config file.
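
For illustration, that cleanup could be scripted roughly like this. It is a sketch under assumptions: `strip_speaker_keys` is a hypothetical helper, the key names in the example dict are placeholders and may not match your actual config, and it only looks at top-level keys.

```python
import json


def strip_speaker_keys(config):
    """Return a copy of the config without top-level keys that mention 'speaker'."""
    return {
        key: value
        for key, value in config.items()
        if "speaker" not in key.lower()
    }


# Example config (key names are assumptions, not copied from the repository).
example_config = {
    "dataset_name": "my_dataset",
    "speaker_id_column_name": "speaker_id",
    "override_speaker_embeddings": True,
    "learning_rate": 2e-5,
}

cleaned_config = strip_speaker_keys(example_config)
print(json.dumps(cleaned_config, indent=2))
```

In practice you would load your own JSON file with `json.load`, run it through the helper, and write the result back with `json.dump`.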