Availability of using other BERT alternatives for current languages

gordon0414 commented 1 week ago

Hello, I was wondering whether we can replace the BERT models used for the currently supported languages:

    Languages.JP: BASE_DIR / "bert" / "deberta-v2-large-japanese-char-wwm",
    Languages.EN: BASE_DIR / "bert" / "deberta-v3-large",
    Languages.ZH: BASE_DIR / "bert" / "chinese-roberta-wwm-ext-large",

The current models are mostly large variants, but I think smaller BERT models would still be capable enough for our use case.

By smaller variants, I mean models such as:

English: microsoft/deberta-v3-small
Japanese: ku-nlp/deberta-v2-base-japanese-char-wwm
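
To make the proposed swap concrete, here is a minimal sketch of how the path mapping could be overridden. It assumes the mapping is a plain dict keyed by a `Languages` enum, as the snippet above suggests. The `Languages` stand-in, the `BASE_DIR` value, the local directory names under `bert/`, and the `with_overrides` helper are all hypothetical and not taken from the repository.

```python
from enum import Enum
from pathlib import Path


class Languages(str, Enum):
    # Stand-in for the project's Languages enum (assumed shape).
    JP = "JP"
    EN = "EN"
    ZH = "ZH"


BASE_DIR = Path(".")  # assumption: the project root

# Current defaults, as quoted in this issue.
DEFAULT_BERT_MODEL_PATHS = {
    Languages.JP: BASE_DIR / "bert" / "deberta-v2-large-japanese-char-wwm",
    Languages.EN: BASE_DIR / "bert" / "deberta-v3-large",
    Languages.ZH: BASE_DIR / "bert" / "chinese-roberta-wwm-ext-large",
}

# Hypothetical local directory names for the smaller variants.
SMALLER_BERT_DIRS = {
    Languages.JP: "deberta-v2-base-japanese-char-wwm",
    Languages.EN: "deberta-v3-small",
}


def with_overrides(defaults, overrides):
    """Return a copy of `defaults` with the given languages re-pointed."""
    paths = dict(defaults)
    for lang, dirname in overrides.items():
        paths[lang] = BASE_DIR / "bert" / dirname
    return paths


bert_model_paths = with_overrides(DEFAULT_BERT_MODEL_PATHS, SMALLER_BERT_DIRS)
```

Languages without an override (Chinese here) keep their current path, so the change stays limited to the models being tested.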

I suppose these variants won't cause any issues, since their tokenizers are identical to the original ones.
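
This assumption could be checked before fine-tuning. Below is a rough sketch of such a check: it compares two tokenizer vocabularies and also the encoder hidden sizes, because a smaller variant may output features of a different width even when the tokenizer matches. The `check_bert_swap` helper and the toy values are hypothetical. With Hugging Face `transformers`, the real inputs could come from `AutoTokenizer.from_pretrained(name).get_vocab()` and `AutoConfig.from_pretrained(name).hidden_size`.

```python
def check_bert_swap(old_vocab, new_vocab, old_hidden_size, new_hidden_size):
    """Return a list of human-readable incompatibilities (empty if none)."""
    problems = []
    if old_vocab != new_vocab:
        problems.append("tokenizer vocabularies differ")
    if old_hidden_size != new_hidden_size:
        problems.append(
            f"hidden size changes from {old_hidden_size} to {new_hidden_size}; "
            "any layer that consumes BERT features must be resized"
        )
    return problems


# Toy stand-ins for real tokenizer/config values (illustrative only).
toy_vocab = {"[PAD]": 0, "[UNK]": 1, "a": 2}
issues = check_bert_swap(toy_vocab, dict(toy_vocab), 1024, 768)
for issue in issues:
    print(issue)
```

An empty result would support the assumption. A hidden-size mismatch would mean the part of the model that takes BERT features as input needs a matching input dimension before fine-tuning.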

My hypothesis is that if we fine-tune our model with the new BERT models, they will integrate seamlessly with the current system.

litagin02 commented 1 week ago

I haven't checked, but it may work as you guess, so feel free to try it.

gordon0414 commented 1 week ago

Okay, maybe I'll try it this weekend!

litagin02 / Style-Bert-VITS2

Availability of using other BERT alternatives for current languages #145