Closed WinniyGD closed 3 years ago
These were for ablation studies. But I find some of them are not significant enough.
OK, could you share the 33237 vocabulary and it's dataset ?
You can download the idioms_pretrain.json.
Thx ! It was really helpful. 👍
Sry, I wanna know the relation between embedding index and idioms id. It means the 0-3847(index) embedding is ChID idioms embedding ? The file 'idioms_pretrain.json' does not show the idiom id. How to get the one-to-one correspondence?
Oh!Amazing!Thank for your patience!
Sry, I have one more question about the codes.
What's the difference or purpose among the following model classes:
@register_model('chengyubert-2stage-stage2-mask') @register_model('chengyubert-2stage-stage2-cls') @register_model('chengyubert-2stage-stage2-window') @register_model('chengyubert-2stage-stage2-mask-window')