Closed Luvata closed 4 years ago
I tested RobertaModel, and it also fails in the function extract_features_aligned_to_words.
My current version is fairseq 0.8.0.
Both models were loaded with from_pretrained from a '.gz' file downloaded via the README.
It's not supported. XLMR uses sentencepiece BPE, whereas RoBERTa uses the GPT-2 BPE. Unfortunately, extract_features_aligned_to_words doesn't have support for sentencepiece BPE yet.
cc @ngoyal2707
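To illustrate why the two BPE schemes need different handling, here is a minimal sketch of aligning subword pieces to words. It is not fairseq's implementation: the helper names (strip_marker, align_subwords_to_words) and the hand-written pieces are hypothetical. It only assumes that GPT-2 style vocabularies mark a leading space with "Ġ" while sentencepiece marks it with "▁".

```python
from typing import List


def strip_marker(pieces: List[str], marker: str) -> List[str]:
    """Remove the word-boundary marker from every subword piece."""
    return [p.replace(marker, "") for p in pieces]


def align_subwords_to_words(subwords: List[str], words: List[str]) -> List[List[int]]:
    """Greedy alignment: for each word, collect the indices of the
    consecutive subwords that spell it out."""
    # After marker stripping, both sequences must spell the same string.
    assert "".join(subwords) == "".join(words), "subwords and words disagree"
    alignment = []
    i = 0
    for word in words:
        indices = []
        remaining = word
        while remaining:
            piece = subwords[i]
            assert remaining.startswith(piece), "piece crosses a word boundary"
            remaining = remaining[len(piece):]
            indices.append(i)
            i += 1
        alignment.append(indices)
    return alignment


words = ["Hello", "world"]
# Hypothetical, hand-written pieces; real tokenizers may split differently.
gpt2_style = ["Hello", "Ġwor", "ld"]
spm_style = ["▁Hello", "▁wor", "ld"]

gpt2_alignment = align_subwords_to_words(strip_marker(gpt2_style, "Ġ"), words)
spm_alignment = align_subwords_to_words(strip_marker(spm_style, "▁"), words)

# Treating sentencepiece output as if it were GPT-2 BPE leaves the "▁"
# markers in place, so the consistency assertion fails.
try:
    align_subwords_to_words(strip_marker(spm_style, "Ġ"), words)
    wrong_marker_ok = True
except AssertionError:
    wrong_marker_ok = False
```

In this sketch, both schemes align once the matching marker is stripped, while applying the GPT-2 convention to sentencepiece pieces raises an AssertionError. That is one plausible way the error reported in this issue could arise.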
@myleott Actually, commit https://github.com/fairinternal/fairseq-py/commit/e8c0196e4927f77e980e4a15375bc6872066fb42#diff-c3ae106584251b0d35cc504bc481482e seems to have added stripping of the bos token in the string() call of dictionary.py, so this is kind of broken for both roberta and xlm-r.
Will send out a fix
Fix is merged to master
I get an AssertionError when running extract_features_aligned_to_words from XLMRModel. Is this a bug, or is there a difference between RoBERTa and XLMRModel?
Here is my code:
And the error: