Closed vigosser closed 3 years ago
you should check your config.json file. the error information suggest that your vocabulary size is 21128, however, your some inputs("token_id") in "inputs_ids" exceed the vocabulary size.
and Tinybert that we released now, is trained only in english corpus.
The ERROR happened during task-specific distill, Traceback is in the END. Fine-turn Bert model was generated using transformer package using the bert-base-chinese model, which included in the transformer package.
Is that because the release of TinyBERT's model trained using corpus without Chinese?
Fine-turn command using transformer as follow:
Traceback