Closed HMM2021 closed 3 years ago
You can create another tokenizer and embedding matrix for your own datasets.
Thank you for your answer. However, when i create another tokenizer and embedding matrix i have the error below:
RuntimeError: Error(s) in loading state_dict for AttentionRNN: size mismatch for emb.emb.weight: copying a param with shape torch.Size([697040, 300]) from checkpoint, the shape in current model is torch.Size([81651, 300]).
this is the whole error:
RuntimeError Traceback (most recent call last)
I think maybe I misunderstanded your question. The tokenizer and embedding matrix should be consistent for training and prediction. You need to retrain the model with your own tokenizer and embedding matrix.
Ok i got it Thank you so much
Hi,
I have a question about using AttentionXML model in production. Do we have to use the same tokenizer used for training in POC or we can create another tokenizer and embedding matrix in production?
Thank you in advance