Open Chrakimnas6 opened 3 years ago
XLNet and (likely GPT2) currently don't work, as they use a different padding strategy, which is currently not supported within the batching strategy that is used here.
In the upcoming version 0.4.1, tokenization and padding will change and it is likely that XLNet will work (and GPT2 maybe also, never used GPT2).
However, I did test with XLNet and it was not producing any good results. In all my experiments it performed so far quite badly.
Thank you for your reply!
Hi,
Currently I'm using training_nli.py directly and try to test different pretrained models from huggingface. Some models are fine but I met two problems with xlnet and gpt-2.
First, I used 'xlnet-base-cased' and when I train the model it says:
As a result, it only generates 'similarity_evaluation_sts-dev_results.csv' in the output file and all the values are 0 in the csv.
Second, I also used 'gpt2' and it gives me:
Not really sure how to fix them, would be really appreciated if someone could help me out. Thanks.