-
I am trying to fine-tune XLNet model and it used work fine but I think huggingface update some classes and I ran through this error:
RuntimeError: Trying to create tensor with negative dimension -1…
-
Hi:
Thanks for this repo. Text generation is my main interest and I was wondering how the xlnet large model can be fine tuned with new text then used as a model in XLnet-gen using language_generati…
-
-
Hi,
How many parameters for Xlnet_large and Xlnet_base?
Thanks
Mahesh
-
Hi,
Setting Dropout in production results in random embeddings. This has been discussed in the main repo if you search. To reproduce this, just run the same string more than once, each time you'll s…
-
Thanks again for providing this repository and actively maintaining it. Do you have performance of XLNet and Longformer on the 2010 i2b2 test set, 2012 i2b2 test set, and/or 2018 n2c2 test set readily…
-
I finished Simple XLNet implementation with Pytorch Wrapper
You can see How XLNet Architecture work in pre-training with small batch size(=1) example.
Also I added comment in code, so you can learn …
-
When I run XLnet colab example
https://github.com/zihangdai/xlnet/blob/master/notebooks/colab_imdb_gpu.ipynb
It showed the following error:
=========================
Traceback (most recent…
-
It was projected to be released June 2019. I am waiting eagerly for it.
-
Building on https://github.com/keras-team/keras-nlp/blob/master/keras_nlp/layers/preprocessing/mlm_mask_generator.py which dynamically masks tokens, I was wondering if we can implement a layer for how…