-
Thanks for this wonderful notebook. Are there any future plans to update and fix the attention model. It's just that there aren't any good attention model implementation in keras out there.
-
**Describe the bug**
I tried to train Stanza for Tamil, and mwt training always (tried with different data set) breaks at 33rd epoch.
Log:
2020-08-20 09:24:11: step 360/1100 (epoch 33/100), loss = …
-
Hi,
When I am trying validation for small questions in the GECOR model and I am getting repeated words in the final output. I will provide couple of examples:
1:
user : What is the price of pi…
-
Please go to Stack Overflow for help and support:
http://stackoverflow.com/questions/tagged/tensorflow
If you open a GitHub issue, here is our policy:
1. It must be a bug or a feature request…
-
# ❓ Questions & Help
## Details
-
Trying master on another machine I get:
```
Traceback (most recent call last):
File "examples/seq2seq/run_seq2seq.py", line 650, in
main()
File "examples/seq2seq/run_seq2seq.py", line 590,…
-
## Environment info
- `transformers` version: 3.3.1
- Platform: Linux-5.4.38-t2.el7.x86_64-x86_64-with-centos-7.7.1908-Core
- Python version: 3.7.9
- PyTorch version (GPU?): 1.4.0+cu100 (…
-
## Environment info
- `transformers` version: 4.5.0.dev0
- Platform: Linux-4.14.225-121.362.amzn1.x86_64-x86_64-with-glibc2.9
- Python version: 3.6.13
- PyTorch version (GPU?): 1.8.1+cu102 (True)
…
-
- `transformers` version: 3.0.0
- Platform: windows
- Python version: 3.6.10 :: Anaconda, Inc.
- PyTorch version (GPU?): 1.7.0+cu101
- Tensorflow version (GPU?):
- Using GPU in script?:
-…
-
## Environment info
- `transformers` version: 4.3.3
- Platform: Linux-4.15.0-109-generic-x86_64-with-debian-buster-sid
- Python version: 3.6.13
- PyTorch version (GPU?): 1.7.1 (True)
- Tensor…