-
In Keras NLP **t5 model** the architecture and weights are present, but **HL workflows are missing**
I would like to contribute a **high-level masked language modeling** workflow in the [t5 model](…
-
Hello,
Thanks for sharing you experience on this subject.
I've got an issue running insurance_qa_eval.py. Here's the full stacktrace :
`Traceback (most recent call last):
File "insurance_qa_eval…
-
Train on 16686 samples, validate on 1854 samples
Epoch 1/1
16686/16686 [==============================] - 1s - loss: 0.0060 - val_loss: 0.0340Fitting epoch 2000
2016-10-28 07:51:05 -- Epoch 1999 Loss…
-
```
json_string = tweet_model.to_json()
open(r'models\tweet_model_architecture.json', 'w', encoding = 'utf-8').write(json_string)
tweet_model.save_weights(r'models\tweet_model_weights.h5',overwrite = …
-
Hi, I'm running BIG-Bench Lite tasks, succeeded in running 22 of 24 tasks, and for the two tasks language_identification and logic_grid_puzzle I get segmentation fault.
I am using the main branch of …
-
**Is your feature request related to a problem? Please describe.**
In the [BERT data creation script](https://github.com/keras-team/keras-nlp/blob/master/examples/bert_pretraining/bert_create_pretr…
-
-
We need to convert keras.io examples to work with Keras 3.
This involves two stages:
## Stage 1: tf.keras backwards compatibility check
Keras 3 is intended as a drop-in replacement for tf.ker…
-
-
While looking around, I found [this paper on the ELECTRA model](https://arxiv.org/abs/2003.10555), it shows that replacing MLM with RTD gave them better GLUE scores than BERT. Might be worth taking no…