-
Hello there. This is a really fantastic project. I'm trying to extend your work to Bart but I've run into some strange behaviour.
I've made a [Colab notebook](https://colab.research.google.com/dri…
-
track
-
_ToDo: determine phd focus and scope_
Phd Funding project: https://www.tudelft.nl/en/2020/tu-delft/eur33m-research-funding-to-establish-trust-in-the-internet-economy
Duration: 1 Sep 2023 - 1 sep 2…
-
Hello,
Thank you for this great package!
I would like to know on which datasets and how the two models that are used when running `OmniEvent.infer` were fine-tuned. That is, the 2 models which …
-
# 🚀 Feature request
This is a discussion issue for training/fine-tuning very large transformer models. Recently, model parallelism was added for gpt2 and t5. The current implementation is for PyTor…
-
Is this the correct JSON ? Could you point out the way to download the weights ?
I should also point out that I ported your PyTorch code to TensorFlow. Changes are minimal though.
`{
"module"…
-
### System Info
Transformers version: 4.36.2
pytorch version: 2.1.1
Python version: 3.10.13
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [X] …
-
Hi, I'm encountering the following error while following the usage of the command. Could anyone kindly help me resolve this issue? I would really appreciate any assistance. Thank you!
Below are the…
-
I'm fine-tuning an mT5 small checkpoint on a custom mixture. For debugging, I've set `eval_period=25` and `utils.SaveCheckpointConfig.period=50` in finetune.gin.
The training job freezes inside `trai…
-
If I try to use the distilled model with Sentence Transformers, I am getting the following error:
```
---------------------------------------------------------------------------
AttributeError …