-
I have finetuned the model for 10 epochs. But it seems like the loss do not converge.
![image](https://github.com/amirhnazerii/Transformer_OD_TPU/assets/102167812/8be5736e-ba49-4252-9ae2-c3e5dc5d8d…
-
### Describe the issue
**Issue:**
I ran into tokenization mismatch errors when I tried to fine-tune from Llama-3.1. I pre-trained a new MLP adapter for Llama-3.1 and that seems to work, but the fine…
-
Hey guys,
I'm pretty new here just trying to figure all this out.
Finally managed to get my first finetuning running. But I'm kinda confused.
I'm using the thomas - medium model (german) for fine…
-
I was trying to finetune a model as mentioned in the docs, but after training, when I am trying to load the model, I get the following error:
```
[Error(s) in loading state_dict for SubwordBert:
s…
-
Good Job! Do you have a plan to support LoRA or other PEFT?
-
Hi,
Thanks for the great work. Is it possible to release the data (and codes) used to fine-tune the language model.
-
Hello,
I am trying to finetune the tapas_wtq_wikisql_sqa_masklm_medium_reset.
Just to see, if it works in general, I wanted to finetune it on the same data it's already trained on, WTQ. Creating …
-
First of all, great job on the project! I was wondering if there were any implemented functionality to finetune this model? Basically introducing another dataset to train on top of the already trained…
-
Should finetune all TDBN or just CRBM+LogReg?
At the moment (13 avril 2016), I try to put together all my models.
In other words, until now I train RBMs and use them to generate a new dataset (train,…
-
在finetune代码中,部分也加入了loss的计算,想请教下这样相比conditioning language modeling loss有什么特别的好处吗?