Closed dyhan316 closed 5 months ago
Hey,
Imo this repo is a nice starting point as it has some basic training functionality implemented and exposed. Things you need to do are changing your model and the dataset, but this should be fairly easy. On the other hand, if you were to take the copying from HF route then it would be more work as model implementations from HF are really huge as they implement a lot of extra functionalities that you don't need. In this repo you have roughly minimal implementation of a T5 model which makes it a better starting point imo.
Hello,
It is my first time using transformers, I wanted to ask a few questions on how I can implement a custom transformer fast with minimal effort.
Do you think I should use your code as a starting point to create a custom model that :
Or should I just copy the T5 model code from hugging face and try to customize it from there using PyTorch? (I am already familiar with PyTorch (but not transformers or hugging face or etc, as it I only used CNNs )
Any advice would be greatly appreciated :)