stefan-it / turkish-bert

Turkish BERT/DistilBERT, ELECTRA and ConvBERT models

bert uncased tf checkpoints #29

Closed: hazalturkmen closed this issue 7 months ago

hazalturkmen commented 2 years ago

Hi @stefan-it! I need the bert-base-32k-uncased TF checkpoints for further pre-training on Cloud TPU. I found the cased version at this link:

wget https://schweter.eu/cloud/bert-base-turkish-cased/bert-base-turkish-cased-tf.tar.gz

Is it possible to get the 32k uncased version of the Turkish BERT model?

Thanks for sharing,

stefan-it commented 2 years ago

Hi @hazalturkmen ,

in this archive you can find the last 5 checkpoints from the uncased model (I think I've chosen the 2M checkpoint for the final model):

wget https://schweter.eu/cloud/bert-base-turkish-uncased/bert-base-turkish-uncased.tar.gz

Hope this helps :)

hazalturkmen commented 2 years ago

Thanks for sharing the model, Stefan! This is what I was looking for :)

How long did it take to train the model from scratch? Do you remember? It's okay if you don't :)

stefan-it commented 2 years ago

Hey @hazalturkmen ,

Sure, here you can find the TensorBoard for the complete training:

8.2, 21:40 to 15.2, 08:25. So the training took ~6 days and 13.5 hours for 2M steps on a v3-8 TPU :hugs:
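
For reference, the elapsed time can be recomputed from the two timestamps quoted above. This is only a sketch under a few assumptions: it needs GNU `date`, it reads "8.2" and "15.2" as 8 and 15 February, and the year 2021 is a placeholder (the thread does not state the year; since the span does not cross a leap day, the choice does not change the result). Taken literally, the timestamps work out to 6 days, 10 hours and 45 minutes, slightly less than the rounded figure quoted above.

```shell
#!/usr/bin/env bash
# Recompute the wall-clock training time from the two quoted timestamps.
# Assumptions: GNU date; "8.2"/"15.2" mean 8 and 15 February;
# the year 2021 is a placeholder, not taken from the thread.
start=$(date -u -d "2021-02-08 21:40" +%s)
end=$(date -u -d "2021-02-15 08:25" +%s)

elapsed=$((end - start))                      # total seconds
days=$((elapsed / 86400))                     # whole days
hours=$(( (elapsed % 86400) / 3600 ))         # remaining whole hours
minutes=$(( (elapsed % 3600) / 60 ))          # remaining whole minutes

echo "${days} days, ${hours} hours, ${minutes} minutes"
```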

hazalturkmen commented 2 years ago

Hi @stefan-it ,

Finally, could you share your TensorBoard configuration for Google Cloud TPU? I get the following error, and I am using the Firefox browser:

Couldn't connect to a server on port 8080

Thank you so much :)

hazalturkmen commented 7 months ago

Hi @stefan-it, I want to download the Turkish BERT uncased TF checkpoints using the command mentioned earlier: wget https://schweter.eu/cloud/bert-base-turkish-uncased/bert-base-turkish-uncased.tar.gz

but I get an error when downloading. I would be very grateful if you could help me :) The error:

ERROR: cannot verify schweter.eu's certificate, issued by ‘CN=R3,O=Let's Encrypt,C=US’: Issued certificate has expired. To connect to schweter.eu insecurely, use `--no-check-certificate'.

julien-c commented 7 months ago

@hazalturkmen maybe from here: https://huggingface.co/dbmdz/bert-base-turkish-uncased

stefan-it commented 7 months ago

Hi @hazalturkmen ,

I've finally uploaded the original checkpoints to Model Hub.

For the uncased model, they are prefixed with model.ckpt-*, and can be found here.

I hope this helps :)
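
Once the files are downloaded, a quick way to see which training steps are available is to list the model.ckpt-* files. The following is a minimal sketch, not part of the repository: the `list_ckpt_steps` helper, the `CKPT_DIR` variable and the `./checkpoints` path are hypothetical. It assumes the checkpoints follow the usual TensorFlow 1.x layout, where each checkpoint has a `model.ckpt-<step>.index` file.

```shell
#!/usr/bin/env bash
# List the training steps of TF checkpoints found in a directory, ascending.
# Assumption: each checkpoint has an index file named model.ckpt-<step>.index.
list_ckpt_steps() {
  local dir="$1" f name
  for f in "$dir"/model.ckpt-*.index; do
    [ -e "$f" ] || continue          # the glob matched nothing
    name="${f##*/model.ckpt-}"       # strip path and prefix -> "<step>.index"
    echo "${name%.index}"            # strip suffix -> "<step>"
  done | sort -n
}

# Hypothetical usage: print the prefix of the latest checkpoint, if any.
ckpt_dir="${CKPT_DIR:-./checkpoints}"
latest=$(list_ckpt_steps "$ckpt_dir" | tail -n 1)
if [ -n "$latest" ]; then
  echo "$ckpt_dir/model.ckpt-$latest"
fi
```

The printed prefix (without the `.index` suffix) is the form TensorFlow generally expects when it is given a checkpoint path.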

hazalturkmen commented 7 months ago

Thanks! @stefan-it