vistec-AI / thai2nmt

English-Thai Machine Translation Models
https://airesearch.in.th/releases/machine-translation-models/
Apache License 2.0

Hosting huggingface/datasets version on Internet Archive #3

Open cstorm125 opened 3 years ago

cstorm125 commented 3 years ago

Since git-lfs has limits, I'm considering changing the huggingface/datasets download link to: https://archive.org/details/scb_mt_enth_2020
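
For reference, Internet Archive serves the files of an item under `https://archive.org/download/<identifier>/<filename>`. Below is a minimal sketch of building such a link for the item above. The helper name and the filename are hypothetical, since the thread does not list the item's files.

```python
from urllib.parse import quote

ARCHIVE_ITEM = "scb_mt_enth_2020"  # identifier taken from the details URL above


def archive_download_url(identifier: str, filename: str) -> str:
    """Build a direct-download URL for one file of an Internet Archive item."""
    # quote() percent-encodes spaces and other unsafe characters but keeps "/".
    return f"https://archive.org/download/{quote(identifier)}/{quote(filename)}"


# "data.zip" is a placeholder filename, not a confirmed file of the item.
url = archive_download_url(ARCHIVE_ITEM, "data.zip")
```

A link of this shape could replace the git-lfs URL in the dataset loading script, once the actual filename is known.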

chrisadas commented 1 year ago

Is this still an issue? If things are better now, could we have the checkpoint/data on Hugging Face?

Ideally, I'd like to see this in a form that is ready to load:

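from transformers import AutoModelForSeq2SeqLM, XLMTokenizer

model_name = "..."
tokenizer = XLMTokenizer.from_pretrained(model_name)
# Load a model class here, not the tokenizer again; AutoModelForSeq2SeqLM is an assumed choice.
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
translated = model.generate(...)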
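
A slightly fuller sketch of what that ready-to-load form could look like end to end. It assumes the checkpoint is published in a transformers-compatible seq2seq format, which this thread does not confirm. `translate` is a hypothetical helper name, and it only relies on the usual tokenizer call, `generate`, and `batch_decode` methods.

```python
from typing import Any, List


def translate(texts: List[str], tokenizer: Any, model: Any, **generate_kwargs: Any) -> List[str]:
    """Translate a batch of sentences with a Hugging Face style tokenizer/model pair."""
    # Tokenize into a padded batch of PyTorch tensors.
    batch = tokenizer(texts, return_tensors="pt", padding=True)
    # generate() returns one sequence of token ids per input sentence.
    output_ids = model.generate(**batch, **generate_kwargs)
    # Decode ids back to text, dropping special tokens such as padding.
    return tokenizer.batch_decode(output_ids, skip_special_tokens=True)
```

Here `tokenizer` and `model` would be the objects returned by the `from_pretrained(model_name)` calls, and extra keyword arguments such as `max_new_tokens` are passed through to `generate`.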

Anything I can help with?