-
# 🚀 Feature request
Allow mBART and M2M100 to be easily fine-tuned with multiple target languages in the fine-tuning data set, probably by allowing forced_bos_token_id to be provided in the trainin…
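For context, mBART-style models expect the target-language code as the first token of the labels, so fine-tuning on a mixed-language dataset amounts to forcing a per-example BOS token rather than one global `forced_bos_token_id`. A minimal sketch of that label construction, where the `lang_code_to_id` mapping and the token ids are hypothetical stand-ins for a real tokenizer's values:

```python
# Sketch: prepend each example's target-language code token to its labels,
# so a single batch can mix target languages. The ids below are made-up
# stand-ins for a real mBART tokenizer's lang_code_to_id mapping.
lang_code_to_id = {"en_XX": 250004, "zh_CN": 250025, "ja_XX": 250012}

def build_labels(target_ids, tgt_lang, eos_token_id=2):
    """Return labels with the target-language code forced as the first token."""
    return [lang_code_to_id[tgt_lang]] + list(target_ids) + [eos_token_id]

batch = [
    build_labels([101, 102], "en_XX"),
    build_labels([201, 202, 203], "ja_XX"),
]
print(batch[0])  # the en_XX code leads, then the tokens, then EOS
```

With labels built this way per example, the trainer no longer needs a single dataset-wide `forced_bos_token_id`.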
-
I am working on 8 V100 16 GB GPUs and I am trying to train a 3.7B-parameter mt5-xl model with DPO.
I managed to load both the model and the reference model as separate 8-bit instances of mT5-xl. How…
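As background for the setup above, the DPO loss itself is cheap once the policy and frozen reference log-probabilities are in hand; a minimal sketch of the per-pair objective (pure math, independent of any particular trainer, with `beta=0.1` as an arbitrary example):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss for one preference pair:
    -log sigmoid(beta * ((pi_c - ref_c) - (pi_r - ref_r)))."""
    logits = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    return -math.log(1.0 / (1.0 + math.exp(-logits)))

# At indifference (policy == reference) the loss is -log(0.5) ~= 0.693;
# when the policy prefers the chosen answer more than the reference does,
# the loss drops below that.
print(dpo_loss(-10.0, -12.0, -11.0, -11.0))
```

This is why the memory cost in the setup above is dominated by holding two copies of the model, not by the loss computation itself.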
-
How do you train a multilingual model covering Simplified and Traditional Chinese, Japanese, Korean, English, and so on?
-
Cool project! Wondering whether this should include MQL files for MT5? Thanks
-
Hi
First of all, thank you for your great work on this project. You've achieved among the best results on the Spider benchmark, and your clear, complete README allowed me to run your code very easily.…
-
Hey guys, I'm having a problem getting DeepSpeed working with XLM-Roberta. I'm trying to run it on an Amazon Linux machine, which is based on Red Hat. Here are some versions of the packages/dependencies…
-
If you haven’t already, check out our [contributing guidelines](https://github.com/Expensify/ReactNativeChat/blob/main/contributingGuides/CONTRIBUTING.md) for onboarding and email contributors@expensi…
-
I am trying to fine-tune the mt5 model for a grammatical error correction task using happy-transformers, following the provided tutorial. However, it takes too long (approximately 12 hours!) …
-
**Source:** https://oscar-project.github.io/documentation/versions/oscar-2019/
**Description:** EDA + Clean Thai language part of OSCAR 2019.
**Note:** You might want to sample only a small amount of…
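One way to take such a sample without ever holding the full corpus in memory is single-pass Bernoulli sampling; a minimal sketch (the 1% rate and the synthetic corpus are arbitrary examples):

```python
import random

def sample_lines(lines, rate=0.01, seed=42):
    """Keep each line independently with probability `rate`.
    Streams the input once, so the full corpus never sits in memory."""
    rng = random.Random(seed)
    for line in lines:
        if rng.random() < rate:
            yield line

# Usage: a generator stands in for a large corpus file.
corpus = (f"doc {i}" for i in range(100_000))
sample = list(sample_lines(corpus, rate=0.01))
print(len(sample))  # roughly 1,000 documents
```

The seed makes the subsample reproducible, which matters if the cleaned sample is later shared or re-derived.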
-
**Source:** https://huggingface.co/datasets/mc4/
**Description:** Clean Thai language part of mC4.
- Gambling websites
![Image](https://user-images.githubusercontent.com/56959186/227706424-53c5e556-7…