-
# 🚀 Feature request
In Fairseq it is possible to forgo setting a constant batch size in favor of a dynamic batch size with `--max_tokens`. This ensures that a batch always consists of at most N=max_t…
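The idea can be sketched in plain Python: sort samples by length, then greedily fill each batch until adding one more sample would push the padded token count (batch size × longest sample in the batch) past the budget. This is a minimal illustration of the technique, not Fairseq's actual implementation; the function name `batch_by_tokens` is made up for this example.

```python
from typing import List

def batch_by_tokens(lengths: List[int], max_tokens: int) -> List[List[int]]:
    """Group sample indices into batches whose padded token count
    (batch size * longest sample) never exceeds max_tokens."""
    # Sorting by length keeps similar-length samples together,
    # which minimizes padding waste.
    order = sorted(range(len(lengths)), key=lengths.__getitem__)
    batches: List[List[int]] = []
    current: List[int] = []
    longest = 0
    for idx in order:
        n = lengths[idx]
        if n > max_tokens:
            raise ValueError(f"sample {idx} has {n} tokens, over the {max_tokens} budget")
        # Would adding this sample overflow the padded-token budget?
        if current and (len(current) + 1) * max(longest, n) > max_tokens:
            batches.append(current)
            current, longest = [], 0
        current.append(idx)
        longest = max(longest, n)
    if current:
        batches.append(current)
    return batches

# e.g. lengths [5, 3, 8, 2] with a 10-token budget → [[3, 1], [0], [2]]
```

Batch sizes then vary inversely with sequence length: many short samples per batch, few long ones, with roughly constant memory use.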
-
Can we try training different models, such as gpt2 or t5/mt5, in the BERTIN pipeline? gpt2 is the higher priority.
-
Hi,
Would the PyTorch model file located in the multilingual folder work for applying the algorithm to French?
Thank you.
-
## Environment info
- `transformers` version: 4.11.2
- Platform: google colab
- Python version: 3.7.12
- PyTorch version (GPU?):
- Tensorflow version (GPU?): 2.6.0
- Using GPU in script?: Ye…
-
## Environment info
- `transformers` version: 4.15.0.dev0
- Platform: Linux-5.10.68+-x86_64-with-debian-bullseye-sid
- Python version: 3.7.12
- PyTorch version (GPU?): 1.9.1 (True)
- Tensorfl…
-
## Environment info
- `transformers` version: 4.13
- Platform: linux
- Python version: 1.80
- PyTorch version (GPU?): gpu
@patil-suraj
Model :mt5-base
input : python run_summarizatio…
-
Based on [SO post](https://stackoverflow.com/q/70697470/17840900).
Goal: Amend [Bert-GLUE_OnnxRuntime_quantization.ipynb][1] to work with **Albert** and **Distilbert** models
Kernel: `conda_pyto…
-
## Reader Models Baseline
### **Extractive Models**
- `__init__.py`
- `modeling_bart.py`
- `modeling_bert.py`
##### ※ Base : `AutoModelForQuestionAnswering`
### **Generative Mode…
-
## Environment info
- `transformers` version: latest (4.10.0.dev0)
- Python version: 3.8
- PyTorch version (GPU?): 1.9.0
- Using GPU in script?: no
- Using distributed or parallel set-up in s…
-
Hello, and many thanks to Fengshenbang for the help previously provided in #111 and #123. We have now successfully completed domain fine-tuning of the Wenzhong2.0-GPT2-3.5B-chinese model, but the fine-tuned model generates garbled output.
I noticed that #89 ran into a similar problem, but it does not seem to have been resolved in the end. Could you please take another look?