-
Hi,
Are there any plans to allow conversion of seq2seq translation models such as opus-mt (the MarianMT model type)?
-
Hi,
Many thanks for releasing this library for using LLMs! We just have two quick questions about the models in this library.
1. Could you tell us whether there are any models that are good for text classif…
-
Hi, when I use the script to fine-tune fastchat-t5-3b-v1.0 with LoRA:
```
CUDA_VISIBLE_DEVICES=3 python fastchat/train/train_lora_t5.py \
--model_name_or_path /fastchat-t5-3b-v1.0 \
--lora…
```
-
Loading settings from /content/fine_tune/config/config_file.toml...
/content/fine_tune/config/config_file
You are using the default legacy behaviour of the . This is expected, and simply means that …
-
There are small nuances in how the dynamo runners benchmark models that can make certain torchbench models fail.
Some models might be explicitly skipped; others might fail because of some dtype conve…
-
Please add examples using local open-source models, such as LLaMA or ChatGLM. Thanks!
-
I switched the model to llama, and it reports that there is no chat() function. In the llm.py file there is this code: response, _ = self.model.chat(
            self.tokenizer,
            prompt,
            history=self.history,
            max_length=self.max_token,…
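Not every model class exposes a `chat()` helper (ChatGLM ships one; plain LLaMA checkpoints generally do not), so one workaround is to dispatch on whether the method exists and fall back to plain generation. The sketch below is a minimal, hedged illustration of that pattern — the two model classes are stand-in stubs, not real library APIs, and the `generate_text` name is an assumption:

```python
class ChatStyleModel:
    """Stub standing in for a model that ships its own chat() helper
    (e.g. ChatGLM). Returns (response, updated history)."""
    def chat(self, tokenizer, prompt, history=None, max_length=512):
        return f"chat:{prompt}", (history or []) + [prompt]

class GenerateOnlyModel:
    """Stub standing in for a model that only supports plain generation
    (e.g. a LLaMA checkpoint). generate_text is a hypothetical name."""
    def generate_text(self, prompt, max_length=512):
        return f"gen:{prompt}"

def ask(model, tokenizer, prompt, history, max_length=512):
    # Prefer the model's own chat() helper when it exists.
    if hasattr(model, "chat"):
        response, history = model.chat(
            tokenizer, prompt, history=history, max_length=max_length)
        return response, history
    # Otherwise flatten the history into one prompt and generate directly.
    full_prompt = "\n".join(history + [prompt])
    response = model.generate_text(full_prompt, max_length=max_length)
    return response, history + [prompt]
```

With a real model the fallback branch would build the prompt template the checkpoint was trained with and call its generation API instead of the stub method.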
-
Hi,
I am trying to run some LLMs (currently the OpenAI models) on MMLU. My first question is: which configuration is the standard setup (5-shot without CoT)? What does "flan" mean in some of the c…
-
https://github.com/ELS-RD/transformer-deploy/blob/main/demo/generative-model/gpt2.ipynb
In this notebook, when you tested the cache feature, I think you should use the `generate` function rather than `forw…
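The distinction matters because a cached generation loop feeds only the newest token at each step, while repeated bare forward calls re-process the whole sequence. The toy loop below (no transformers dependency; `toy_forward` is a made-up stand-in, with its "KV cache" reduced to the list of tokens seen) sketches the behaviour that a cached generation loop wraps:

```python
def toy_forward(tokens, cache=None):
    """Pretend transformer step: the 'logits' always favour
    (last token + 1) mod 10. Returns the predicted next token and an
    updated 'KV cache' (here simply the tokens processed so far)."""
    if cache is None:
        cache = []
    cache = cache + list(tokens)  # an uncached call must re-read everything
    next_tok = (cache[-1] + 1) % 10
    return next_tok, cache

def toy_generate(prompt, steps):
    """Greedy loop mimicking cached generation: after the prefill over the
    full prompt, only the single new token is fed on each later step."""
    tok, cache = toy_forward(prompt)          # prefill over the whole prompt
    out = [tok]
    for _ in range(steps - 1):
        tok, cache = toy_forward([tok], cache)  # one token per cached step
        out.append(tok)
    return out
```

In the real library the same contrast shows up as `generate(..., use_cache=True)` versus manually re-running `forward` on the growing input each step.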
-