transformer-models Search Results

1000+ results
for transformer-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ludwig-ai/ludwig #3876

repetition_penalty bugged out

**Describe the bug** When defining a value for `repetition_penalty` & fine-tuning the model, predictions fail with the following error: ``` Prediction: 0%| …

rlleshi updated 1 month ago
2
runpod-workers/worker-vllm #89

ValueError: rope_scaling must be a dictionary with two field…

``` Traceback (most recent call last): 2024-08-01T21:29:17.880522621Z File "/src/handler.py", line 6, in 2024-08-01T21:29:17.880527641Z vllm_engine = vLLMEngine() 2024-08-01T21:29:17.880533…

omar93939 updated 3 months ago
2
huggingface/transformers #34674

Vision Encoder-Decoder fails with LLaMA decoder due to missi…

### System Info - `transformers` version: 4.46.2 - Platform: Linux-6.1.85+-x86_64-with-glibc2.35 - Python version: 3.10.12 - Huggingface_hub version: 0.24.7 - Safetensors version: 0.4.5 - Accele…

amazingvince updated 1 week ago
1
mlflow/mlflow #13327

Deprecate `generate_signature_output` in favor of input_exam…

### Summary The [mlflow.transformers.generate_signature_output](https://mlflow.org/docs/latest/python_api/mlflow.transformers.html#mlflow.transformers.generate_signature_output) function is an utilit…

B-Step62 updated 1 week ago
4
leejet/stable-diffusion.cpp #352

Support providing diffusion models and text encoders separat…

With very large open models like SD3 medium and Flux.1 gaining popularity It's becoming comon to provide the diffusion model (unet/diffusion transformer) part of the model and the text encoders separa…

stduhpf updated 3 months ago
1
InftyAI/llmaz #119

Loading model weights more efficiently

**What would you like to be added**: Right now we can download model weights from model hub directly, but each time we start/restart a pod, it will downloading the model weights again. Without …

kerthcet updated 1 month ago
6
open-compass/opencompass #1557

[Bug] HuggingFacewithChatTemplate

### Prerequisite - [X] I have searched [Issues](https://github.com/open-compass/opencompass/issues/) and [Discussions](https://github.com/open-compass/opencompass/discussions) but cannot get the expe…

ZCzzzzzz updated 1 month ago
2
HVision-NKU/StoryDiffusion #164

cannot import name 'split_torch_state_dict_into_shards' from…

Hello, i am getting this error constantly when trying to run the first code block in jupyter notebook or the gradio interface. I tried upgrading the packages separately, downgrading and installing a …

Jukeman9 updated 2 months ago
1
xiuqhou/Salience-DETR #34

pos_embed算子输入的shape不匹配

### Question 跑训练过程遇到 pos_embed算子输入的shape不匹配，请教大概是什么原因呢？我是pt2.3，其他以来版本是requirments.txt中内容 [rank5]: File "/torch/venv3/pytorch/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1541, …

CerrieJ updated 3 months ago
6
huggingface/transformers #33130

Is it possible to add L1/L2 regularization using the trainer…

### Feature request I want to add L1/L2 regularization to the transformer training. ### Motivation Adding L1/L2 reg can promote sparser models that can accelerate inference and reduce storage. ###…

mayank64ce updated 2 months ago
3

上一页 1...88 89 90 91 92 93 94...100 下一页

1000+ results for transformer-models

1000+ results
for transformer-models