-
Hello, this is the first time I have asked a question on github. If you offend you, please forgive me.Why my rmse is more than 40 .I hope you have time to answer this question, thank you
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
单机微调int4的chatglm模型,在模型加载时出现错误,提示信息:Only Tensors of floating point and complex dtype can requi…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
I've already attempted solutions like reinstalling python…
-
I'm trying to trian a model using the CACSegmentor. I copied the model section from another config file and I implemented my custom dataset. The training works with the default segmentor but with the …
-
Building on the amazing work by @mzbac and @nkasmanoff in https://github.com/ml-explore/mlx-examples/pull/461, I'd really love an example of how LLaVA 1.6 (aka llava next) can be fine-tuned with a LoR…
-
# 🌟 FAVOR+ / Performer attention addition
Are there any plans to add this new attention approximation block to Transformers library?
## Model description
The new attention mechanism with linear…
-
буду хранить тут дамп статей про трансформеры, которые читаю, либо которые хочу прочитать
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - статья где предложили ViT, иде…
-
### Checklist
- [X] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused by a …
-
### System Info
The `load_in_4bit` and `load_in_8bit` arguments are deprecated and will be removed in the future versions. Please, pass a `BitsAndBytesConfig` object in `quantization_config` argument…
-
https://github.com/pytorch/vision/actions/runs/5941974400/job/16117254380
Failures start with 9c4f7389d0db7cfe7e8591ea920459673344aaa8, which is the first commit that used yesterdays (20230822) PyT…