-
I installed PEFT from source and am using the latest versions of Transformers and TRL.
I passed the XLoRA model to TRL, but training doesn't seem to work (training loss doesn't decrease and validatio…
-
import os
os.environ["TOKENIZERS_PARALLELISM"] = "false"
import mmfreelm
from transformers import AutoModelForCausalLM, AutoTokenizer
name = '/mnt/workspace/MMfreeLM-370M'
tokenizer = AutoTokeniz…
-
Has anyone tried downscaling the K and/or Q matrices for repeated layers in franken-merges? This should act like changing the temperature of the softmax and effectively smooth the distribution:
**H…
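
To make the temperature claim concrete, here is a minimal sketch in plain Python (no framework) that checks it for a single query. The vectors, the scale factor `s`, and all helper names are made up for illustration and are not taken from any merge tool. It assumes standard scaled dot-product attention, where multiplying Q (or K) by `s` multiplies every logit by `s`, which is the same as a softmax temperature of `1/s`.

```python
import math

def softmax(scores, temperature=1.0):
    """Numerically stable softmax of scores / temperature."""
    scaled = [s / temperature for s in scores]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention_weights(q, keys, q_scale=1.0, temperature=1.0):
    """Attention weights of one query over all keys, with optional Q scaling."""
    d = len(q)
    q_scaled = [q_scale * x for x in q]
    scores = [dot(q_scaled, k) / math.sqrt(d) for k in keys]
    return softmax(scores, temperature)

def entropy(p):
    """Shannon entropy in nats; higher means a smoother distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)

# Hypothetical toy inputs, chosen only for illustration.
q = [1.0, 2.0, -1.0, 0.5]
keys = [
    [0.5, 1.0, 0.0, 1.0],
    [1.0, -1.0, 2.0, 0.0],
    [0.0, 2.0, -1.0, 1.0],
]
s = 0.5  # downscale factor applied to Q

baseline = attention_weights(q, keys)
downscaled = attention_weights(q, keys, q_scale=s)          # scale Q by s
tempered = attention_weights(q, keys, temperature=1.0 / s)  # temperature 1/s

print("baseline  :", baseline, "entropy", entropy(baseline))
print("downscaled:", downscaled, "entropy", entropy(downscaled))
print("tempered  :", tempered)
```

The `downscaled` and `tempered` weights agree, and their entropy is higher than the baseline, so the attention distribution is smoother. Scaling both Q and K by `s` would scale the logits by `s` squared, so the effective temperature would be `1/s^2` in that case.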
-
### 🚀 The feature, motivation and pitch
[Mamba](https://arxiv.org/pdf/2312.00752.pdf) is a new SSM (State Space Model) developed to address Transformers’ computational inefficiency on long…
-
Running 1 process
Loading Flux model
Loading transformer
Error running job: We couldn't connect to 'https://huggingface.co' to load this model, couldn't find it in the cached files and it looks li…
-
# Efficient Attention
## Reference
- [Efficient Attention](https://github.com/Separius/awesome-fast-attention)
- 2020-09 Efficient Transformers: A Survey [[Paper](https://arxiv.org/abs/2009.06732…
-
Hi,
I was working on the human pose task (getting the pose from an OpenPose image plus a prompt, and then generating the character in the right pose, using control_sd15_openpose.pth).
However, I wanted to …
-
orkspace/code/MPS/inference2.py:22: FutureWarning: You are using `torch.load` with `weights_only=False` (the current default value), which uses the default pickle module implicitly. It is possible to …
-
Hi developers,
Thanks for developing this great tool for annotating single cells.
I wonder whether scGPT requires a GPU on CentOS 7.9. I don't have a GPU; is it possible to use the CPU to run this scG…
-
### System Info
- `transformers` version: 4.44.0
- Platform: macOS-14.6.1-arm64-arm-64bit
- Python version: 3.12.3
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4.4
- Accelerate v…