-
### System Info
- `transformers` version: 4.46.0
- Platform: Linux-6.1.85+-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.24.7
- Safetensors version: 0.4.5
- Acce…
-
`# Developed by Aamir Mirza
# create a conda virtual environment python 3.9
# install PyTorch 1.13.1 ( not 2.0)
# conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=1…
-
## 🐛 Bug
## To Reproduce
Steps to reproduce the behavior:
(mlc-chat-venv) hhg@dell:~/mlc-llm$ mlc_llm convert_weight ./dist/models/music-4-rwkv-converted/ --quantization q4f16_1 --source-…
-
Thanks for the code,
Is there any tweak we can do to fit it in Google Colab (~15 GB Memory)
or should I just settle with QLoRA?
-
-
Hello, @YTianZHU . I read the Differential Transformer paper and found it very interesting.
Thank you so much for your work.
I was wondering how you visualized the attention scores in Figure 1:
![Ima…
-
# Issues Discription #
I tried to use the following command to run all model level tests:
```
python run.py -j 16 --report --cachedir cached -v --testsfile models.txt \
--torchmlirbuild /torch…
-
Open this issue for tracking the progress of models supported in candle-vllm.
-
### Summary
- Provide k-quant models
- Maintain existing gguf models
- Embedding models
- [x] [second-state/Nomic-embed-text-v1.5-Embedding-GGUF](https://huggingface.co/second-state/Nomic-…
-
### Describe the bug
Short story: One of the latest `intel-oneapi-compiler-shared-opencl-cpu.icd` versions causes every call to `clGetPlatformIDs` to hang indefinitely. This is probably not your is…