-
Hello Unsloth Team,
I am trying to finetune the **dwb2023/phi-3-vision-128k-instruct-quantized** model using Unsloth, but I encountered a NotImplementedError. The error message indicates that this …
-
## Describe the bug
running this in terminal : "./mistralrs-server --isq Q4K -i plain -m microsoft/Phi-3.5-MoE-instruct -a phi3.5moe"
I am unable to run this because out of memory it gets over 64 …
-
### Your current environment
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.1 LTS (x86_64)
GCC version: (U…
-
**Is your feature request related to a problem? Please describe.**
Can I use fitch.sty by Johan Klüwer in MathJax?
https://www.actual.world/resources/tex/sty/kluwer/edited/fitch.sty
**Describe th…
-
Take the best bits of https://github.com/ExCALIBUR-NEPTUNE/nektar-diffusion-ambipolar for the oblique BCs and cylindrical coordinate systems and https://github.com/ExCALIBUR-NEPTUNE/nektar-driftwave f…
-
### **Background:**
TT-Buda, developed by Tenstorrent, is a growing collection of model demos showcasing the capabilities of AI models running on Tenstorrent hardware. These demonstrations cover a wi…
-
## Describe the bug
```bash
cargo run --features metal --package mistralrs-server --bin mistralrs-server -- --token-source cache -i plain -m microsoft/Phi-3.5-mini-instruct -a phi3 --dtype bf16
``…
-
**Describe the bug**
After a model is generated running `big_model_fp8.py`, lm_eval dont not work unless the .py files from the original base model is transferred to the generated model folder. Happe…
-
https://huggingface.co/microsoft/Phi-3.5-MoE-instruct
https://huggingface.co/microsoft/Phi-3.5-mini-instruct
-
## Description
According to [Wikipedia the exponentially modified Gaussian](https://en.wikipedia.org/wiki/Exponentially_modified_Gaussian_distribution#cite_note-Kalambet2011-2) can be made more preci…