-
I am experiencing a memory leak while running my application, which is to run an MMLU accuracy test on my Radeon 780M iGPU via DirectML.
Each inference adds tens-hundreds of megabytes to the total …
-
按照默认的工作流生成图片时,给出的文本只有中文
When generating images according to the default workflow, the text given is in Chinese only
如果将原提示词换为英文,会生成较长的无用的内容
If the original prompt is replaced with Eng…
-
![image](https://github.com/ZHO-ZHO-ZHO/ComfyUI-Phi-3-mini/assets/839168/7d034354-58a3-4ef2-92ca-eb6683353fe1)
-
Hi,
I am trying to train phi3 mini model with longer context length 8192 than its default length of 4096.
I understand that reope scaling is not supported for models with sliding window. How can I…
-
Just creating an issue, reported in discord.
Hi guys! When I am fine tuning `unsloth/Phi-3-mini-4k-instruct-bnb-4bit` on continued pretraining with korean language, I got endless output when I test i…
-
### Your current environment
The output of `python collect_env.py`
Collecting environment information...
PyTorch version: 2.4.0
Is debug build: False
CUDA used to build PyTorch: 12.4
ROCM …
-
### Describe the issue
I am running phi3-mini-int4 using the usual onnxruntime c# API and it is 2x as slow as when I use the [genai code](https://github.com/microsoft/onnxruntime-genai). I am using…
-
how i use this with ollama locally? i have this list ready to go
ollama list
NAME ID SIZE MODIFIED
hub/stewart/multi-agent:latest 8cc6e95685ac 3.8 GB 10…
-
### 🐛 Describe the bug
`_load_for_executorch` pybinding cannot load joint graph for phi-3-mini-lora because `load_into` is not implemented in `MmapLoader`, and `load_into` is used by `Program`'s `l…
-
### 软件环境
```Markdown
- paddlepaddle:2.5.1
- paddlepaddle-gpu: 2.5.1
- paddlenlp: develop
```
### 重复问题
- [X] I have searched the existing issues
### 错误描述
```Markdown
Traceback (most recent call…