-
Hello, I get the error in the title when finetuning Phi3.5.
I believe I'm on the latest unsloth (installed from git wit pip).
Context: finetuning Phi3.5 with code that already works with other u…
beniz updated
2 weeks ago
-
This issue tracks op coverage requests for nested tensors. If you'd like a specific op to be implemented for nested tensors, please add a comment here so we can prioritize effectively.
Prior reques…
-
### Issue Description
./webui.sh --debug --ckpt /workspace/mnt/storage/xiangxin@supremind.com/infer_tensor/stable-diffusion-2-1 --listen --port 8000
### Version Platform Description
_…
-
I tried unit testing a single block and it seems like Hydra is at least 4x slower. Is that expected?
-
## 🚀 Feature
TorchStore is a key-value store that holds ATen tensors in shared memory so that they can be accessed across process boundaries without any expensive copy operations.
## Problem
…
-
### News
- HyperCLOVA X 공개 (8.24)
- 네이버클라우드 소개페이지: https://www.ncloud.com/solution/featured/hyperclovax
- DAN23 영상 다시보기: https://tv.naver.com/v/39568301
- [ChatGPT-3.5 Tuning and Enterprise](h…
-
Hello,
Thank you for such a great job and for releasing your code.
I want to train your network using a custom dataset. When I looked at the options.py file, the batch_size parameter is set to 1 a…
-
Hi,
I am working with the example [code](https://github.com/laszukdawid/ai-traineree/blob/master/examples/petting_zoo/connect_four.py) for the training of multi-agent env. However, when I create ea…
-
Currently, there is no data type in MCore for representing a byte. While it might be fine for some applications to only require String based I/O, the absence of binary I/O heavily impacts performance …
-
If I understand the state of research and theory, any "Schrödinger method" QC simulator definitely uses the Hamiltonian mechanics approach. Further, I think "tensor networks," which we might have some…