-
So I was thinking about the following idea.
It is probably completely bogus, but I would definitely investigate it when and if I had the time to, so maybe someone else would be interested as well.
…
-
## Description
The control_wrtGauss_netcdf_parallel_intel fails on hercules (intel) with the following error:
```
2023-11-29 11:42:48.544357 +0000 ERROR /work/noaa/epic/role-epic/spack-stack/…
-
Hi , now i am trying to train SDXL with images of resolutin 768x768, i set batch size for 2 in env file so i have 16 images on folder.
I am using resolution type 'pixel' an resolution '768' both env …
-
The following ops are using `ir.FallbackKernel` via `make_fallback()` in [lowering.py](https://github.com/pytorch/torchdynamo/blob/main/torchinductor/lowering.py#L894) and appear in benchmarks. We sh…
-
## 🚀 Feature
---
In this doc we are requesting comments for the implementation of the sparsification flow as part of architecture optimization namespace (`torch.ao`)
with: @raghuramank100 @dskh…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
myvenv) ubuntu@b12420:~/LLaMA-Factory$ CUDA_VISIBLE_DEVICES=0 python src/train_bash.py --stage sft …
-
It might be useful if you get the model to work to write down the model (e.g. 7B) and the hardware you got it to run on. Then people can get an idea of what will be the minimum specs. I'd also be inte…
-
您好,我在用多卡RLHF训练时报错Tensors must be CUDA and denseRuntimeError,**用单卡训练在这一步是不会报错的**,但是单卡我的显存不够也跑不了,您知道是什么问题吗?
参数如下:
accelerate launch src/train_bash.py \
--stage ppo \
--model_name_or_path "…
-
### Describe the bug
Hello,
Thank you for this useful library.
I have a small problem, I managed to use the code to generate images using SDXL with t2i, then an Image with a Lora. But for some …
-
### What is the issue?
`main` when running a model (specifically `llama3:8b-instruct-fp16` will begin to generate gibberish. It will also leak state between sessions. Swapping out the models will…