Open CCpt5 opened 10 months ago
Are there any available/downloadable trained checkpoints?
Here's one to test with:
OT trained model export that reports "SDOne": https://drive.google.com/file/d/1OLHk8JhS9cRqGX_UcsPvosuAWZ6TEY22/view?usp=drive_link
Oddly I'm not getting the popup warning this morning, so it's probably an order of operations thing where model mixer will reject the 2nd model as not SDXL (or SDNone as it seems the OT trained models get listed).
The screenshots are from the A1111 extension "Model Toolkit" which also finds the checkpoints aren't standard SDXL. Not 100% sure why this is occurring, but they checkpoints themselves do work fine (and mix fine if allowed), so being able to use them w/o possibility of rejection would be good in my case.
Screenshots from "Model Toolkit" checking this model that I'd taken a few days ago and posted in the OT discord: ![header1](https://github.com/wkpark/sd-webui-model-mixer/assets/37972078/e6bdfc0f-49ea-4522-9b3a-46d5840ed2a4) ![2-header](https://github.com/wkpark/sd-webui-model-mixer/assets/37972078/6eaa9954-72c8-45e4-bd2f-71c0226f7b0d)
Here's one to test with:
OT trained model export that reports "SDOne": https://drive.google.com/file/d/1OLHk8JhS9cRqGX_UcsPvosuAWZ6TEY22/view?usp=drive_link
Oddly I'm not getting the popup warning this morning, so it's probably an order of operations thing where model mixer will reject the 2nd model as not SDXL (or SDNone as it seems the OT trained models get listed).
The screenshots are from the A1111 extension "Model Toolkit" which also finds the checkpoints aren't standard SDXL. Not 100% sure why this is occurring, but they checkpoints themselves do work fine (and mix fine if allowed), so being able to use them w/o possibility of rejection would be good in my case.
The uploaded model is a valid SDXL checkpoint. (model-toolkit does not detect SDXL correctly, without VAE
or position_ids
cases)
and for me, An OT checkpoint works as expected... so strange.
I guess, there are some bug exist to prevent a SDXL being detected correctly.
satetensors
: 0.4.1.dev0
(you can force to use latest safetensors
with the following fix.diff --git a/requirements_versions.txt b/requirements_versions.txt
index 2a922f288..a574c7ca7 100644
--- a/requirements_versions.txt
+++ b/requirements_versions.txt
@@ -19,7 +19,7 @@ piexif==1.1.3
psutil==5.9.5
pytorch_lightning==1.9.4
resize-right==0.0.2
-safetensors==0.3.1
+safetensors
scikit-image==0.21.0
spandrel==0.1.6
tomesd==0.1.3
after this fix applied, you can update safetensors
using pip install safetensors -U
command.
currently, there are 3-ways to detect SDXL. check "conditioner.embedders.1.model.transformer.resblocks.9.mlp.c_proj.weight"
tensor by
safetensors
for .safetensors
filespickle
module for old .ckpt
files"conditioner.embedders.1.model.transformer.resblocks.9.mlp.c_proj.weight"
for an already loaded models.
Hey,
I think this is a issue/quirk on the OneTrainer side, but due to some difference in the way the header(maybe?) is formatted some applications read SDXL models trained w/ "One Trainer" as broken. I asked in their Discord why models trained w/ Onetrainer might be reported incorrectly, and apparently it's a known thing but it seemed the consensus was Kohya's training script just does something differently that some applications look-for/expect.
The models I've trained w/ OneTrainer (which gained a lot of popularity recently due to a nice GUI and very active developer) have never failed to work anywhere for inferencing/training(w/ Kohya)/or even merging when it's been allowed w/o the warning/rejection. On the other hand the A1111's extension "Model Toolkit" is one other place I've found OT trained models list as corrupt/invalid.....so I don't know what the difference is.
Regardless, based on this can you allow for an over-ride if a model is reported as "SDNone"? I honestly think I've gotten it to accept my OT models anyway, but I can't remember how I did it. I don't know if I had to use a OneTrainer model as the 1st one, and then could use "standard"/kohya models in B,C,etc......or perhaps it was just an older version.
An override would allow people to still use an OT model until maybe whatever the difference is can be sorted out. It's not an SD 1.5 vs. SDXL type issue - it's something minor w/ the header/file-info (faik). I could upload a model that's been exported w/ OT if that'd be helpful....but they are 6gb obv....
https://github.com/Nerogar/OneTrainer