CognitiveDiffusion closed 8 months ago
Wait, did you build it yourself or download the release?
ohhhhhhh nvm
It's a debug line that I forgot to remove.
I've uploaded a hotfix version. Can you try it out and reply back here? Thanks!
https://github.com/Particle1904/DatasetHelpers/releases/tag/v2.1.0
It worked now. Thanks.
My first test run with Joytagger didn't look better than the old taggers though (and the community is stepping away from pure danbooru tagging).
Did you consider integrating more modern taggers like CogVLM?
It's definitely better than WD 1.4, but it is still limited.
I have considered other types of taggers and captioners, but I don't have the knowledge to implement inference for them. Unfortunately it's not as simple as loading the file and running a single line of code like in Python. I have to implement everything the model needs, from pre-processing to post-processing, and let's not even get started on tokenization; I cannot wrap my head around how to implement it. It took me 3 hours to figure out that I need to pass the outputs of JoyTag through a sigmoid function, which is something the WD 1.4 model already does by itself. I also have to convert the image into a format the model understands, and each of the 3 models currently supported needs a different format; I imagine this is also true for other types of models. The main problem for me is tokenization: so far I've found only a single C# library that does tokenization, it only covers BERT models, and that's about it.
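For anyone reading along, the sigmoid step mentioned above can be sketched roughly like this. It is only an illustration in Python, not the code used in DatasetHelpers: the function names, the example tag list, and the 0.4 threshold are all assumptions made up for the example.

```python
import math


def sigmoid(x: float) -> float:
    """Map a raw model output (logit) to a probability in [0, 1]."""
    # Branch on the sign so math.exp never receives a large positive argument.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def select_tags(logits, tag_names, threshold=0.4):
    """Return (tag, probability) pairs whose probability meets the threshold.

    `threshold` is a hypothetical default chosen for illustration only.
    """
    scored = [(name, sigmoid(value)) for name, value in zip(tag_names, logits)]
    return [(name, prob) for name, prob in scored if prob >= threshold]


if __name__ == "__main__":
    # Made-up logits and tags, just to show the call shape.
    example_logits = [2.0, -3.0, 0.0]
    example_tags = ["1girl", "outdoors", "smile"]
    for tag, prob in select_tags(example_logits, example_tags):
        print(f"{tag}: {prob:.3f}")
```

If a model already applies the sigmoid internally, as described for WD 1.4 above, this step would be skipped and the outputs compared against the threshold directly.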
Thank you for that in-depth explanation. It's much appreciated.
You're welcome. I'll probably try again in the future because I'm personally interested in having better tools.
Thanks a lot for the update!
I wanted to try Joytagger right away, but I get the following error: