x-CK-x / Dataset-Curation-Tool

A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and auto-tag/caption models for your purposes. Custom datasets can be added!
GNU General Public License v3.0
33 stars 7 forks source link

Feature List of EVERYTHING #36

Open x-CK-x opened 5 months ago

x-CK-x commented 5 months ago

A Feature List of everything being added in the short to long term future.

Suggestions Welcome

I will move things off this list into individual tickets to be worked on as time permits.:

- new models for tagging and/or captioning
- new augmentation options
- new image board support (anime & gelbooru)
- two new image tagging modes for both tag based models and caption based models
- a new UI for both modes
- a new layout
- a better way of outlining tags and words in different categories
- faster response times with tag/word suggestions
------------
- segmentation & object detection model/s integration
- a new feature to automatically curate data for the user according to a few questions the user has to answer to determine the heuristics of the model
- cogVLM support
- LLM support for optionally improving captions i.e. cogVLM models struggle with NSFW tagging, so it might be better to use tags resulting from a tagging model and have an LLM re-format the tags into a caption; and then as a second pass have the VLM act as validation to those captions. And/or use an LLM to also merge the LLM caption and the cogVLM caption together into 1 singular caption that takes certain semantics of both captions to make a better one.
- more user configurable options to caption their data
- new methods of pruning tags, again based on various heuristics from Q&A with the user/s and an LLM and/or VLM
- a new WIKI for the tutorials
- super resolution model & denoising model support to (remove) noise artifacts or adversarial attacks from images the user may want to train with; i.e. mitigating some of the effects of Nightshade & Glaze
---------
- new tag, image, & caption statistics and visualization tools for the user to use from a data scientist perspective on how to best choose their data and augment their data
- a custom trained vision classifier on images with [nightshade, glaze, both, nothing]. To allow the user to know which data has been poisoned by artists etc. and if they need to be de-noised / upscaled on to mitigate the effects to some extent
---------
- tagging/captioning model/s will be downloaded on the fly if the option on the dropdown menu is selected, instead of having to go to the download tab to grab it beforehand.

NEW Features Paused as of (09/05/2023) :: unless there are willing contributors to develop any of the other features.

New image board specific tagging/captioning models will be supported as they are released :: (There is "no" current eta. on the progress of those models being developed by others)

Contributors are welcome to open a Pull Request for their developments & I will promptly review it to be added

ProxyPlayerHD commented 6 days ago

one feature i'd love to see added is the ability to abort the current operation. currently that is only possible by closing/quitting the terminal running the program, which isn't great.