Closed yukiarimo closed 5 months ago
Most UI developers probably already add a š š , where š saves your input output. Once you 1000 š , you train saved messages
Are you talking about š š in ChatGPT or Character.AI? Because as far as I know in the open source UIs like ooba or TavernAI, etc, there no ranking (even fine-tuning guides are not well-explained yet)
This issue was closed because it has been inactive for 14 days since being marked as stale.
For example consider this model: https://huggingface.co/TheBloke/Pygmalion-2-7B-GGUF
I want to make (good/bad) RLHF like this:
User: Which drug do you like? AI: I like Meth (say this bad, and maybe provide good response) User: What is your name? AI: Iām ChatGPT (say this is good, and provide another good response, or not)
Then using this knowledge fine tune LoRA or Model