Can you share code/resources for Self Knowledge learning?

Stability-AI / StableLM

StableLM: Stability AI Language Models

Apache License 2.0

15.84k stars 1.03k forks source link

Can you share code/resources for Self Knowledge learning? #106

Open ucalyptus2 opened 5 months ago

ucalyptus2 commented 5 months ago

cc: @jon-tow @hardmaru @estability

ucalyptus2 commented 5 months ago

Mirror: https://huggingface.co/stabilityai/stablelm-2-12b-chat/discussions/3

maxim-saplin commented 4 months ago

I am curious about that part very much - tinkering with StableLM and trying to find a way to teach the assistant its' own name.

P.S.> a dirty hack that worked for me was running LORA for 3 epoch with just 16 records in the dataset (on top of SFT instruction tuned base model) that had the prompts with different versions of "what is your name" conversation. It worked yet did hit the overall quality of responses.