janhq / jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
https://jan.ai/
GNU Affero General Public License v3.0
23.29k stars 1.35k forks source link

epic: Jan Studio (users can finetune) #1249

Closed 0xSage closed 5 months ago

0xSage commented 10 months ago

HackMD

Motivation

Specs

Designs

Figma

Tasklist

Not in Scope

Appendix

TashaSkyUp commented 10 months ago

Fine tuning please.

tikikun commented 9 months ago

Is OpenAI compatible is strictly needed in this usecase

hiro-v commented 9 months ago

@tikikun > Is OpenAI compatible is strictly needed in this usecase

I think yes, we will need a way to sync the state whatsoever, then I use the biggest platform there for it.

hahuyhoang411 commented 9 months ago

Just to add detail for the input dataset.

  1. Columns We should accept only 3 columns/fields named
system instruction response
  1. Split train/test Normally, HF datasets only have train split so we can add a small handle train_test_split (test = 0.1) for users
agi-dude commented 9 months ago

Try Axolotl, It'll make everything simpler. Jan should just generate a config file and start the finetuning.

0xSage commented 5 months ago

oudated