iryna-kondr / scikit-llm

Seamlessly integrate LLMs into scikit-learn.
https://beastbyte.ai/
MIT License
3.38k stars · 275 forks

What is the `fit` method actually doing? #11

Closed kb-open closed 1 year ago

kb-open commented 1 year ago

Hi, great work! I have three questions:

1) Referring to your README example: as part of the `fit` method in ZeroShotGPTClassifier with gpt-3.5-turbo as the model, are you essentially freezing the ada-02 embeddings and then adding some layer on top for the classification task? I'm asking because the OpenAI API supports fine-tuning only up to GPT-3.
2) Or are you simply using it as a zero-shot classifier, with no real training happening? That is, does the `fit` method only map to some prompts that are relevant for a classification task?
3) How can scikit-llm be used for fine-tuning (on private data) for tasks such as summarization or question answering?

Thanks!

OKUA1 commented 1 year ago

Hi @kb-open,

  1. In ZeroShotGPTClassifier no actual training is done; we just use zero-shot prompts and extract the output. The main purpose of fit is to "memorize" the labels seen in the training set so they can be used for prompting and output validation.

  2. Using GPTVectorizer you can embed the text using ada-02 and add anything on top.

  3. For now, it is possible neither to fine-tune the GPT models nor to perform the tasks you mentioned. We are planning to add GPTSummarizer as a preprocessor (similar to GPTVectorizer) later this week, with fine-tuning options in the future (no fixed timeline for that, though).

  4. We did not evaluate the possibility of supporting the question-answering task yet, but we will.
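To make point 1 concrete, the idea can be sketched with a toy class (illustrative only, not scikit-llm's actual implementation; the class name and methods here are hypothetical): `fit` merely records the label set, and those labels are later injected into a zero-shot prompt whose answer is validated against them.

```python
from typing import List

class ToyZeroShotClassifier:
    """Sketch of the idea behind ZeroShotGPTClassifier.fit; not the real code."""

    def fit(self, X: List[str], y: List[str]) -> "ToyZeroShotClassifier":
        # "Training" only memorizes the label set seen in y; no weights are updated.
        self.labels_ = sorted(set(y))
        return self

    def build_prompt(self, text: str) -> str:
        # At predict time, the memorized labels are placed into a zero-shot prompt,
        # and the model's answer would be validated against self.labels_.
        options = ", ".join(self.labels_)
        return f"Classify the text into one of [{options}].\nText: {text}\nLabel:"

clf = ToyZeroShotClassifier().fit(
    ["great acting", "terrible plot"], ["positive", "negative"]
)
print(clf.labels_)              # the only thing fit "learned"
print(clf.build_prompt("loved it"))
```

In the real library the prompt goes to the chat model and the returned label is checked against the memorized set; no gradients or added layers are involved.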