FacVain / dil-asistanim

1 stars 0 forks source link

Fine Tune gpt-3.5-turbo-1106 for Sentiment Analysis #31

Open FacVain opened 7 months ago

FacVain commented 7 months ago

Task Description

Fine Tune gpt-3.5-turbo-1106 for Sentiment Analysis with the Turkish Sentiment Analysis Data Dataset we created previously. GPT should be able to correctly classify Turkish Sentences into one of the 3 classes given as : ["Pozitif", "Nötr", "Negatif"]

Implementation Details

Prepare samples in JSONL format for Fine Tune operation as specified in the OpenAI API The example prompt samples used for training should include system messages telling the system to classify user input and the class value that the model should give in return. Save samples as JSONL and create a fine-tune job in OpenAI API for the gpt-3.5-turbo-1106 model.

Design and Tasks

Develop a script that converts the sentiment dataset that is in CSV format to the Fine Tune format given in the OpenAI API. This format should include a system message to the model instructing it to classify the given sentence, Turkish sentences as user input, and the output the model should provide as an assistant message. Split the dataset as Train, test. Develop a Python script that tests the base gpt-3.5-turbo-1106 model with test data to get the performance of the base model. Then start a Fine-tune job on the API with the training data. After training test the new fine-tuned model and compare it with the base model.

Acceptance Criteria

The Fine-tuned model should result in significant improvement in the 3-label classification task.