daveshap / Raspberry

Create an open source toy dataset for finetuning LLMs with reasoning abilities
MIT License
371 stars 26 forks source link

Paper -> CoT pipeline: Optimize prompt for paper grading rubric #55

Open thehunmonkgroup opened 1 week ago

thehunmonkgroup commented 1 week ago

Requirements:

Deliverable: the final prompt

thehunmonkgroup commented 1 week ago

Here are the current rubric questions, the model is instructed to provide a simple yes/no answer to the question, along with an explanation of the reasoning for the answer:

thehunmonkgroup commented 1 week ago

Here is the full archive of artifacts (inference data for profiling and CoT extraction, JSONL training files) for a test run of 100 papers through the pipeline:

paper-cot-extraction-test-data.tar.gz

The CoT extraction logs are in results/inference with -paper-profiling.txt extension.

thehunmonkgroup commented 2 hours ago

We've decided the current list of questions is sufficient until we need to run a larger number of papers, we'll need funding for that.