Open brunneis opened 1 month ago
Next steps:
The latest finetuning is "lazy". Produces a lot of placeholder comments such as "// TODO" or "// Your code here". Tried several custom and pre-existing LogitsProcessor implementations without relevant success. Trying ORPO to disincentivize the model to be "lazy" with a smaller synthetic dataset.
Try to improve the baseline for Solidity generation.