hunterchewitt-usc / LLM---Detect-AI-Generated-Text

0 stars 1 forks source link

Setup Testing Framework #7

Open hunterchewitt-usc opened 1 year ago

jjm82 commented 11 months ago

local cross validation is apparently a much better indicator of our performance than the public leaderboard. The thought is that the external data people are using for training is actually the data in the public leaderboard hidden test set, so many groups are overfitting. The following is a good tutorial for setting up cross-validation with scikitlearn: https://scikit-learn.org/stable/modules/cross_validation.html