hunterchewitt-usc/LLM---Detect-AI-Generated-Text
Clean preprocessing with held-out test set
#12
Closed
jjm82 closed 11 months ago
jjm82 commented 12 months ago
Choose the best dataset from Kaggle.
Create a new file, preprocessing.py.
Load the dataset in this file and hold out a randomly sampled, class-balanced test set,
i.e. output a train/validate set and a held-out test set.
Upload these files to the folder for permanent use (train/validate for cross-validation and hyperparameter tuning, test for final evaluation).
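
A minimal sketch of what the split in preprocessing.py could look like, using only the Python standard library. The issue does not say which Kaggle dataset or which libraries are used, so everything here is an assumption: the data is taken to be a CSV with a `label` column, "balanced" is read as an equal number of test rows per class, and the names `split_balanced_holdout`, `read_csv`, `write_csv`, and `test_per_class` are hypothetical.

```python
import csv
import random
from collections import defaultdict


def split_balanced_holdout(rows, label_key="label", test_per_class=100, seed=42):
    """Return (train_val_rows, test_rows) with the same number of test rows per class.

    `label_key` and `test_per_class` are illustrative defaults, not values
    taken from the issue.
    """
    rng = random.Random(seed)  # fixed seed so the split can be reproduced

    # Group rows by class label.
    by_class = defaultdict(list)
    for row in rows:
        by_class[row[label_key]].append(row)

    train_val, test = [], []
    for label in sorted(by_class):
        group = by_class[label]
        if len(group) <= test_per_class:
            raise ValueError(f"class {label!r} has only {len(group)} rows")
        rng.shuffle(group)
        test.extend(group[:test_per_class])       # held out for final evaluation
        train_val.extend(group[test_per_class:])  # used for CV and tuning

    # Shuffle again so rows are not ordered by class.
    rng.shuffle(train_val)
    rng.shuffle(test)
    return train_val, test


def read_csv(path):
    """Read a CSV file into a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def write_csv(path, rows):
    """Write a non-empty list of dicts to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
```

To produce the permanent files, one would call `read_csv` on the chosen dataset, pass the rows to `split_balanced_holdout`, and write each returned list with `write_csv`. Only the test set is balanced in this sketch; the train/validate set keeps whatever class ratio remains, which may or may not match what the issue intends.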