hunterchewitt-usc / LLM---Detect-AI-Generated-Text

0 stars 1 forks source link

Copy and try out Tokenizer code #11

Open jjm82 opened 12 months ago

jjm82 commented 12 months ago
jjm82 commented 12 months ago

In general, tokenizer seems to be doing well. The following discussion should be useful and the author seems to be a good one to follow: https://www.kaggle.com/competitions/llm-detect-ai-generated-text/discussion/458522

jjm82 commented 12 months ago

An important thing the author said: "In my experiments, I've found that a significant portion of score improvement comes from tweaking the vectorization part."