andydevs / robot-trump

AI Generated Twitter Madness
MIT License

Use character-based tokenization instead of word-based #7

Open andydevs opened 2 years ago

andydevs commented 2 years ago

Instead of a shallower model that predicts the next word, use a deeper model that predicts the next character.

You may get better model accuracy with a character-based model.

The only challenge would be deciding what to do with hyperlinks, but it could be fun to see what the AI comes up with.
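
For reference, here is a minimal sketch of what character-level tokenization could look like in Python. Nothing here is taken from the repo: `build_vocab`, `encode`, `decode`, `URL_TOKEN`, and the `collapse_urls` flag are hypothetical names. Collapsing each hyperlink into a single placeholder character is only one possible way to handle links; leaving them in and letting the model spell them out is just as valid.

```python
import re

# Hypothetical placeholder character standing in for any hyperlink (an assumption,
# not something the existing code defines).
URL_TOKEN = "\u241f"
URL_PATTERN = re.compile(r"https?://\S+")


def build_vocab(tweets, collapse_urls=False):
    """Map every distinct character in the corpus to an integer id."""
    if collapse_urls:
        tweets = [URL_PATTERN.sub(URL_TOKEN, t) for t in tweets]
    chars = sorted(set("".join(tweets)))
    return {ch: i for i, ch in enumerate(chars)}


def encode(text, vocab, collapse_urls=False):
    """Turn a string into a list of character ids."""
    if collapse_urls:
        text = URL_PATTERN.sub(URL_TOKEN, text)
    return [vocab[ch] for ch in text]


def decode(ids, vocab):
    """Turn a list of character ids back into a string."""
    inverse = {i: ch for ch, i in vocab.items()}
    return "".join(inverse[i] for i in ids)


# Made-up example tweets, purely for illustration.
tweets = ["So great! https://t.co/abc", "Sad!"]
vocab = build_vocab(tweets, collapse_urls=True)
ids = encode("Sad! https://t.co/xyz", vocab, collapse_urls=True)
```

With `collapse_urls=True` every link becomes one symbol, so the vocabulary stays small and the model never has to learn URL spelling. With `collapse_urls=False` the model sees links character by character and will invent its own.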

Ranked strategies for incorporation:

  1. Create a separate, protected branch for the old model and continue with the new model on the main branch
  2. Create a separate branch for the new model and keep the old one
  3. Have both exist in the same code, with a selector in the program options to switch between them
  4. Create a separate repo for the new model
  5. Make a monorepo with both projects in separate environments
  6. Replace the old model

andydevs commented 2 years ago

You may also want to look into adding randomness to the model

I think the repeated tweets are caused by the model being too deterministic
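
One common way to add randomness is temperature sampling. The sketch below assumes generation currently picks the single most likely token at each step, which I have not verified against the code. `sample_with_temperature` and its parameters are hypothetical names, and `logits` stands for whatever raw per-token scores the model outputs.

```python
import math
import random


def sample_with_temperature(logits, temperature=1.0, rng=random):
    """Sample a token index from raw scores instead of always taking the max.

    temperature < 1 sharpens the distribution (closer to deterministic),
    temperature > 1 flattens it (more random).
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    top = max(scaled)
    weights = [math.exp(s - top) for s in scaled]
    # random.choices normalizes the weights internally.
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]
```

A very low temperature behaves almost like the assumed deterministic pick, so the value could be exposed as a program option and tuned until the repeats go away without the output turning into noise.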