MusicLang / musiclang_predict

AI Prediction api of the MusicLang package
GNU General Public License v3.0
250 stars 16 forks source link

Documentation on MusicLang Tokenizer #29

Open MikeMpapa opened 3 months ago

MikeMpapa commented 3 months ago

Hi and thanks a lot for the great work! I was wondering if there is any detailed documentation on the tokenizer beyond what's in the technical blog? Also is there a way to add custom tokens based on metadata in order to control on inference?

floriangardin commented 3 months ago

Hi, thank you. Adding custom tokens would imply retraining the model (because the vocab would change). Unfortunately, at this time I don't have a timeline for releasing the training code. You can check the tokenizer class to get the tokens structure, and check musiclang language documentation to see what the tokens means. Can you share what kind of tokens do you want to add in the model ?

MikeMpapa commented 3 months ago

I would be interested to control stuff like genre, mood, and similar tags of that nature that might be more abstract in nature but live in the metadata-land