kmizuki closed this issue 5 months ago
Hi! By "online", do you mean processing a stream of audio frame by frame in real time? The model itself is extremely lightweight and processes CQT frames independently, so it should be fast enough. However, computing the CQT in real time can be challenging because of the large kernel required for low frequencies. I'm not currently aware of a solution to this issue.
As for the training set, the provided checkpoint was trained on MIR-1K.
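To give a rough sense of why the low-frequency kernels are the bottleneck, here is a minimal sketch using the standard constant-Q relation N_k = Q * sr / f_k with Q = 1 / (2^(1/B) - 1). The helper name `cqt_kernel_length`, the 16 kHz sample rate, the 12 bins per octave, and the three example frequencies are all assumptions chosen for illustration; they are not taken from PESTO's actual configuration.

```python
import math


def cqt_kernel_length(freq_hz, sample_rate=16000, bins_per_octave=12):
    """Approximate length in samples of a constant-Q kernel centred on freq_hz.

    Hypothetical helper for illustration only, not part of PESTO.
    """
    # Quality factor: ratio of centre frequency to bandwidth for this resolution.
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    # The window must span about q periods of the centre frequency.
    return math.ceil(q * sample_rate / freq_hz)


sample_rate = 16000  # assumed value, adjust to your setup
for freq in (32.70, 261.63, 2093.0):  # roughly C1, C4, C7
    n = cqt_kernel_length(freq, sample_rate)
    print(f"{freq:8.2f} Hz -> {n} samples ({1000 * n / sample_rate:.1f} ms)")
```

The kernel length grows inversely with frequency, so the lowest bin needs a much longer audio window than the highest one, and that window is what would have to be buffered before a frame can be produced in a streaming setting.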
Do you have any ideas on how to run pesto processing online? Also, what is the default model's training dataset?
Thank you.