impel-intelligence / dippy-bittensor-subnet

MIT License
46 stars 18 forks source link

Models should be reevaluated after 24~48 hours later. #85

Closed torquedrop closed 1 week ago

torquedrop commented 1 week ago

If a model is trained on a very small, specific dataset, it can lead to overfitting. The current validation process relies heavily on data generated in the last 48 hours. To address this, we could train the model using data from the previous 5 to 10 days, which tends to result in higher emissions.

This approach isn't fair to developers of general-purpose models, and it could result in ineffective models. To prevent this, newly submitted models should be regularly reevaluated using newer datasets.

There are two key benefits to this:

I suggest implementing 2 or 3 validation processes during regular operations if the subnet is busy. However, if the subnet is not busy, all available validation processes should be utilized.

donaldknoller commented 1 week ago

Thanks for your suggestion. Please note that a variation of this has already been mentioned here which covers the more pertinent issue at hand.