Closed mahdikhashan closed 1 day ago
I implemented XOR and MNIST using MLP. also with some chat with GPT-o1, it suggested to have a small Model like MobileNet or SqueezeNet with TransferLearning to on some database to have a classification (for example cat vs dogs).
current decisions metrics are "time to train/learn", "should be classification problem or sth that can be used with a UI", "play with hyperparameters".
For an image classification model that can be trained in about 1 minute, you would need to use a very lightweight model and a small dataset. Here are a few options:
With these strategies, you can train a simple image classification model in under a minute. Keep in mind that this fast training time typically sacrifices accuracy compared to training for longer periods, but for quick prototypes or basic tasks, it works well.
I can also use checkpoints feature and keep training.
I can also benefit from distributed training
maybe I can do hyperparameter tunning and compare two models?? CNN and MLP
I'm going to use mnist with mlp and cnn, training multiple models, comparing the results and registering all in an object storage, probably use mlflow for experiment tracking.