mahdikhashan / jku-cloud-computing

0 stars 0 forks source link

select model #5

Closed mahdikhashan closed 1 day ago

mahdikhashan commented 2 weeks ago
mahdikhashan commented 1 week ago

I implemented XOR and MNIST using MLP. also with some chat with GPT-o1, it suggested to have a small Model like MobileNet or SqueezeNet with TransferLearning to on some database to have a classification (for example cat vs dogs).

current decisions metrics are "time to train/learn", "should be classification problem or sth that can be used with a UI", "play with hyperparameters".

mahdikhashan commented 1 week ago

For an image classification model that can be trained in about 1 minute, you would need to use a very lightweight model and a small dataset. Here are a few options:

1. MobileNetV2

2. SqueezeNet

3. EfficientNet-Lite

4. Custom CNN (Small Architecture)

Key Factors for Training in 1 Minute:

With these strategies, you can train a simple image classification model in under a minute. Keep in mind that this fast training time typically sacrifices accuracy compared to training for longer periods, but for quick prototypes or basic tasks, it works well.

mahdikhashan commented 1 week ago

I can also use checkpoints feature and keep training.

mahdikhashan commented 1 week ago

I can also benefit from distributed training

mahdikhashan commented 1 week ago

maybe I can do hyperparameter tunning and compare two models?? CNN and MLP

mahdikhashan commented 1 day ago

I'm going to use mnist with mlp and cnn, training multiple models, comparing the results and registering all in an object storage, probably use mlflow for experiment tracking.