agentmorris / MegaDetector

MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.
MIT License

compute resources for bespoke classifier training #86

Closed agentmorris closed 1 year ago

agentmorris commented 1 year ago

I recently stepped through your classifier training workflow in an AWS SageMaker Studio Lab instance and was able to begin fitting an efficientnet-b3 with my own data, but I quickly exhausted the available memory (15GB) and then later disk space (25GB). I think SageMaker Studio Lab is geared towards learning ML and running some simple experiments (it's also free), so it's not terribly surprising that I maxed it out right out of the gate. That said, before I start shopping around for a new classifier training environment, do you happen to have benchmarks on how much memory and disk space the classifier training process will consume?


Issue cloned from Microsoft/CameraTraps, original issue posted by nathanielrindlaub on Jan 31, 2023.

agentmorris commented 1 year ago

Sorry, the best answer is "we don't know". The last time that classifier training pipeline was run, it was likely on an Azure NC6v3 instance, and we probably never tested it on anything smaller, and definitely never tested it without a GPU. Here are some random facts in random order that may be helpful:

Sorry we don't have an easier answer!
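
In the absence of benchmarks, one way to get numbers for your own data is to measure a short run (say, one epoch on a subset) directly. What follows is a minimal sketch, assuming Linux or macOS and only Python's standard library; `measure`, `peak_rss_gb`, and `disk_used_gb` are hypothetical helper names and not part of the MegaDetector codebase. It reports host memory only, not GPU memory, and it does not count memory used by data-loader worker processes.

```python
import resource
import shutil
import sys


def peak_rss_gb():
    """Peak resident memory of this process so far, in GB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 ** 3 if sys.platform == "darwin" else 1024 ** 2
    return peak / divisor


def disk_used_gb(path="."):
    """Space currently used on the filesystem that contains `path`, in GB."""
    return shutil.disk_usage(path).used / 1024 ** 3


def measure(run, path="."):
    """Call `run()` and report peak memory and the change in disk usage."""
    disk_before = disk_used_gb(path)
    result = run()
    return {
        "result": result,
        "peak_rss_gb": peak_rss_gb(),
        "disk_growth_gb": disk_used_gb(path) - disk_before,
    }


if __name__ == "__main__":
    # Stand-in workload; replace the lambda with a call into your training code.
    stats = measure(lambda: sum(range(1000)))
    print(f"peak memory: {stats['peak_rss_gb']:.2f} GB")
    print(f"disk growth: {stats['disk_growth_gb']:.2f} GB")
```

The disk figure covers the whole filesystem, so other processes writing to the same disk will skew it; pointing `path` at a dedicated output directory's volume gives a cleaner reading.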


(Comment originally posted by agentmorris)

agentmorris commented 1 year ago

Amazing, no this is all super helpful. Thank you @agentmorris!!


(Comment originally posted by nathanielrindlaub)