
Ensemble models #318

Open MFreidank opened 3 years ago

MFreidank commented 3 years ago

Hello there,

I was wondering: what is the best way to manage an ensemble of models through ClearML? There seem to be assumptions that a single task maps to a single output model, see #224.

I'd like to understand the best way to:

  1. store/track checkpoints for an ensemble of models grouped together
  2. version/document an ensemble of models together in the registry
  3. deploy this ensemble of models, ideally as a unit that can be queried together

Could you please help me understand how this could be done?

Many thanks in advance and kind regards, MFreidank

bmartinn commented 3 years ago

Hi @MFreidank

Good point, and you are in luck: the next ClearML version will support multiple models per Task 🎉 (finally). Basically, instead of a single model, a Task will be able to hold multiple models in a key/value-like structure, where the key is the model filename (or a manually set model name).

Until 0.18 is out (hopefully towards the end of the month), here is what you can do. Currently, multiple models are stored in the ClearML model repository, and each one has a unique ID and a link back to the creating Task. The missing piece is for the Task to link back to multiple models, based on their names (currently it links back only to the last one). That said, if you call `Task.get_task('task_id_here').models['output']` you will get a list of all the output model objects (see the docs). With the Model object you can query/set the name/tags/URL etc. of the model itself.
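Something like this (a minimal sketch; `task_id_here` is a placeholder, and exact property availability may vary slightly between SDK versions):

```python
from clearml import Task

# Fetch the training task and list every output model it registered.
# 'task_id_here' is a placeholder for a real Task ID.
task = Task.get_task(task_id='task_id_here')
output_models = task.models['output']  # list of Model objects

for model in output_models:
    # Each Model object exposes metadata you can query (and update).
    print(model.id, model.name, model.url, model.tags)
```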

Specifically regarding your questions:

  1. As explained above, you can access all the models a Task produced by querying the Task (or querying the models for the creating Task ID); that gives you a list of models you can deploy. The question becomes more complicated if the same Task created multiple snapshots of these models. In that case the list is ordered by storage time, so you could pick the last 3 from the list (let's assume we have 3 models in the ensemble); you can also verify, based on the model names, that these are 3 different models and not the same model in different snapshots (see the sketch after this list).
  2. I would add tags to the "chosen" models, so that you know they are part of a package. Obviously the question is how you would know which models to choose; maybe as part of the training code, or via a quality-control process that tests all models on a blind dataset and chooses the best-performing ensemble set?
  3. I guess that depends on how you serve models and where you store them (BTW, serving integration will also be part of the next release ;))
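To make points 1 and 2 concrete, a rough sketch (the tag name `ensemble-v1` and the 3-model assumption are just examples; `Model.query_models` and the `tags` setter may not be available in older SDK versions):

```python
from clearml import Task, Model

# Gather the output models of the training task ('task_id_here' is a placeholder).
task = Task.get_task(task_id='task_id_here')
output_models = task.models['output']

# Point 1: the list is ordered by storage time, so take the last 3 entries
# (assuming a 3-model ensemble) ...
ensemble = output_models[-3:]

# ... and verify by name that these are 3 distinct models, not snapshots of one.
assert len({m.name for m in ensemble}) == 3

# Point 2: tag the chosen models so the ensemble can be found later as one package.
# 'ensemble-v1' is an arbitrary example tag.
for m in ensemble:
    m.tags = (m.tags or []) + ['ensemble-v1']

# Point 3: later, pull the ensemble members back for serving.
for m in Model.query_models(tags=['ensemble-v1']):
    local_path = m.get_local_copy()  # download the model weights file
    print(m.name, local_path)
```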

What do you think?