It would be good to evaluate the model's performance on the sct-test-large dataset to see how well it generalizes to various types of data.
I could generate a QC report after inference and inspect the images one by one, but I was wondering whether there are any metrics that could measure the model's accuracy across all of this data, which has no ground truth...