duxuan11 closed this issue 10 months ago
Yes, we train the network in multi-output mode: the model predicts 3 output masks, and we select the one with the highest score as the final output mask. As for exporting ONNX files, samexporter may be helpful.
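The selection step described above can be sketched roughly as follows. This is only an illustration, not the repository's actual code: the function name `select_best_mask`, the array shapes `(3, H, W)` and `(3,)`, and the example values are all assumptions.

```python
import numpy as np


def select_best_mask(masks, scores):
    """Return the mask with the highest score, plus its index.

    Assumed shapes (hypothetical, not taken from the repository):
      masks:  (N, H, W) candidate masks, N = 3 in multi-output mode
      scores: (N,) one predicted quality score per mask
    """
    masks = np.asarray(masks)
    scores = np.asarray(scores)
    if masks.shape[0] != scores.shape[0]:
        raise ValueError("need exactly one score per mask")
    best_idx = int(np.argmax(scores))  # index of the max score
    return masks[best_idx], best_idx


# Made-up example: three 4x4 masks filled with 0, 1 and 2.
masks = np.stack([np.full((4, 4), i, dtype=np.float32) for i in range(3)])
scores = np.array([0.71, 0.93, 0.55], dtype=np.float32)

best_mask, best_idx = select_best_mask(masks, scores)
```

If the exported ONNX model returns all three masks together with their scores, the same `argmax` selection would presumably be applied to its outputs after inference.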
I don't know what you changed from the original SAM, but the output mask has three channels instead of one, and the max score is used to select the final mask.