Open Ultraman6 opened 4 days ago
Hi there! Regarding your question about automatic inference mode support for SAM's ONNX models:
Yes, it's possible to support automatic inference, but you'll need to handle the tensor transformations correctly. The key is keeping the decoder's mask dimensions dynamic so that what the exported decoder outputs matches what the automatic mode expects.
I recommend checking out the related samexporter project as a reference. You can modify its export functionality to match your specific needs, and then adapt the corresponding implementation in X-AnyLabeling's segment_anything_2.py accordingly.
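Here is a minimal re-export sketch, assuming you are working from the original segment-anything repo's SamOnnxModel wrapper (samexporter's decoder export follows a very similar pattern). The checkpoint path, opset version, and the extra "num_prompts" dynamic axis are assumptions you would adjust for your own setup:

```python
# Sketch: re-export the SAM mask decoder with multi-mask output enabled,
# so the ONNX decoder returns all candidate masks instead of a fixed (1, 1, x, y).
import torch
from segment_anything import sam_model_registry
from segment_anything.utils.onnx import SamOnnxModel

# Checkpoint/model type are placeholders; use whatever you originally exported.
sam = sam_model_registry["vit_b"](checkpoint="sam_vit_b_01ec64.pth")
onnx_model = SamOnnxModel(model=sam, return_single_mask=False)  # keep all masks

embed_dim = sam.prompt_encoder.embed_dim
embed_size = sam.prompt_encoder.image_embedding_size
mask_input_size = [4 * x for x in embed_size]

dummy_inputs = {
    "image_embeddings": torch.randn(1, embed_dim, *embed_size, dtype=torch.float),
    "point_coords": torch.randint(0, 1024, (1, 5, 2), dtype=torch.float),
    "point_labels": torch.randint(0, 4, (1, 5), dtype=torch.float),
    "mask_input": torch.randn(1, 1, *mask_input_size, dtype=torch.float),
    "has_mask_input": torch.tensor([1], dtype=torch.float),
    "orig_im_size": torch.tensor([1500, 2250], dtype=torch.float),
}

with torch.no_grad():
    torch.onnx.export(
        onnx_model,
        tuple(dummy_inputs.values()),
        "sam_decoder_multimask.onnx",
        input_names=list(dummy_inputs.keys()),
        output_names=["masks", "iou_predictions", "low_res_masks"],
        dynamic_axes={
            # The "num_prompts" axis on dim 0 is an assumption for batched prompts;
            # the official export script only marks "num_points" as dynamic.
            "point_coords": {0: "num_prompts", 1: "num_points"},
            "point_labels": {0: "num_prompts", 1: "num_points"},
        },
        opset_version=17,
    )
```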
For your specific issue with the masks being fixed at (1, 1, x, y), you'll likely need to:

1. Re-export the ONNX decoder so that num_multimask_outputs is preserved instead of being hard-coded to 1 (i.e. export with multi-mask output enabled, as in the sketch above), giving masks of shape (1, N, x, y).
2. Mark the prompt/batch dimension as a dynamic axis if you want to feed batches of point prompts the way the original auto mode does.
3. Adapt the post-processing in segment_anything_2.py to select from, or iterate over, the N candidate masks (see the sketch after this list).
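And a hypothetical post-processing sketch, assuming the re-exported decoder now returns masks of shape (num_prompts, num_masks, H, W) plus IoU predictions. The function and variable names here are illustrative, not X-AnyLabeling's actual API:

```python
# Sketch: consume the multi-mask decoder output with ONNX Runtime.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("sam_decoder_multimask.onnx")

def decode(decoder_inputs: dict, multimask_output: bool = True):
    """Run the decoder; optionally keep only the best mask per prompt."""
    masks, iou_preds, _ = session.run(None, decoder_inputs)
    # masks: (num_prompts, num_masks, H, W); iou_preds: (num_prompts, num_masks)
    if not multimask_output:
        best = np.argmax(iou_preds, axis=1)          # best candidate per prompt
        idx = np.arange(masks.shape[0])
        masks = masks[idx, best][:, None]            # -> (num_prompts, 1, H, W)
        iou_preds = iou_preds[idx, best][:, None]    # -> (num_prompts, 1)
    return masks > 0.0, iou_preds                    # binarize at SAM's default threshold of 0
```

For auto mode you would call this with multimask_output=True for each batch of grid-point prompts and then filter the candidates by their IoU predictions (and stability score), mirroring what the original SamAutomaticMaskGenerator does.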
Let me know if you need any clarification on implementing these changes!
Search before asking
Question
I've run into a problem and I'm not sure whether this is simply how the official export format works. When using the ONNX-format decoder, the output masks have both batch size and depth equal to 1, whereas the original dynamic model from the official repo defaults to 64 and 3. After inspecting the ONNX decoder, I found that num_multimask_outputs and transformer_dim, which are dynamic in the original model, are all hard-coded inside it, so the ONNX decoder can only output masks with dimensions (1, 1, x, y). Is there a way to resolve this so that it can support auto mode?
Additional
No response