saintarian closed this issue 4 months ago
Thank you for reporting the issue. The logs above indicate that the partitioner estimated the model would run more efficiently on CPU than on Neuron, so all operators were partitioned to the CPU. Optimized MaskRCNN support is not on this year's roadmap.
Thx, @aws-rhsoln. Closing the issue.
I used a Python script (pasted further down) to attempt to convert a MaskRCNN TensorFlow model to NeuronX on an Inf2 instance. The full output of running the script is also pasted further down. The conversion seems to have failed, based on 2 observations:
neuron-top showed that the Inferentia cores were not being utilized at all
Python script:
Installed pip neuron packages:
Full output: