Open ssinhaleite opened 11 months ago
@sheiksadique what's the current state of your graph extraction algorithm? Is it feasible to apply it to this issue?
It should in principle be possible with NIR or NIRTorch utilities, but it requires some work and isn't plug-and-play. We will need to dramatically modify the current implementation, which assumes a sequential structure everywhere.
Currently, only sequential models are supported. The chips do in fact support branched architectures, but the use of this feature is unfortunately limited by the lack of proper software support.
The target would be to support a range of network structures as listed below:
Two independent networks
Two networks with merging outputs
A network with a merge and a split
A network with residual connections
A complex network structure
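To make the target structures above concrete, here is a minimal sketch that writes each one as a list of edges and checks where it departs from a purely sequential chain. The layer names, the `TOPOLOGIES` dictionary, and the helper functions are all hypothetical illustrations, not part of the existing code base or of NIR/NIRTorch, and the graphs are assumed to be acyclic.

```python
from collections import defaultdict

# Hypothetical layer names; each topology is a list of (source, destination) edges.
TOPOLOGIES = {
    "sequential": [("a", "b"), ("b", "c")],
    "two_independent": [("a1", "a2"), ("b1", "b2")],
    "merging_outputs": [("a1", "a2"), ("b1", "b2"), ("a2", "out"), ("b2", "out")],
    "merge_and_split": [("a", "m"), ("b", "m"), ("m", "c"), ("m", "d")],
    "residual": [("a", "b"), ("b", "c"), ("a", "c")],
}


def describe(edges):
    """Return the input, merge, and split nodes of an acyclic edge list."""
    fan_in = defaultdict(int)
    fan_out = defaultdict(int)
    nodes = set()
    for src, dst in edges:
        fan_out[src] += 1
        fan_in[dst] += 1
        nodes.update((src, dst))
    return {
        "inputs": sorted(n for n in nodes if fan_in[n] == 0),
        "merges": sorted(n for n in nodes if fan_in[n] > 1),
        "splits": sorted(n for n in nodes if fan_out[n] > 1),
    }


def is_sequential(edges):
    """True if the (acyclic) graph is a single chain: one input, no merge, no split."""
    info = describe(edges)
    return len(info["inputs"]) == 1 and not info["merges"] and not info["splits"]


for name, edges in TOPOLOGIES.items():
    print(name, is_sequential(edges), describe(edges))
```

Only the first entry passes `is_sequential`; the others fail because of a second input, a merge node, a split node, or some combination, which are the cases a sequential-only implementation cannot express.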
A special module/layer type that supports branched data paths might be useful for trying out these types of architectures.
Current proposal: Use a Branched module that has child sequential objects and merges their outputs at the end. The issue with such a construct is that it doesn't support models that have two independent submodules that do not merge, so this is not the ideal solution.
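A rough sketch of what such a construct could look like is given here, written with plain Python callables so that it runs without any framework; in practice the children would presumably be `torch.nn.Sequential` objects. The class names, the `merge` argument, and the choice to feed the same input to every branch are assumptions made for illustration only.

```python
class Sequential:
    """Applies a list of callables one after another (identity if empty)."""

    def __init__(self, *layers):
        self.layers = layers

    def __call__(self, x):
        for layer in self.layers:
            x = layer(x)
        return x


class Branched:
    """Feeds the same input to every child branch and merges their outputs.

    Assumption: all branches share one input and `merge` reduces the list of
    branch outputs to a single value (summation by default).
    """

    def __init__(self, *branches, merge=sum):
        self.branches = branches
        self.merge = merge

    def __call__(self, x):
        return self.merge([branch(x) for branch in self.branches])


def double(v):
    return 2 * v


def add_one(v):
    return v + 1


# Residual-style block: one processing branch plus an empty (identity) branch.
residual = Branched(Sequential(double, add_one), Sequential())
result = residual(3)  # (2 * 3 + 1) + 3
print(result)
```

Because `Branched.__call__` always reduces the branch outputs to a single value, this sketch shows the limitation directly: two independent networks that should each keep their own output cannot be expressed without a different container.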