Hi. Thanks for previous prompt response. I'm currently tring to synthesize the Allreduce for a custom topology(let's say a ring with 4 or 8 nodes as an example). Some strange problems occurs when doing so. I wonder if you can help.
I want to know how to understand this output. The chunck id seems to not match with each other. And the input/output map is not a proper solution for allreduce.
I'll be really appreciated and happy to offer other trail logs if anyone can help.
Hi. Thanks for previous prompt response. I'm currently tring to synthesize the Allreduce for a custom topology(let's say a ring with 4 or 8 nodes as an example). Some strange problems occurs when doing so. I wonder if you can help.
My Codes:
I stored the collective also into json file for better debug. The logged allreduce json has strange input and output map as follows:
I want to know how to understand this output. The chunck id seems to not match with each other. And the input/output map is not a proper solution for allreduce. I'll be really appreciated and happy to offer other trail logs if anyone can help.