Open fwyc0573 opened 4 months ago
Marking as stale. No activity in 60 days.
I ran into a similar problem when tracing a Megatron-LM transformer model into a graph, also using torch.fx: some custom modules are recognized as the wrong type. Could you share what your specific problem was and how you solved it? Thanks! @fwyc0573
Hi everyone,
I'm currently working on a project involving Megatron-LM, and I'm looking for a way to obtain the computation graphs of the sub-models after partitioning, along with the attributes of their operators. I've tried torch.fx as well as the torch.compile and TorchDynamo tooling introduced in PyTorch 2.0, but I've run into several issues. Some of these problems seem to be related to the custom operators implemented in Megatron-LM.
Could anyone provide a feasible solution or guidance on how to achieve this? Any help would be greatly appreciated!
Thank you in advance!
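For reference, here is a minimal sketch of the kind of torch.fx tracing I have in mind. It does not use Megatron-LM at all: `CustomOp`, `SubModel`, and `LeafTracer` are hypothetical names made up for illustration, and `CustomOp` only stands in for a custom module that symbolic tracing cannot look inside. The idea is to subclass `torch.fx.Tracer` and override `is_leaf_module` so the custom module is kept as a single `call_module` node instead of being traced through. Whether this carries over to the real Megatron-LM modules is an assumption I have not verified.

```python
import torch
import torch.nn as nn
import torch.fx as fx


class CustomOp(nn.Module):
    """Hypothetical stand-in for a custom module (not a Megatron-LM class)."""

    def forward(self, x):
        # Data-dependent control flow: symbolic tracing cannot step through this.
        if x.sum() > 0:
            return x * 2
        return x


class SubModel(nn.Module):
    """Hypothetical sub-model: one standard layer followed by the custom module."""

    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)
        self.custom = CustomOp()

    def forward(self, x):
        return self.custom(self.linear(x))


class LeafTracer(fx.Tracer):
    """Tracer that treats CustomOp as an opaque leaf instead of tracing into it."""

    def is_leaf_module(self, m, module_qualified_name):
        if isinstance(m, CustomOp):
            return True
        return super().is_leaf_module(m, module_qualified_name)


model = SubModel()
graph = LeafTracer().trace(model)
gm = fx.GraphModule(model, graph)

# Collect (op, target) pairs, plus the module type for call_module nodes.
ops = [(n.op, str(n.target)) for n in gm.graph.nodes]
module_types = {
    n.target: type(gm.get_submodule(n.target)).__name__
    for n in gm.graph.nodes
    if n.op == "call_module"
}

for n in gm.graph.nodes:
    print(n.op, n.target, n.args, n.kwargs)
print(module_types)
```

With the default `fx.Tracer`, tracing `SubModel` fails on the `if x.sum() > 0` branch; with `LeafTracer`, the custom module shows up as one node whose type can be looked up via `gm.get_submodule`. The trade-off is that the internals of any leaf module stay invisible in the graph.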