volcengine / veScale

A PyTorch Native LLM Training Framework
http://vescale.xyz
Apache License 2.0
497 stars 19 forks source link

[QUESTION] how and where to use multi-node trace profiler in paper of megascale #37

Open oliverYoung2001 opened 1 month ago

oliverYoung2001 commented 1 month ago

I read the paper of megascale. And I find that the multi-node trace profiler is really useful for me. Thus I want to know how and where to use this tool ?

Screenshot 2024-05-24 at 15 09 56
pengyanghua commented 1 month ago

@oliverYoung2001 Hi, the cuda event monitor tool will be released by the end of July, as said in the README.md.