skylineprof / skyline

🏙 Interactive in-editor performance profiling, visualization, and debugging for PyTorch neural networks.
Apache License 2.0

Add a per-operation run time breakdown #28

Closed by geoffxy 4 years ago

geoffxy commented 4 years ago

We need to add the per-operation run time breakdown back in. Previously we used a custom profiling solution. This time we should explore using an existing profiling tool instead (e.g., pyprof, the PyTorch profiler that ships with NVIDIA's apex).
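
For reference, here is a minimal sketch of what a per-operation run time breakdown could look like. It is not Skyline's implementation and does not use PyTorch or pyprof: `OperationTimer`, `track`, and `breakdown` are hypothetical names, and the two timed workloads are plain-Python stand-ins for real operations. It only measures CPU wall-clock time, so timing asynchronous GPU kernels this way would additionally need synchronization before each clock read.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class OperationTimer:
    """Accumulates wall-clock run time per named operation (hypothetical helper)."""

    def __init__(self):
        self.total_ms = defaultdict(float)  # operation name -> accumulated milliseconds
        self.calls = defaultdict(int)       # operation name -> number of timed calls

    @contextmanager
    def track(self, name):
        # Time the body of the `with` block and attribute it to `name`.
        start = time.perf_counter()
        try:
            yield
        finally:
            self.total_ms[name] += (time.perf_counter() - start) * 1e3
            self.calls[name] += 1

    def breakdown(self):
        """Return (name, total_ms, calls) rows, slowest operation first."""
        rows = [(n, self.total_ms[n], self.calls[n]) for n in self.total_ms]
        return sorted(rows, key=lambda row: row[1], reverse=True)


timer = OperationTimer()
for _ in range(3):
    # Stand-in workloads; a real profiler would wrap actual model operations.
    with timer.track("matmul_stand_in"):
        sum(i * i for i in range(50_000))
    with timer.track("relu_stand_in"):
        [max(0, x) for x in range(-500, 500)]

for name, ms, calls in timer.breakdown():
    print(f"{name:>16}: {ms:8.3f} ms over {calls} calls")
```

Aggregating by operation name rather than logging every call keeps the output small enough to show inline next to the code that triggered each operation.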

geoffxy commented 4 years ago

I've looked through apex/pyprof and decided that it is not well suited for our use case. There are a couple of reasons why:

geoffxy commented 4 years ago

Completed as of commit 111661a90a7da909ada66d63a57c11b34d477216.