Closed xuzhao9 closed 3 days ago
We need to isolate the run to a single operator when spawning the subprocess for nsys.
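A minimal sketch of the idea, not the code from this PR: when several operators are passed via `--op`, each nsys subprocess should receive a command line that names exactly one of them. The helpers `split_ops` and `build_nsys_command`, the `python` launcher, and the output naming are all hypothetical and are used here for illustration only.

```python
from typing import List


def split_ops(op_arg: str) -> List[str]:
    """Split a comma-separated --op value into individual operator names."""
    return [name.strip() for name in op_arg.split(",") if name.strip()]


def build_nsys_command(argv: List[str], op: str, output: str) -> List[str]:
    """Build an `nsys profile` command line that runs exactly one operator.

    Hypothetical helper: copies argv, replacing the value of --op with `op`.
    """
    child: List[str] = []
    skip_next = False
    for arg in argv:
        if skip_next:
            # Drop the original comma-separated value that followed --op.
            skip_next = False
            continue
        if arg == "--op":
            child.extend(["--op", op])
            skip_next = True
        elif arg.startswith("--op="):
            child.append(f"--op={op}")
        else:
            child.append(arg)
    # `nsys profile -o <name> <command...>` writes the report to <name>.
    return ["nsys", "profile", "-o", output, "python", *child]


if __name__ == "__main__":
    argv = ["run.py", "--op", "embedding,rms_norm", "--num-inputs", "1"]
    for op in split_ops("embedding,rms_norm"):
        # Each command could be handed to subprocess.run(cmd, check=True).
        print(" ".join(build_nsys_command(argv, op, f"{op}_profile")))
```

Building the command as a list keeps it ready for `subprocess.run` without shell quoting, and produces one nsys report per operator.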
Test plan:
python run.py --op embedding,rms_norm --num-inputs 1 --metrics nsys_rep --csv --dump-csv
@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@xuzhao9 merged this pull request in pytorch-labs/tritonbench@e40e68faab2e7486c38f3dd96c0c23220c80ef60.