Open s-jovic opened 2 months ago
Added e2e test for 128 seq len and proper PCC check in https://github.com/tenstorrent/tt-metal/pull/8218.
To add (after optimized prefill version is enabled):
Added e2e and perf tests here: https://github.com/tenstorrent/tt-metal/commit/4fc1bb496fad50b1ee8c4bbdb4ab0be631d19dc7.
can we close this?
I'll leave this for @s-jovic to check when she is back from vacation; I don't think we have proper 2k e2e tests yet.
Current CI coverage for Falcon 7b prefill:
- test_perf_falcon.py is run as part of the model perf regression pipeline; PCC is checked here, but the threshold is 0.89, so it's not very useful.
- test_perf_falcon.py seems like a good test to discover hangs and perf regressions, but we need to add one for 2k seq len as well, and proper e2e PCC tests for both 128 and 2k seq len.
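To illustrate why a 0.89 PCC threshold catches very little, here is a minimal, standalone sketch of a PCC (Pearson correlation coefficient) check. It is not the helper used in tt-metal; `pearson_cc`, `passes_pcc`, the toy data, and the stricter 0.99 threshold are all hypothetical names and values chosen for the example.

```python
import math
from typing import Sequence


def pearson_cc(expected: Sequence[float], actual: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length flat sequences."""
    if len(expected) != len(actual) or len(expected) < 2:
        raise ValueError("inputs must have the same length (>= 2)")
    n = len(expected)
    mean_e = sum(expected) / n
    mean_a = sum(actual) / n
    # Covariance and variances (unnormalized; the 1/n factors cancel out).
    cov = sum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = sum((e - mean_e) ** 2 for e in expected)
    var_a = sum((a - mean_a) ** 2 for a in actual)
    if var_e == 0 or var_a == 0:
        raise ValueError("PCC is undefined for constant input")
    return cov / math.sqrt(var_e * var_a)


def passes_pcc(expected: Sequence[float], actual: Sequence[float],
               threshold: float) -> bool:
    """True if the PCC between the two sequences meets the threshold."""
    return pearson_cc(expected, actual) >= threshold


# Toy data (hypothetical): the last "device" value is double the reference.
reference = [0.0, 1.0, 2.0, 3.0, 4.0]
drifted = [0.0, 1.0, 2.0, 3.0, 8.0]

pcc = pearson_cc(reference, drifted)              # roughly 0.914
loose_ok = passes_pcc(reference, drifted, 0.89)   # True: loose threshold passes
strict_ok = passes_pcc(reference, drifted, 0.99)  # False: stricter one fails
```

In this toy case one of the five outputs is off by a factor of two, yet the check still passes at 0.89. A real e2e test would flatten the reference and device output tensors before comparing them.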