tenstorrent / tt-mlir

Tenstorrent MLIR compiler
https://tenstorrent.github.io/tt-mlir/
Apache License 2.0
75 stars 11 forks source link

TTNN Op Interface: Op Perf model #305

Open rjakovljevicTT opened 3 months ago

rjakovljevicTT commented 3 months ago

What Provide a TTNN API to return Op perf estimate to the TT-MLIR optimizer. The goal of this issue is to do an end-to-end PoC and conclude what would be the proper solution. Based on the conclusion we will open a new issue to define the whole work.

PoC - September '24

PoC Initially, on an example of MNIST NN and all Ops that are parts of MNIST, do the following tasks:

The above tasks should be done Iteratively until it works well for both TT-MLIR Optimizer and TTNN.

rjakovljevicTT commented 3 months ago

We know it would be a lot of complex work to provide and maintain "Op perf" API with Ops being actively developed and optimized, hence we should also explore potential alternatives in terms of how Optimizer would make performance related decisions. This is to be discussed at least with @s-jovic, @pavlepopovic, @nobradovictt.

mbezuljTT commented 2 months ago

For this sprint, we are not implementing proposed POC.

303 and #304 have been flagged as more urgent, and they carry similar challenges as #305:

On the perf estimate itself, we do want to make a set of experiments to establish a method to measure/calculate perf of op (compute and data movement). Captured under PoC - September '24