tenstorrent / tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.
Apache License 2.0
471 stars 75 forks source link

Model optimization: YoloV4 #14257

Open mywoodstock opened 3 weeks ago

mywoodstock commented 3 weeks ago
shwetankTT commented 2 days ago

https://docs.google.com/spreadsheets/d/1DkhdEmNB8ZVjb0LbgxGXhtoyRxFtySBl1jBGwheEs_U/edit?usp=sharing

Tried multiple permutations and combinations for using row_major_layout the current config is giving us the best output. This is for Down1 (submodel of yolo4)

Concern: Unary Ops taking too much time.

shwetankTT commented 1 day ago

Neck perf improvement. https://docs.google.com/spreadsheets/d/1Zhm2PSm-GVqM5ujH3fru5EwEWG1vN6eqF5kA0y_AxOE/edit?usp=sharing