tenstorrent / tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.
Apache License 2.0

Mistral/Mixtral WH bringup #5337

Open mtairum opened 4 months ago

mtairum commented 4 months ago

Bring-up of Mistral (single-chip) and Mixtral (multi-chip) on Wormhole.

Mixtral8x7b

Status

On main, in models/demos/t3000/mixtral8x7b.

Perf target: 33 tok/s/u

[6 Jun 2024]

[25 May 2024]

Tasks

Mistral

Status

On main, in models/demos/mistral.

Device perf: 13.3 tok/s/u
e2e perf: 10.9 tok/s/u
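
For anyone comparing these numbers against latency measurements, here is a rough sketch of the conversion. It assumes tok/s/u means decode tokens per second per user, so the per-token latency seen by one user is simply the reciprocal. The helper name `ms_per_token` is made up for illustration and is not part of the repo.

```python
def ms_per_token(tok_per_s_per_user: float) -> float:
    """Per-user decode latency in milliseconds for a given tok/s/u figure."""
    if tok_per_s_per_user <= 0:
        raise ValueError("throughput must be positive")
    return 1000.0 / tok_per_s_per_user


# Mistral figures quoted in this issue.
device_ms = ms_per_token(13.3)  # device-only time per token
e2e_ms = ms_per_token(10.9)     # end-to-end time per token

# Under the assumption above, the gap between the two is the per-token
# overhead spent outside the device (host work, data movement, etc.).
overhead_ms = e2e_ms - device_ms

print(f"device: {device_ms:.1f} ms/token")
print(f"e2e:    {e2e_ms:.1f} ms/token")
print(f"host overhead: {overhead_ms:.1f} ms/token")
```

By the same conversion, the Mixtral target of 33 tok/s/u corresponds to roughly 30 ms per token per user.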

Tasks

Issues

mtairum commented 2 months ago

I've updated the main issue with a list of ongoing & future tasks/issues. I might have missed something, so feel free to edit it.

Also added Jack to the issue.