Threading is supported in miri, except the num_cpus::get_physical call which needs the disable isolation flag
Miri is extremely slow at running the full unoptimized gemm loop unfortunately, any non-trivial matrix sizes are skipped in tests (which is a shame, there are more branches to cover for larger sizes).