Closed wenscarl closed 2 months ago
Hi @wenscarl , thanks a lot for your feedback. This is because our FLOPS lookup table does not match your device information, ‘NVIDIA H100 80GB HBM3’. I believe this MR should fix the problem. https://github.com/bytedance/flux/pull/15
Verified to close. Thanks.
Describe the bug Run with the latest update with sm90 support on H100x8 with nvlink, but running into a error.
To Reproduce ./build.sh --arch 90 --package ./scripts/launch.sh test/test_gemm_rs.py 4096 12288 49152 --dtype=float16 --iters=10
Environment