Closed mingxu1067 closed 3 weeks ago
/te-ci jax
/te-ci jax
Can you tell what speed up you see with this PR?
/te-ci jax
@denera As this PR is removed some old API not used, we need to have that documented in the next release. Is there a place we need to add them to be sure to be included in the next releases?
/te-ci jax
/te-ci jax
LGTM 👍. Would be interesting to see the diff in performance if any.
Change to Draft for waiting internal verifiy
/te-ci jax
Description
Reformatted FP8 meta to one set per tensor, removed
fp8_max
andscale_inv
from the set of FP8 meta, and deleted unused functions and types.Fixes # (issue) To avoid unnecessary
slice
of FP8 meta then unblock pipeliner to re-schedule the collectives.Type of change
Changes
Please list the changes introduced in this PR:
fp8_max
andscale_inv
from FP8 meta set.update_fp8_metas
.ShardingType
andMajorShardingType
.Checklist: