suisiyuan closed 3 months ago
add batch_gemm, group_gemm; add int8 dtype to gemm ops; fix the case where world_size exceeds the number of available devices.
support more ops: cast, silu, swiglu, div, mul, sub, gemv, reducemax, reducemin, reducesum, p2p; modify workloads to support input shape groups.
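For readers unfamiliar with the two op names, here is a minimal pure-Python sketch of the usual distinction between batch_gemm and group_gemm. It assumes the common convention that a batched GEMM runs many problems of one shared shape while a grouped GEMM allows a different shape per problem; the PR itself does not spell this out, and the helper functions below are hypothetical, not the PR's implementation.

```python
# Illustrative only: these helpers are hypothetical and are not taken from the PR.

def gemm(a, b):
    """Multiply an (M x K) matrix by a (K x N) matrix, both given as nested lists."""
    k = len(b)
    assert all(len(row) == k for row in a), "inner dimensions must match"
    n = len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(k)) for j in range(n)] for row in a]


def batch_gemm(a_batch, b_batch):
    """Batched GEMM (assumed convention): every problem shares one (M, K, N) shape."""
    shapes = {(len(a), len(a[0]), len(b[0])) for a, b in zip(a_batch, b_batch)}
    assert len(shapes) <= 1, "batch_gemm expects a uniform problem shape"
    return [gemm(a, b) for a, b in zip(a_batch, b_batch)]


def group_gemm(a_group, b_group):
    """Grouped GEMM (assumed convention): each problem may have its own shape."""
    return [gemm(a, b) for a, b in zip(a_group, b_group)]


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    # Two identical 2x2 problems form a valid batch.
    print(batch_gemm([a, a], [b, b]))
    # A 2x2 problem and a 1x3 by 3x1 problem can only be run as a group.
    print(group_gemm([a, [[1, 2, 3]]], [b, [[1], [2], [3]]]))
```

In a benchmark workload, the shape lists passed to such ops would presumably come from the input shape groups mentioned above, but that mapping is an assumption here.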