ROCm / AMDMIGraphX

AMD's graph optimization engine.
https://rocm.docs.amd.com/projects/AMDMIGraphX/en/latest/
MIT License

FP8 Support #1982

Closed causten closed 1 month ago

causten commented 1 year ago

Add the ability to quantize to FP8. This will clearly need additional issues to be opened: flags for the C++/Python API, test cases, updates to our migraphx-driver, new kernels, an FP8 library, etc.

Success for this first issue should be an itemized list of the issues created to support FP8.

krzysz00 commented 1 year ago

I'll go ahead and note that MLIR has FP8 * FP8 -> FP32 support, at least for the underlying GEMMs.

umangyadav commented 10 months ago

umangyadav commented 1 month ago

Closing this issue, as all the tasks are already completed for the FNUZ type. The same kinds of tasks are required for OCP FP8. @CharlieL7, you can open an issue for OCP FP8 following this template.
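
For readers unfamiliar with the FNUZ vs. OCP distinction mentioned above, here is a minimal sketch of how the two encodings differ. It assumes the 8-bit E4M3 layout (1 sign, 4 exponent, 3 mantissa bits); the thread does not say which layouts are covered, and E5M2 variants also exist. The function name `decode_e4m3` is hypothetical and is not part of MIGraphX.

```python
import math

def decode_e4m3(byte: int, fnuz: bool) -> float:
    """Decode one 8-bit E4M3 pattern to a Python float.

    Hypothetical helper for illustration only, not a MIGraphX API.
    fnuz=True  -> E4M3 FNUZ: exponent bias 8, no infinities, no negative
                  zero; the bit pattern 0x80 is the only NaN.
    fnuz=False -> OCP E4M3: exponent bias 7, no infinities; the patterns
                  with all exponent and mantissa bits set (0x7F, 0xFF) are NaN.
    """
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0xF   # 4 exponent bits
    man = byte & 0x7          # 3 mantissa bits

    if fnuz:
        if byte == 0x80:
            return math.nan
        bias = 8
    else:
        if exp == 0xF and man == 0x7:
            return math.nan
        bias = 7

    if exp == 0:
        # Subnormal: no implicit leading 1.
        return sign * (man / 8.0) * 2.0 ** (1 - bias)
    return sign * (1.0 + man / 8.0) * 2.0 ** (exp - bias)

# The same bits mean different values under the two conventions.
print(decode_e4m3(0x38, fnuz=False))  # OCP reading of 0x38
print(decode_e4m3(0x38, fnuz=True))   # FNUZ reading of 0x38
```

Because the bias and the NaN/zero encodings differ, the same byte decodes to different values under each convention, which is one reason kernels, tests, and quantization paths tend to need per-type coverage.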