flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.59k stars, 218 forks

Change datatype for linear kernels away from void * in .cc #1409

Closed. oOTigger closed this 3 weeks ago

oOTigger commented 3 weeks ago

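cublasGemmEx takes its matrix and scalar arguments as void * pointers, each paired with a separate argument naming the actual datatype. This PR changes the linear kernel signatures in the .cc files to use the actual datatype instead of void *, and moves the cast from the actual datatype to void * into the kernel code.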
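The pattern described above can be illustrated with a minimal sketch. It is not the code from this PR: `gemm_ex_stub`, `DataType`, and `linear_forward_kernel` are hypothetical names, and the stub is a CPU stand-in for a type-erased GEMM entry point in the style of cublasGemmEx so that the example compiles without CUDA. The point it shows is only where the cast lives: callers see typed pointers, and the conversion to void * happens inside the kernel wrapper.

```cpp
#include <cassert>

// Hypothetical element-type tag, standing in for the datatype argument
// that a type-erased GEMM API takes alongside its void * pointers.
enum class DataType { Float32, Float64 };

// Typed reference GEMM on row-major data: c (m x n) = a (m x k) * b (k x n).
template <typename T>
static void gemm_typed(int m, int n, int k, T const *a, T const *b, T *c) {
  for (int i = 0; i < m; ++i) {
    for (int j = 0; j < n; ++j) {
      T acc = T(0);
      for (int p = 0; p < k; ++p) {
        acc += a[i * k + p] * b[p * n + j];
      }
      c[i * n + j] = acc;
    }
  }
}

// Hypothetical stand-in for a void *-based GEMM entry point (assumption:
// this only mimics the calling convention, not the real cuBLAS signature).
static void gemm_ex_stub(int m, int n, int k,
                         void const *a, void const *b, void *c,
                         DataType dt) {
  assert(m > 0 && n > 0 && k > 0);
  switch (dt) {
    case DataType::Float32:
      gemm_typed(m, n, k, static_cast<float const *>(a),
                 static_cast<float const *>(b), static_cast<float *>(c));
      break;
    case DataType::Float64:
      gemm_typed(m, n, k, static_cast<double const *>(a),
                 static_cast<double const *>(b), static_cast<double *>(c));
      break;
  }
}

// Typed kernel wrapper: the signature exposes float pointers, and the cast
// to void * is confined to this function body.
void linear_forward_kernel(float const *input, float const *weight,
                           float *output,
                           int batch, int in_dim, int out_dim) {
  gemm_ex_stub(batch, out_dim, in_dim,
               static_cast<void const *>(input),
               static_cast<void const *>(weight),
               static_cast<void *>(output),
               DataType::Float32);
}
```

With this shape, passing a buffer of the wrong element type to `linear_forward_kernel` is a compile-time error at the call site, whereas a void * parameter would accept it silently.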
