Closed FlyFoxPlayer closed 6 months ago
Hi @FlyFoxPlayer,
Thanks for your interest and issue.
I'm sorry for leaving out the eval scripts for W4A16 and FP16. We have provided the experiments setup as well as reproduce scrips in this commit.
Hello @happierpig, is the FP16.cu file also missing in the project directory kernels/baselines/src? I want to know how to evaluate FP16 baseline.
Hello, regarding the efficiency evaluation experiment, it seems that there are only codes for evaluating the throughput and latency of Atom and SmoothQuant. I would like to ask how the throughput and latency results for FP16 and AWQ were obtained?