Open akamaus opened 2 months ago
It's said in the FAQ that models are served in FP16 to conserve the GPU memory. What's about support for cards like 1080Ti than? AFAIK they have terrible performance for FP16 (something like 150GFLOP/S compared to 10TFLOP/S on FP32)
It's said in the FAQ that models are served in FP16 to conserve the GPU memory. What's about support for cards like 1080Ti than? AFAIK they have terrible performance for FP16 (something like 150GFLOP/S compared to 10TFLOP/S on FP32)