Open philschmid opened 2 years ago
AITemplate is coming from Meta production needs, we don't have T4/V100 so in our first release we didn't consider about this. We will help to pass the voice to NVIDIA to see whether they can help.
Hi @philschmid. There's another open source inference acceleration library called voltaML, which gives support for T4. Please check it out
Hello 🙋🏻♂️
It is very cool to see MetaAI going into inference optimization! This will help the community and companies to much long term speaking! While reading through the announcement blog post i noticed that
Which awesome but might be a big limitation for many since A100 is still not very accessible. Having support for NVIDIA T4 (Turing), which is most widely available GPU in public clouds would be very helpful.