Open dearowen opened 2 years ago
does EnergonAI support gpt model with int8 quantitation in model parallel?
Same question. And under other parallel paradigms like tensor/pipeline parallelism.
does EnergonAI support gpt model with int8 quantitation in model parallel?