hpcaitech / EnergonAI

Large-scale model inference.
Apache License 2.0
630 stars 90 forks source link

need guidelines on converting OPT-17B checkpoint #161

Closed gulzainali98 closed 2 years ago

gulzainali98 commented 2 years ago

Hello, I wan to serve OPT-175B model that has about 992 shards which needs to be resharded into 8 models first. Can you guide me, how i can exploit EnergonAI for OPT-175B network please?