Closed dskhudia closed 1 year ago
Is this deployable? If so, could you include a deployment yaml?
@dakinggg : Yaml coming up after we have an image with FT.
@dakinggg : Let us merge this and if there are minor changes we can make them later.
Resolving some of the lint issues.
FT model handler for our mpt models.
Currently it converts the model from hf checkpoint to FT format on the fly. We may want to use a pre-converted model if the model startup time is unacceptable.
Command I used to run it (Both FasterTransformer and conversion script should be in the pythonpath):