cohere-ai / cohere-aws

MIT License
Support SageMaker BYO finetuning #168

Closed youran-qi closed 1 month ago

youran-qi commented 1 month ago

This PR adds a new method, export_finetune, to Client to support SageMaker bring-your-own (BYO) finetuning.

This PR also adds a Jupyter Notebook that shows how to export a customer's own finetuned, merged weights to a TensorRT-LLM engine and then deploy an endpoint from the exported engine.

This PR has been tested end to end: every cell in the Jupyter Notebook has been run and works as expected.
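
The export-then-deploy flow described above can be sketched roughly as follows. This is an illustration only, not the code added by this PR: SketchClient is a hypothetical stand-in that records calls instead of contacting AWS, and the parameter names (name, s3_checkpoint_dir, s3_output_dir, endpoint_name, s3_models_dir), the returned artifact path, and the example bucket names are all assumptions. Check the real Client in cohere-aws for the actual signatures.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


def is_s3_uri(uri: str) -> bool:
    """Return True if the string looks like s3://bucket/..."""
    prefix = "s3://"
    return uri.startswith(prefix) and len(uri) > len(prefix)


@dataclass
class SketchClient:
    """Hypothetical stand-in for Client; it only records what was requested."""

    calls: List[Tuple[str, Dict[str, str]]] = field(default_factory=list)

    def export_finetune(
        self, name: str, s3_checkpoint_dir: str, s3_output_dir: str
    ) -> str:
        # Assumed inputs: merged finetuned weights under s3_checkpoint_dir,
        # exported engine written under s3_output_dir.
        for uri in (s3_checkpoint_dir, s3_output_dir):
            if not is_s3_uri(uri):
                raise ValueError(f"not an S3 URI: {uri!r}")
        self.calls.append(
            (
                "export_finetune",
                {
                    "name": name,
                    "s3_checkpoint_dir": s3_checkpoint_dir,
                    "s3_output_dir": s3_output_dir,
                },
            )
        )
        # Assumed naming scheme for the exported artifact (illustrative only).
        return f"{s3_output_dir.rstrip('/')}/{name}.tar.gz"

    def create_endpoint(self, endpoint_name: str, s3_models_dir: str) -> None:
        # Assumed: the endpoint is deployed from the exported engine location.
        if not is_s3_uri(s3_models_dir):
            raise ValueError(f"not an S3 URI: {s3_models_dir!r}")
        self.calls.append(
            (
                "create_endpoint",
                {"endpoint_name": endpoint_name, "s3_models_dir": s3_models_dir},
            )
        )


if __name__ == "__main__":
    client = SketchClient()
    # Step 1: export the merged weights to an engine artifact.
    artifact = client.export_finetune(
        name="my-finetune",
        s3_checkpoint_dir="s3://example-bucket/checkpoints/",
        s3_output_dir="s3://example-bucket/exported/",
    )
    # Step 2: deploy an endpoint from the exported artifact.
    client.create_endpoint(endpoint_name="my-endpoint", s3_models_dir=artifact)
    for step, params in client.calls:
        print(step, params)
```

The stand-in keeps the two steps separate on purpose: the export produces an artifact location, and the deployment consumes it, which mirrors the order the notebook is described as following.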