ELS-RD / transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
https://els-rd.github.io/transformer-deploy/
Apache License 2.0
1.64k stars 150 forks source link

fix: expose load_external_data for large models #141

Closed ayoub-louati closed 1 year ago

ayoub-louati commented 1 year ago

Tested on cloud-dev with roberta-large. closes #138

ayoub-louati commented 1 year ago

To test you can just run the convert_model command with the given parameters, for example (in local): convert_model -m roberta-base --backend onnx --seq-len 16 256 256 --task embedding --load-external-data (you can test other models, here it is only an example since the issue is based on roberta model)