ggml is gaining traction (e.g. llama.cpp has 10k stars), and it would be great to extend optimum.exporters and enable the community to export PyTorch/Tensorflow transformers weights to the format expected by ggml, having a more streamlined and single-entry export.
Feature request
ggml is gaining traction (e.g. llama.cpp has 10k stars), and it would be great to extend optimum.exporters and enable the community to export PyTorch/Tensorflow transformers weights to the format expected by ggml, having a more streamlined and single-entry export.
This could avoid duplicates as https://github.com/ggerganov/llama.cpp/blob/master/convert-pth-to-ggml.py https://github.com/ggerganov/whisper.cpp/blob/master/models/convert-pt-to-ggml.py https://github.com/ggerganov/ggml/blob/master/examples/gpt-j/convert-h5-to-ggml.py
Motivation
/
Your contribution
I could have a look at it and submit a POC, cc @NouamaneTazi @ggerganov
Open to contribution as well, I don't expect it to be too much work