Support for ggml .bin format models

There are a bunch of FP8 .bin models with magic lmgg on huggingface: https://huggingface.co/ggerganov/whisper.cpp/tree/main.

As an exercise in getting to understand the codebase better, I'm going to try to add support for these, directly.

Is it really as simple as just extending whisper_model_load in whisper.cpp?

Would it be acceptable to refactor it to identify the model type, and then call the appropriate load function?

Is there any information that would need to be inferred that is in not contained in these

ggerganov / whisper.cpp