Open Liberxue opened 2 months ago
We could pull all of this information from the .gguf file. For example, the llama 8b chat model defines the .gguf link, tokenizer and chat format, but if you look at the data defined in the .gguf file for llama 8b chat it has all that info. We just need to turn it into a form we can use. The trickiest bit is the chat format. It is defined in jinja which has embedded functions. It looks like https://docs.rs/minijinja/latest/minijinja/ might work for rust?
Hey
I have a suggestion that I believe could enhance its flexibility and ease of use:
Would it be possible to implement example floneum/models/xxx/src/source.rs using JSON or YAML configuration files to support multiple models? The benefits I envision are:
If this aligns with the project's goals, I'd be willing to contribute by submitting a Pull Request to implement this feature.