Agreed, here are some other improvements that could lead to better token cost estimations:

`TokenCountingHandler` requires us to pass a callable tokenizer, which means we need to find tokenizers that llamaindex already supports as part of their callbacks. Relying on their callbacks also forces us to instantiate the `TokenCounter` as the very first object, which pollutes our lavague-tests configs.
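For context, the setup we currently need looks roughly like this (a sketch of the llamaindex wiring as I understand it, not our exact code; the tokenizer choice is just an example):

```python
import tiktoken
from llama_index.core import Settings
from llama_index.core.callbacks import CallbackManager, TokenCountingHandler

# The handler needs a callable tokenizer up front...
token_counter = TokenCountingHandler(
    tokenizer=tiktoken.encoding_for_model("gpt-4o").encode  # example model
)

# ...and the callback manager must be registered before any LLM object is
# created, which is why the TokenCounter ends up first in our configs.
Settings.callback_manager = CallbackManager([token_counter])

# later, after queries have run:
# token_counter.total_llm_token_count
```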
We might have an easier time counting tokens if we didn't rely on llamaindex's token counting at all 😊
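For example, a minimal sketch of counting directly with tiktoken (`count_tokens` is a hypothetical helper, and this assumes we can intercept prompts/completions ourselves rather than going through llamaindex callbacks):

```python
import tiktoken

def count_tokens(text: str, model: str = "gpt-4o") -> int:
    """Count tokens without llamaindex, falling back to a generic encoding."""
    try:
        encoding = tiktoken.encoding_for_model(model)
    except KeyError:
        # tiktoken doesn't know every model name
        encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text))
```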
I think as we add more models to our built-in pricing info, we will run into ambiguity when a model offered by two different providers has different pricing but exactly the same model name/tag. For example, we may need to know whether to use OpenAI or Azure pricing for the model name 'gpt-4o'.
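One way to resolve this would be to key our pricing table by (provider, model) instead of model name alone, roughly like this (the prices below are placeholders, not real quotes):

```python
# Hypothetical pricing table keyed by (provider, model);
# values are placeholder $ per 1M tokens, not real quotes.
PRICING: dict[tuple[str, str], dict[str, float]] = {
    ("openai", "gpt-4o"): {"input": 2.50, "output": 10.00},
    ("azure", "gpt-4o"): {"input": 2.75, "output": 11.00},
}

def get_pricing(provider: str, model: str) -> dict[str, float]:
    return PRICING[(provider, model)]
```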
It might also be better if, instead of hardcoding our own pricing list, we integrated the LiteLLM dict more directly: https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json. Fetching pricing from there could also help resolve the ambiguity issue, since LiteLLM keeps provider-prefixed entries (e.g. 'azure/gpt-4o' alongside 'gpt-4o').
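A sketch of fetching it at runtime (field names taken from the LiteLLM JSON schema; the raw URL just points at the file linked above):

```python
import requests

# Raw view of the pricing file linked above
LITELLM_PRICING_URL = (
    "https://raw.githubusercontent.com/BerriAI/litellm/main/"
    "model_prices_and_context_window.json"
)

def fetch_litellm_pricing() -> dict:
    resp = requests.get(LITELLM_PRICING_URL, timeout=10)
    resp.raise_for_status()
    return resp.json()

# Entries are keyed by model name, with provider-prefixed variants
# ("azure/gpt-4o" alongside "gpt-4o"), which would also address the
# provider-ambiguity issue above.
prices = fetch_litellm_pricing()
entry = prices["gpt-4o"]
cost = (
    1_000 * entry["input_cost_per_token"]   # e.g. 1k prompt tokens
    + 500 * entry["output_cost_per_token"]  # e.g. 500 completion tokens
)
```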
@paulpalmieri @adeprez