why not use the candle Bert models and Tokenizer::from_file ?

gabrielmbmb / candle-holder

A Rust crate offering similar functionality to the Python transformers package using Candle.

Apache License 2.0

13 stars 0 forks source link

Hi @jondot,

at first I tried using candle-transformers models, but there are many structs that are not public, therefore cannot be imported from candle-holder. Also, to implement PreTrainedModel trait from candle-holder was easy to do this way.

I'm using Tokenizer::from_file at first method to build the tokenizer for the model, but there are sometimes that a model from the Hub was uploaded a long time ago, when https://github.com/huggingface/tokenizers was not used, and as a second method I build the Tokenizer using the vocab.

gabrielmbmb / candle-holder

why not use the candle Bert models and Tokenizer::from_file ? #2