Closed kylebgorman closed 2 months ago
The CustomEncoding class says it is for encoders that lack a Rust-backed tokenizer ("like ByT5"). However it is used in the implementation of all data sets.
CustomEncoding
Can we:
tokenizers.Encoding
@DanielYakubov please advise here.
The
CustomEncoding
class says it is for encoders that lack a Rust-backed tokenizer ("like ByT5"). However it is used in the implementation of all data sets.Can we:
tokenizers.Encoding
in its place?@DanielYakubov please advise here.