Closed sanchit-gandhi closed 2 years ago
do_lower_case
do_upper
if
tokenizer.do_lower_case
The Wav2Vec2 Librispeech tokenizer config has been updated accordingly: https://huggingface.co/speech-seq2seq/flax-wav2vec2-large-lv60-scan/commit/e9904676455f659b34ce9bb5f3c6f1c64eb4bcf3
do_lower_case
to the value ofdo_upper
(True/False) when the tokenizer is created: https://github.com/sanchit-gandhi/seq2seq-speech/blob/0ff54665154a476bcd741603250453709cc480c1/get_ctc_tokenizer.py#L268do_lower_case
->do_lower_case
will take the correct bool value assigned when the tokenizer is createdif
statement to ensuretokenizer.do_lower_case
is set correctlyThe Wav2Vec2 Librispeech tokenizer config has been updated accordingly: https://huggingface.co/speech-seq2seq/flax-wav2vec2-large-lv60-scan/commit/e9904676455f659b34ce9bb5f3c6f1c64eb4bcf3