-
Can you explain how you pre-processed the Korean text?
-
I can't find the .py file that shows how to preprocess the text. Could you push it?
-
It looks like Tokenizer has no built-in support for parsing Chinese text. It can be added using the Jieba package; it just needs some coding work.
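A minimal sketch of that coding work, assuming the `jieba` package is installed and its `lcut` function is used for segmentation: each sentence is segmented, then the tokens are joined with spaces so that a whitespace-based tokenizer can consume the result. The helper names `segment` and `to_space_delimited` are illustrative only, and the per-character fallback is just a stand-in for when Jieba is not available.

```python
from typing import Callable, List, Optional

try:
    import jieba  # third-party package: pip install jieba
except ImportError:  # fallback so the sketch still runs without Jieba
    jieba = None


def segment(text: str) -> List[str]:
    """Split Chinese text into tokens (words with Jieba, else single characters)."""
    if jieba is not None:
        tokens = jieba.lcut(text)
    else:
        tokens = list(text)
    # Drop whitespace-only tokens so joining does not create empty words.
    return [tok for tok in tokens if tok.strip()]


def to_space_delimited(
    texts: List[str],
    segmenter: Optional[Callable[[str], List[str]]] = None,
) -> List[str]:
    """Return each text with its tokens separated by single spaces."""
    seg = segmenter or segment
    return [" ".join(seg(t)) for t in texts]


if __name__ == "__main__":
    print(to_space_delimited(["我爱自然语言处理"]))
```

The space-delimited strings can then be passed to any tokenizer that splits on whitespace.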
-
keras.layers.TextVectorization does not convert Cyrillic characters to lowercase with 'lower_and_strip_punctuation'.
The deprecated keras.preprocessing.text.Tokenizer does this.
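For reference, the expected result can be illustrated in plain Python, where `str.lower()` follows Unicode case mappings. This is only a sketch of the desired behaviour, not the Keras implementation: the function name `unicode_lower_and_strip` is hypothetical, and dropping every character in a Unicode punctuation category is an assumption that may differ from the exact set Keras strips.

```python
import unicodedata


def unicode_lower_and_strip(text: str) -> str:
    """Lowercase text and drop punctuation characters, for any script."""
    lowered = text.lower()  # Unicode-aware, so Cyrillic is handled too
    # Keep every character whose Unicode category is not punctuation (P*).
    kept = [ch for ch in lowered if not unicodedata.category(ch).startswith("P")]
    return "".join(kept)


print(unicode_lower_and_strip("Привет, Мир!"))  # привет мир
```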
```
#================…
-
Adding extra text pre-processing options would be useful for different use cases and might improve model performance on some datasets. The following options could be implemented:
- Removal of Emojis…
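As a rough sketch of what an emoji-removal option could look like, the snippet below strips characters in a few common emoji code-point ranges with a regular expression. The ranges are an approximation and not exhaustive, and the names `EMOJI_PATTERN` and `remove_emojis` are hypothetical rather than part of any existing API.

```python
import re

# Approximate emoji ranges; this list is an assumption and not exhaustive.
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F300-\U0001FAFF"  # pictographs, emoticons, transport, extended symbols
    "\U0001F1E6-\U0001F1FF"  # regional indicator symbols (flags)
    "\u2600-\u27BF"          # miscellaneous symbols and dingbats
    "\uFE0F\u200D"           # variation selector-16 and zero-width joiner
    "]+"
)


def remove_emojis(text: str) -> str:
    """Remove emoji characters, then collapse the whitespace left behind."""
    without = EMOJI_PATTERN.sub("", text)
    return " ".join(without.split())
```

Collapsing whitespace afterwards is a design choice; a caller that needs the original spacing preserved could return `without` directly.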
-
@yerkanattt
-
- Outline the ML models and coding requirements.
- Preprocess PDFs: extract the text, preprocess it, and tokenize it into chunks of a suitable size for Llama 3. This can be done in Python.
- Draft a prompt
- translate prompt to …
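The chunking part of the PDF preprocessing step could be sketched as follows. This is a simplified illustration under stated assumptions: whitespace-separated words stand in for real Llama 3 tokens, PDF text extraction is not shown, and `chunk_text` and its `tokenize` parameter are hypothetical names. A real pipeline would pass in the model's own tokenizer so that chunk sizes match its context limit.

```python
from typing import Callable, List


def chunk_text(
    text: str,
    max_tokens: int,
    tokenize: Callable[[str], List[str]] = str.split,
) -> List[str]:
    """Split text into consecutive chunks of at most max_tokens tokens."""
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    tokens = tokenize(text)  # whitespace split by default (an assumption)
    return [
        " ".join(tokens[i:i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]


print(chunk_text("a b c d e", 2))  # ['a b', 'c d', 'e']
```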
-
Hi, I have recently started learning about lip reading, and I am wondering how the text data is handled. Are words converted to phonemes? This part is not clear to me.
-
When I run inference on a Llama 3 model fine-tuned with Ludwig, I keep getting this error:
```
set_cols, feature, missing_value_strategy, computed_fill_value, backend)
1756 logger.warni…