-
Hi
Thanks for your awesome work!
I tried the Arabic TTS voice (Kareem), and I noticed that an important text preprocessing step is missing.
Arabic text is usually unvocalized (aka diacritized…
-
Currently there are three separate files for preprocessing the documentation. Each file has slightly different functionality and requires specific instructions to ensure the preprocessing is done corr…
-
To implement a common black box we need text loading, extraction of words to be attacked, perturbations, distance metrics, models.
Text Loading needs to be very uniform and universal, it should enc…
-
Add text preprocessing options for NER filter. The options should go in Philter's configuration file since they apply to the model and not to a filter profile.
-
As an admin, I want to be able to parse and preprocess raw text data so I can feed it into a machine learning model for sentiment analysis.
-
Hi!
Great work! Congratulations! Thanks for releasing the code!
However, I am not able to reproduce the results for taskrunners using any of the `allenai/uio2-large`, `allenai/uio2-xl` or `allen…
-
- coming out of #2728
If there is a part definition that is based on a record type that is unknown in the project's dsd, the part should be skippable with the option `drop_on_unknown_record_type: …
-
This is a list of follow-up tasks to #1300.
# General implementation
* [x] Improve text example to include more meaningful dataset
* [x] Improve text example to contain links to further material …
-
keras.layers.TextVectorization does not convert Cyrillic characters to lowercase with 'lower_and_strip_punctuation'.
Deprecated keras.preprocessing.text.Tokenizer does this.
```
#================…
-
Can you tell how you did pre-processing of Korean Text?