-
Ironically, [Diacritics Restoration Using Neural Networks](https://aclweb.org/anthology/papers/L/L18/L18-1247/) lists "Jan Hajic" on the page and in the BibTeX whereas it's spelled "Jan Hajič" in the …
-
If I want to generate the data for the romanian language, how could I do that? Thanks a lot!
-
**Describe the bug**
If there is a comma in the parsed sentence, the PROIEL model:
a) does not tokenize the comma, it just bundles it with the preceding word. The lemma is affected similarly.
b…
-
I installed the Whisper medium Arabic model (STT) to test out the punctuation restoration but it isn't clear how to use it.
The button in settings says:
> This option only works with models that…
-
Hi
Thanks for open sourcing your work. This is a good push for Arabic NLP.
I've a question regarding WER calculation.
In Table (5) - CE WER for D3 model is shown as 109..
Is it ment to be a …
-
Create table for words and nouns. Query it when generating usernames
Reference: https://stackoverflow.com/questions/8674718/best-way-to-select-random-rows-postgresql
-
-
Hi,
I've spent some time for training a small cs dataset (downloaded from given link) by following these steps:
- Download dataset and copy to project folder
- Run diacritization_stripping
- G…
-
EDIT 12/11/2021: Editing this issue to have a clear and focused discussion.
> @xavidelamo: List of countries and subnational divisions: Please ensure that the list of names exactly matches with …
-
## Make sure you have updated your configuration to the most recent version of this projects files _and_ are using the latest version of the "options" required by your Firefox version >_before_< repor…