-
Hi
I checked out Rasa Whatlies on arabic and English using BytePairLanguage. I used pca and umap to see if similar messages cluster. But they don't. Both for English and arabic, they don't seems to…
-
i'm trying to train a LM with my arabic corpus but i have a problem with :
corpus = TextCorpus(Path('/content/drive/My Drive/arabic_corpus'),
dictionary,
i…
-
Hi,
There is a `NonMatchingChecksumError` error for the `lid_msaea` (language identification for Modern Standard Arabic - Egyptian Arabic) dataset from the LinCE benchmark due to a minor update on …
-
**Describe the bug**
I train the model on huge intents (+11000) in Arabic, all is working great, except for the fact that the model doesn't capture false positives in high rate with also high confid…
-
These are both simplistic assertions about an external resource that provide no useful information to a user or a user-agent.
An external resource can have multiple text directions, and languages; att…
-
We are beginning to think about enhancements to the various Cologne Sanskrit Lexicon dictionaries, such as in the [alternate head words](https://github.com/sanskrit-lexicon/alternateheadwords) reposi…
-
_Even though it is not explicitly mentioned, but it looks like this repo is NLP focused, so let me know if this is out of context._
I'd like to add [Arabic Font Classification](https://mhmoodlan.gith…
-
Hi,
yesterday I uploaded a new model to `german-nlp-group/electra-base-german-uncased`:
```bash
$ transformers-cli s3 ls --organization german-nlp-group
Neither PyTorch nor TensorFlow >= 2.0 hav…
-
Thank you for your contribution to Arabic NLP,
I am trying to tune (continue pre-training) the pre-trained model on more task-specific data, and I can't download the model weights, specifically, I ne…
-
Dear @all,
I'm trying to load the English BPEmb model with vocabulary size 30k and 300-dimensional embeddings.
`bpemb_en = BPEmb(lang="en", vs=30000, dim=300)`
Every time I get the same err…