-
I installed it using python setup.py develop --user
I downloaded the code, and when I try to run train.py it shows the error below (please see the attached file). Please check it. …
-
I received an Excel file from our member [Skillwiki](https://tatoeba.org/eng/user/profile/Skillwiki) containing around 70,000 sentences in 11 languages:
- Arabic
- Mandarin Chinese
- English
- F…
-
All CLTK corpora text repos need a converter.py and the subsequent converted cltk_json dir with json files that are produced by converter.py.
One example of a converter.py is here: https://github.…
-
To make progress on Indic shaping we've assembled a corpus of words and syllables by scraping Wikipedia for the ten Indic languages we plan to support (hi.wikipedia.org, bn.wikipedia.org, etc.)
Tha…
-
Hi @stefan-it thanks for the awesome job of providing all these embeddings.
I'm wondering how you trained them and if I could perhaps create a "fast" version of the Swedish ones myself?
I'm in ne…
-
I am generating pretraining data for Hindi using a SentencePiece vocab, and I get the following error.
```
python build_pretraining_dataset.py --corpus-dir data --vocab-file spiece.voca…
```
-
The corpus folder contains 109 files, while the copyright/license file listing contains 132 entries.
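One way to pin down a mismatch like this is to diff the file names on disk against the names in the listing. The snippet below is only a rough sketch: `compare_listing` is a hypothetical helper, and it assumes the listing has already been parsed into one file name per entry, which may not match the actual format of the license file.

```python
from pathlib import Path


def compare_listing(corpus_dir, listed_names):
    """Compare files in corpus_dir with names from a license listing.

    Hypothetical helper: assumes each listing entry is a bare file name.
    Returns (files on disk but not listed, listed entries with no file).
    """
    on_disk = {p.name for p in Path(corpus_dir).iterdir() if p.is_file()}
    listed = set(listed_names)
    unlisted = sorted(on_disk - listed)  # present in the folder, absent from the listing
    missing = sorted(listed - on_disk)   # named in the listing, absent from the folder
    return unlisted, missing
```

Calling it with the corpus folder and the parsed listing would show which entries account for the gap between the two counts.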
-
# Offline Alternative to Google's Read Along App in Hindi
## Description
Develop an offline application (POC - web) that can display a set of Hindi words and accurately determine if the user has p…
-
In Hindi, the superlative form of an adjective uses the following construction:
> सब**से** अच्छा
> _sab-**se** acchā_
> all.**ABL** good
> "best"
The comparative degree + "than" is indicated …