-
Develop models for Hindi from the [IIT Bombay English Hindi Parallel Corpus](http://www.cfilt.iitb.ac.in/iitb_parallel/) using Cython/spaCy-multilingual.
-
Hello,
I installed owncloud through ubuntu software center.
I went to "http://localhost/owncloud/" in my browser to setup owncloud. After typing in username and password and pressing "Finish setup…
-
hello sir,
While using your test data of image and ocr text, after loading text for spell check the system perfomes well ,(for eg. misspelled words are represented with colour).
But when it comes…
-
We must produce a script to transform [mci.txt](https://github.com/sanskrit-coders/stardict-sanskrit/blob/master/sa-kAvya/mbh-cultural-index/mUlam/mci.txt) into a babylon dictionary.
Regarding script…
-
Once upon a time paper and books were costly, and entries were compact.
So, we have for hari:
> हरि hár-i, a. [√ 3. hṛ, be yellow] tawny, yellow (esp. of horses); greenish (rare, C.); m. (C.) steed …
-
namaste @damooo
could you share the scripts you used to scrape tamil-lexicon off dsal? hopefully we can reuse that for other dictionaries there?
-
This issue devoted to comments regarding meta-line/iast conversion of the Cologne digitization of
`Monier-Williams Sanskrit-English Dictionary, 1899`.
This conversion will present some unique chal…
-
The output of devanagari is not good and should be improved. As can be seen in the examples below the problem is not with the generic fontloader -- its output is like the one in context (which is not …
-
In Level 2, should the sentence be tokenized first before lexical splits? E.g. say with whitespace (to start with) as a token boundary.
If we tokenize the sentence first, may be following output pa…
-
In recent times, Machine Learning (ML) based algorithms have been able to achieve
very promising results on many pattern recognition tasks, such as speech, handwriting,
activity and gesture recogn…