-
The UDHR source names this language as Aromanian [rup], but uses a BCP47 code of [rmy], which is Vlax Romani. Which is it?
https://unicode.org/udhr/d/udhr_rmy.html
-
The translation of Article 1 of UDHR in Northern Sami (`sme`) is missing the second sentence.
While I couldn't bother to check if the other parts had anything missing, Omniglot has kindly [provided…
-
Hello!
First of all, thank you for this wonderful project.
It seems that franc limits the text sample to analyse to a hard-coded 2048 chars in these lines
https://github.com/wooorm/franc/blob…
-
scrolling 1000+ users takes ages.... the scroll bar stares me in the face. please let me click and drag it.
while we're at it can we please pin the userlist to the side of the window in the chat ta…
-
I came to know about this repository, I was about to send a mail.
Following Corrections are requested for UDHR Translations.
1. Replacement of colon by Visarga
The Sanskrit Translation uses colo…
-
I'm working on a crystal port of [franc](https://github.com/wooorm/franc) and once it's working I'd like to merge it in Cadmium if you're ok with this.
-
I'm writing a function that receives a corpus object but it doesn't know which corpus is it. In this function I need to get all the words in the corpus; when I try with `machado` or `stopwords` corpor…
-
Use-Cases for Privacy / Dignity (and/or user-stories, et.al.) required.
ghost updated
5 years ago
-
#### Expected Behavior
API methods of NLTK need dependencies (listed below). This can be done by commands:
```
import nltk
nltk.download('all')
```
The details of dependencies:
```pun…
-
Maghrebi Arabic is the variety of Arabic spoken (and written) across the Maghreb region. Depending on the situation, it can be written in Arabic script, Hebrew script, or Latin script.
For now, I …