-
The Programming Historian has received the following tutorial on "Sentiment Analysis with 'syuzhet' using R" translated by @acrymble. This lesson is now under review and can be read at:
http://pr…
-
- uid: indic_nlp_corpus
- type: processed
- description:
- name: Indic NLP Corpus
- description: The IndicNLP corpus is a large-scale, general-domain corpus containing 2.7 billion words for 10 Ind…
-
The NER dataset link points to https://github.com/MISabic/NER-Bangla-Dataset; however, this link returns a 404 (Page not found) error.
Could you please add details of the dataset used to train the NER task?
It …
-
(Apologies if this is not the right repository to report this issue)
The data prepared for the Malayalam language has an issue. Consistently, there is a space before and after the Virama ് (U+0D4D)…
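
The symptom described above can be checked for mechanically. The following is a minimal sketch, not taken from the issue itself: the function name `count_detached_viramas` and the sample strings are hypothetical, and it only detects whitespace placed directly before the virama (a combining sign should not follow a space). It does not attempt to repair the data, since a space after a word-final virama can be legitimate.

```python
import re

VIRAMA = "\u0D4D"  # MALAYALAM SIGN VIRAMA

# Whitespace immediately followed by the virama: a combining sign
# should attach to a preceding consonant, never to a space.
SPACE_BEFORE_VIRAMA = re.compile(r"\s+" + VIRAMA)


def count_detached_viramas(text):
    """Return how many viramas in `text` are preceded by whitespace."""
    return len(SPACE_BEFORE_VIRAMA.findall(text))


# Hypothetical samples: KA + virama + KA, with and without stray spaces.
sample_bad = "\u0D15 \u0D4D \u0D15"
sample_ok = "\u0D15\u0D4D\u0D15"

print(count_detached_viramas(sample_bad))
print(count_detached_viramas(sample_ok))
```

Running a count like this over the prepared corpus would give a rough measure of how widespread the problem is before deciding on a fix.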
-
spaCy's sentencizer fails with languages like Bengali, Hindi, Kannada, Sinhala, Tamil, Telugu and Urdu, even though the documentation lists these languages as supported.
With languages like Bengali and Hi…
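
One plausible cause, stated here as an assumption rather than something the issue confirms, is that several of these languages end sentences with marks other than `.`, `!` and `?`, such as the danda (U+0964). The sketch below is plain Python and is not spaCy's implementation. The terminator set and the name `split_sentences` are illustrative choices only.

```python
import re

# Assumed sentence terminators: danda, double danda, Arabic full stop
# (used in Urdu), plus the ASCII marks. This list is illustrative,
# not exhaustive.
TERMINATORS = "\u0964\u0965\u06D4.!?"

# Split on whitespace that directly follows one of the terminators,
# so each terminator stays attached to its sentence.
_BOUNDARY = re.compile(r"(?<=[" + re.escape(TERMINATORS) + r"])\s+")


def split_sentences(text):
    """Split `text` into sentences at the assumed terminator marks."""
    return [s for s in _BOUNDARY.split(text.strip()) if s]


# Hindi sample with two sentences, each ending in a danda.
hindi = "यह पहला वाक्य है। यह दूसरा वाक्य है।"
for sentence in split_sentences(hindi):
    print(sentence)
```

A rule-based splitter of this kind ignores abbreviations and quoted speech, so it is only a baseline to compare against the library's behaviour.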
-
Need to create a template poster for sponsor announcement.
-
Indic
- [x] Hindi - To be done later
- [ ] Gujarati - To be done later
- [ ] Tamil
- [ ] Telugu
- [ ] Bengali
Asian
- [ ] Chinese
- [ ] Korean
- [ ] Japanese
We should be able to add con…
-
Facebook's recently open-sourced `fasttext` (https://github.com/facebookresearch/fastText) improves on the `word2vec` SkipGram model. It follows a similar output format for `word` - `vector` key value pairs,…
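
To illustrate the `word` - `vector` layout mentioned above, here is a minimal sketch of a reader for the textual vector format. It assumes a header line of the form `<count> <dim>` followed by one word and its space-separated floats per line. The function name `load_text_vectors` and the sample data are hypothetical and are not part of either library.

```python
import io


def load_text_vectors(handle):
    """Read word vectors from a text stream into a dict of float lists."""
    # Assumed header: vocabulary size and vector dimension.
    _count, dim = (int(x) for x in handle.readline().split())
    vectors = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        # Skip malformed lines (for example, tokens containing spaces).
        if len(parts) != dim + 1:
            continue
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# Tiny in-memory sample standing in for a real vector file.
sample = io.StringIO("2 3\nhello 0.1 0.2 0.3\nworld 0.4 0.5 0.6\n")
vectors = load_text_vectors(sample)
print(sorted(vectors))
```

Because the reader only depends on the shared text layout, the same code should work for vectors produced by either tool, provided the file really follows the assumed header convention.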
-
With PR https://github.com/RasaHQ/rasa_nlu/pull/1095 we have an embedding + crf pipeline which can do intent and entity recognition in any language.
**If you are currently testing this and are wil…
-
Info about spaCy
spaCy version 2.0.11
Location /home/prashant/.local/lib/python3.6/site-packages/spacy
Platform Linux-4.15.0-20-generic-x86_64-wi…