-
It's so great that RediSearch could be another choice for full-text search since i am an Elasticsearch fans and a search engineer for almost ten years.
I am the author of [Friso](https://github.com…
-
Hello there
I participated on the Whisper fine-tuning event hold last December. As result, I trained some models for Catalan language finetuned using Common Voice 11. Here are the models that we tr…
-
Hello,
Thanks for creating this very helpful tool!
I am fine-tuning the **_model (GPT-J-6B)_** for the question answering on the private documents. I have 1000+ documents and they are all in text f…
-
# Welcome to the Common Voice Community !
> Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labelled voice data that is representative of l…
-
Hi all,
I have gone through the docs and i have a doubt regarding the data in NER training.
https://github.com/zalandoresearch/flair/blob/master/resources/docs/TUTORIAL_6_CORPUS.md
Can i Have…
-
https://github.com/tesseract-ocr/tesseract/issues/648#issuecomment-271987456
>Indic may be troubled by the length of the compressed codes used.
@theraysmith Can you explain a little more about t…
-
Natural Language Processing (NLP) enables machine learning algorithms to organize and understand human language. NLP enables machines to not only gather text and speech but also identify the core mean…
-
I already have tagged data in the following format:
**WORD tab POS_TAG tab NER**
भारत NNP loc
की PSP O
पहली QO O
महिला NNC O
फोटो NN O
जर्नलिस्ट NNPC O
होमी NNP per
सरकार NN O
से PSP O
…
-
### 🚀 The feature
#Hindi Language Support for Indians
As for Indians, Hindi is also must be considered in Doctr-Vocabs.
### Motivation, pitch
As in India, Mostly documents are in Hindi langu…
-
Hi, I'm trying to run `./download_data.sh $SUBSCRIPTION_KEY`, and I ran into some issues. I've tried both `indictrans` and with microsoft translator subscription ID. With `indictrans` package, I seem …
iglee updated
2 years ago