Caucasus-Rosetta / Lingua-Corpus

Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)
Apache License 2.0
33 stars 6 forks source link

Interview with Prefect Strangers magazine #86

Closed danielinux7 closed 3 years ago

danielinux7 commented 3 years ago

Ахцәажәара

The community manager of Common Voice asked me if I am interested to have an interview with Perfect Strangers magazine as a contributor to Common Voice, I'll have to answer few questions.

Ауадаҩрақәа

  1. Both: In what settings do you find broadening voice samples most crucial? What do you think is at stake in these settings?
  2. Community Member: How did each of you become involved in the project? What drew you to it?
  3. Hillary/ Community member: How do you circulate and make the current datasets accessible to tech companies who might benefit from them?
  4. Team: I was trying to sample my voice on the website and I was curious to know about how you train your voice validators to validate samples and what sorts of cues do they look for?

Аӡбарақәа

  1. I think gender equality is an important part, and we could make sure it applies properly on contributors. Having a balanced unbiased dataset is at stake, which would allow a technology that works for everyone.
  2. I came across Common Voice through an article that highlighted the top open source projects of the year, that was back in 2017. I realized the importance of this project, people are becoming more and more involved in voice technologies, it's becoming part of our lives, I'm using it right now to answer your questions!
  3. At this point, we are getting companies and organizations involved in the process of building up the dataset, then gradually bring awareness to what is possible.
  4. As you go on to validate or sample voice on the website, you would come across a link that takes you to the "understand contribution criteria", which is the guideline, but in general the validators should look for misreadings, background voices.