sugarlabs / GSoC

A guide for participating in Google Summer of Code with Sugar Labs

Pronunciation Checker Activity #12

Closed vaibhavdaren closed 5 years ago

vaibhavdaren commented 5 years ago

Pronunciation Checker

The main goal of this activity is to help students learn correct pronunciation. The idea is to convert the student's speech into text written according to phonetic rules, then match that text against the correct phonetic spelling stored for each word. A score is given according to how correctly the word was pronounced. This will help students learn both the correct pronunciation and the phonetics of the word. The activity could have levels such as words, sentences, and paragraphs.

Open for discussion.

rhl-bthr commented 5 years ago

Thanks, here are my comments;

vaibhavdaren commented 5 years ago

Yes, the student will need to build his own system for scoring. My idea was to match each phonetic symbol. E.g. the phonetic spelling of "about" is əbawt. If 4 of the 5 symbols in the speech-to-text output match, we can score him 80% correct and highlight the wrongly pronounced symbol. Suppose the spoken phonetics result is abawt; then he will get an 80% score. I have been working on ML myself for the last few weeks, and I guess @iqraceme might have good experience in this field.
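A minimal sketch of that per-symbol scoring, assuming the speech-to-text step already yields a phonetic string. The function name `score_pronunciation` is hypothetical, and the comparison is purely positional, which is an assumption taken from the example above rather than a settled design:

```python
from itertools import zip_longest


def score_pronunciation(expected, spoken):
    """Compare two phonetic transcriptions symbol by symbol.

    Hypothetical helper: returns (percent_correct, mismatched_positions).
    Symbols are compared position by position; a missing or extra
    symbol at the end counts as a mismatch.
    """
    pairs = list(zip_longest(expected, spoken))
    mismatches = [i for i, (e, s) in enumerate(pairs) if e != s]
    total = len(pairs)
    percent = 100.0 * (total - len(mismatches)) / total if total else 0.0
    return percent, mismatches


# "about": expected əbawt, spoken abawt
percent, wrong = score_pronunciation("əbawt", "abawt")
print(percent, wrong)  # 80.0 [0]
```

The returned positions can be used to highlight the wrongly pronounced symbols. A positional match penalises every following symbol when one sound is inserted or dropped in the middle of a word, so an edit-distance alignment may be a better fit there. Symbols written with combining diacritics would also need to be grouped before comparison.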

cemeiq commented 5 years ago

@vaibhavdaren, Yeah I have some experience with ML.

rhl-bthr commented 5 years ago

Thanks,

quozl commented 5 years ago

Do the end users of Sugar Desktop have such a need?

I'm struggling to understand how I could teach this subject to the primary school children in our target demographic, but I thought that was just my ignorance in the subject. Can I see a lesson plan?

assumes that the student speaks in English

We have previously valued all languages, through our human translation team, and by using iconography instead of text for user interfaces. At OLPC, most of our devices went to countries that speak Spanish or a derivative. So I would like to hear how the idea would work for Spanish and other languages.

Assuming a microphone and speaker is okay.

vaibhavdaren commented 5 years ago

What will be the tentative size of the trained model that will be embedded in the activity?

  • The trained model is expected to be 400-500 MB in size.

Are there any open source datasets for this purpose?

  • No, I didn't find any such open source dataset.

Do the end users of Sugar Desktop have such a need?

  • The activity is meant for students of all ages, as they will learn how to pronounce words properly. There can be different words for different age groups.
  • In various developing countries in Asia, children are taught to speak using the correct phonemes. Our own college and some of the examination boards of India, especially the CISCE board, teach students the correct phonemes manually in class. This is one of the most important parts of language learning in their course curriculum.

Curriculum CISCE

Please see page 82 for this course curriculum. The main purpose of these initiatives is cross-cultural communication.

This idea assumes that the student speaks in English (if not, then this idea will require more effort than a GSoC Project)

Actually, phonemes are largely the same across languages, except that some are used in one language but not in others. English uses most of the common phonemes.

We have previously valued all languages, through our human translation team, and by using iconography instead of text for user interfaces. At OLPC, most of our devices went to countries that speak Spanish or a derivative. So I would like to hear how the idea would work for Spanish and other languages.

If you refer to the Phonology article on Wikipedia:

Phonology is a branch of linguistics concerned with the systematic organization of sounds in languages. It has traditionally focused largely on the study of the systems of phonemes in particular languages (and therefore used to be also called phonemics, or phonematics), but it may also cover any linguistic analysis either at a level beneath the word (including syllable, onset and rime, articulatory gestures, articulatory features, mora, etc.) or at all levels of language where sound is considered to be structured for conveying linguistic meaning.

Here are the phonetic inventories of a few languages:

  1. English
  2. French
  3. Spanish

quozl commented 5 years ago

That sounds like a style of mechanised memorisation learning which is somewhat foreign to Sugar Labs. @walterbender, can you suggest how to modify this lesson plan to better fit with constructivist teaching methods?

walterbender commented 5 years ago

I would like to see some activity associated with this Activity that involves some creativity by the students and perhaps some sort of collaboration or interaction between students (taking advantage of Sugar's strengths) and leveraging our pedagogical approach. Maybe combine it with a chat activity where the chat is enabled by proper pronunciation (and perhaps something fun happens in the case that there is mispronunciation). And where the students can also "critique" and support each other?

rhl-bthr commented 5 years ago

Fixed in 5d22851a