SlangLab-NU / VoiceCollector

Apache License 2.0
1 stars 1 forks source link

Make data from databases usable for ASR training #32

Open aanchan opened 9 months ago

aanchan commented 9 months ago

WWW As a researcher wanting to use data collected from voice collector, I would like to have a tool/script that can put together information from the audio_table and the references_table in order to generate a CSV file containing paths to audio files and their corresponding transcriptions. The audio files themselves are stored in minio. The script would need to:

AC Submit a pull-request to the VoiceCollector repo with a util written in python that outputs a csv file.

aanchan commented 9 months ago

Sample references: 1|Image|Read the following book pages|1|thomas_birthday_1.jpg 2|Image|Read the following book pages|2|thomas_birthday_2.jpg 3|Image|Read the following book pages|3|thomas_birthday_3.jpg 4|Image|Read the following book pages|4|thomas_birthday_4.jpg 5|Image|Read the following book pages|5|thomas_birthday_5.jpg 6|Image|Read the following book pages|6|thomas_birthday_6.jpg 7|Image|Read the following book pages|7|thomas_birthday_7.jpg 8|Image|Read the following book pages|8|thomas_birthday_8.jpg 9|Image|Read the following book pages|9|thomas_birthday_9.jpg 10|Image|Read the following book pages|10|thomas_birthday_10.jpg 11|Image|Read the following book pages|11|thomas_birthday_11.jpg 12|Image|Read the following book pages|12|thomas_birthday_12.jpg 13|Image|Read the following book pages|13|thomas_birthday_13.jpg 14|Image|Read the following book pages|14|thomas_birthday_14.jpg 15|Image|Read the following book pages|15|thomas_birthday_15.jpg 16|Image|Read the following book pages|16|thomas_birthday_16.jpg 17|Sentences (Bilabials)|My mom climbed Burnaby Mountain from the bottom in thirty minutes.|17| 18|Sentences (Liquids & Glides)|Luna yelled at the little layered pastry for tasting yucky.|18| 19|Automatic phrases|I have one cookie jar with five cookies.|19| 20|Automatic phrases|I see two birds and three chicks in the nest.|20| 21|Automatic phrases|I would like four pieces of bread, please.|21| 22|Automatic phrases|There are ten cars waiting in line.|22| 23|Image|Tell me what is happening in this image|23|Figure-1.png 24|Automatic phrases|Thank you for cooking our dinner. I really appreciate it.|24| 25|Automatic phrases|I would like to throw a ball.|25| 26|Automatic phrases|May I have some water please?|26| 27|Automatic phrases|Are we having pizza or pasta for dinner?|27| 28|Automatic phrases|We could have dinner at six and then go to a movie. How does that sound?|28| 29|Automatic phrases| Excuse me, do you know what time it is?|29| 30|Automatic phrases|I’m not sure if we should paint the room yellow or blue. What do you think?|30| 31|Automatic phrases|When is grandma coming over?|31| 32|Automatic phrases|I can’t wait for my birthday party!|32| 33|Automatic phrases| How do you think animals communicate?|33| 34|Automatic phrases|My friend and I have the same teacher!|34| 35|Automatic phrases|Can we have a dog please?|35| 36|Automatic phrases|I saw a cool lizard at school today!|36| 37|Image|Read the following book pages|37|good_sport_1.jpg 38|Image|Read the following book pages|38|good_sport_2.jpg 39|Image|Read the following book pages|39|good_sport_3.jpg 40|Image|Read the following book pages|40|good_sport_4.jpg 41|Image|Read the following book pages|41|good_sport_5.jpg 42|Image|Read the following book pages|42|good_sport_6.jpg 43|Image|Read the following book pages|43|good_sport_7.jpg 44|Image|Read the following book pages|44|good_sport_8.jpg 45|Image|Read the following book pages|45|good_sport_9.jpg 46|Image|Read the following book pages|46|good_sport_10.jpg 47|Image|Read the following book pages|47|good_sport_11.jpg 48|Image|Read the following book pages|48|good_sport_12.jpg

aanchan commented 9 months ago

Sample audio database: 1|90.5452132089414|test90.5452132089414.webm|2023-08-15 22:53:12|34 2|15.950413213527415|test15.950413213527415.webm|2023-08-16 00:11:09|9 3|86.14580150748988|test86.14580150748988.webm|2023-08-16 00:13:58|1 4|71.71650248448806|test71.71650248448806.webm|2023-08-16 00:14:12|2 5|68.21038450849358|test68.21038450849358.webm|2023-08-16 00:14:40|3 6|29.20648249613933|test29.20648249613933.webm|2023-08-16 00:15:14|4 7|38.11082270888455|test38.11082270888455.webm|2023-08-16 00:15:32|5 8|61.84887536835464|test61.84887536835464.webm|2023-08-16 00:15:48|6 9|47.776109238471534|test47.776109238471534.webm|2023-08-16 00:16:10|7 10|66.88089739412312|test66.88089739412312.webm|2023-08-16 00:16:28|8 11|9.292635633744716|test9.292635633744716.webm|2023-08-16 00:16:37|9 12|42.23791141344206|test42.23791141344206.webm|2023-08-16 00:16:54|10 13|41.71153386762674|test41.71153386762674.webm|2023-08-16 00:17:15|11 14|49.547201033846086|test49.547201033846086.webm|2023-08-16 00:17:32|12 15|75.28569496432979|test75.28569496432979.webm|2023-08-16 00:17:45|13 16|44.14968264350971|test44.14968264350971.webm|2023-08-16 00:18:07|14 17|95.62971915318631|test95.62971915318631.webm|2023-08-16 00:18:23|15 18|43.918835634562555|test43.918835634562555.webm|2023-08-16 00:18:31|16 19|77.39768337966926|test77.39768337966926.webm|2023-08-16 00:18:51|17 20|90.59317492183456|test90.59317492183456.webm|2023-08-16 00:19:12|18 21|87.36867897123588|test87.36867897123588.webm|2023-08-16 00:19:23|19 22|67.377431143301|test67.377431143301.webm|2023-08-16 00:19:35|20 23|46.158647816474094|test46.158647816474094.webm|2023-08-16 00:19:45|21 24|27.833182974129866|test27.833182974129866.webm|2023-08-16 00:19:56|22 25|33.56565705176375|test33.56565705176375.webm|2023-08-16 00:20:33|23 26|0.39116235803118293|test0.39116235803118293.webm|2023-08-16 00:20:54|24 27|26.545382621622117|test26.545382621622117.webm|2023-08-16 00:21:06|25 28|9.704355622565553|test9.704355622565553.webm|2023-08-16 00:21:15|26 29|36.658478108390824|test36.658478108390824.webm|2023-08-16 00:21:25|27 30|8.568135689765576|test8.568135689765576.webm|2023-08-16 00:21:40|28 31|93.57705897295713|test93.57705897295713.webm|2023-08-16 00:21:53|29 32|7.383309701864382|test7.383309701864382.webm|2023-08-16 00:22:11|30 33|80.42346465121872|test80.42346465121872.webm|2023-08-16 00:22:23|31 34|37.56547864439701|test37.56547864439701.webm|2023-08-16 00:22:34|32 35|20.86044068854904|test20.86044068854904.webm|2023-08-16 00:22:43|33 36|95.49690043586709|test95.49690043586709.webm|2023-08-16 00:22:57|34 37|69.10512978049334|test69.10512978049334.webm|2023-08-16 00:23:04|35 38|10.049812363736145|test10.049812363736145.webm|2023-08-16 00:23:15|36 39|25.246488021252222|test25.246488021252222.webm|2023-08-16 00:23:36|37 40|60.05392971699908|test60.05392971699908.webm|2023-08-16 00:24:00|38 41|36.587912063866625|test36.587912063866625.webm|2023-08-16 00:24:14|39 42|85.14404436260936|test85.14404436260936.webm|2023-08-16 00:24:29|40 43|2.7165354532003194|test2.7165354532003194.webm|2023-08-16 00:24:48|41 44|9.362182439458277|test9.362182439458277.webm|2023-08-16 00:25:03|42 45|8.320784769377166|test8.320784769377166.webm|2023-08-16 00:25:10|43 46|8.415866978231822|test8.415866978231822.webm|2023-08-16 00:25:21|44 47|55.25154535277419|test55.25154535277419.webm|2023-08-16 00:25:31|45 48|89.3926082242285|test89.3926082242285.webm|2023-08-16 00:25:45|46 49|22.972933361290625|test22.972933361290625.webm|2023-08-16 00:25:58|47 50|72.00705289164486|test72.00705289164486.webm|2023-08-16 00:26:04|48

aanchan commented 9 months ago

Sample output CSV : https://github.com/SlangLab-NU/links/blob/main/xlsr-53_Torgo_dataset/output_modified.csv