Closed burrage closed 1 year ago
@weepingwitch This report is still a work in progress, but the preliminary results we're seeing point to an issue with missing media assets that we should look into, given the prevalence.
I'm spinning up a separate task for the investigation but tagging here so we can keep apprised of when the report is done done.
@vinay961861 @Ashish1998 Just a heads up, we looked at Luganda, Afrikaans, and Australian English and found that the files should all be on the ftm_languagepacks repo. Do you know if the script checking for the media is case insensitive?
Is it also looking in multiple folders for the audio? Or just one? I know for our FTM web versions, we mentioned we would ultimately put everything into a single /audios/ folder, but on the ftm_languagepacks repo, we have multiple directories like "/Words/", "/Letters/", etc.
Hiii @burrage we noticed that the .wav extension is in upper and lower case so we got more issues, so now we updated the script. Even then we got some missing audios list, so the problem here was in the json it is showing as letters folder, but the audio is actually present in word folder.
Hey @burrage Any update on this ?
Which language should we look at for examples @Ashish1998 @divyahariharan2103 ? It sounds like we would need to re-generate the JSON with an updated path if that's the case.
@weepingwitch See above messages for JSON path confusion in the audio assets. We had decided to forego the more specific subfolders and lump everything together in a root .../[language]/audios/ folder to cutdown the onus on correctly filing each media asset-- with Stephanie, we determined there wouldn't be any overlap if we combined to one root folder.
Sorry, misunderstood what was written.
@weepingwitch , I see the JSONs have that agreed-upon path already .../[language]/audios/ so we are good there.
@Ashish1998 @divyahariharan2103 We'll need a task to consolidate existing subfolders of the audio we've downloaded into this root level .../[language]/audios/ structure. Does it make sense to do this physical consolidation as part of another process we already have? Like when we are converting the files to MP3?
@Rajesh1041 Mentioned we have this consolidation happening already so we do not need to make a new task. But we should re-run the reports!
Hiii @burrage we have updated the issue document. Please have a look into it. Thank you.
User Story As a developer and Product Owner, I want to run the JSON check and missing audio scripts on 20 languages to write a report on what we find if we were to try to build each of these languages, So that we can identify any further problems with being able to build these localizations.
Acceptance Criteria Given that I have run the JSON check and missing audio scripts on 20 language JSON files and for 20 language packs from GitHub and have recorded the results in the following Google Doc, When I view the doc, Then I can see what languages could be built without issue.
Language JSON files
Languages:
Zulu
Pashto
Luganda
Nepali
Ukrainian
Afrikaans
Amharic
Azerbaijani
Bangla
Australian English
Indian English
South African English
West African (Nigerian) English
Farsi
Georgian
Haitian Creole
Hausa
Igbo
isiXhosa
Javanese
If we have time, remaining languages:
Kinyarwanda
Kiswahili
Kurdish
Malagasy
Malay
Marathi
Ndebele
Oromo
SePedi
Sesotho
Shona
Siswati
Somali
Spanish
Tagalog
Tajik
Thai
Tsonga
Tswana
Turkish
Twi
Urdu
Venda
Vietnamese
Wolof
Yoruba