Umbaji / NMTMD

Official repository for the open-source text dataset for NMT for local languages in West Africa (EWE Corpus)
https://www.umbaji.org/lang/en/home/speech-recognition-for-local-langages
MIT License
23 stars, 10 forks

Create a GitHub Action to run data_preprocessing automatically on pushed data #4

Open Umbaji001 opened 1 year ago

Umbaji001 commented 6 months ago

The JSON file needs to be converted to the same format as the CSV. Remember that we will query the dataset as data[1,"MAT"], or more precisely data[1,"MAT"][1], so we need a Python script to correct the JSON accordingly.
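
A rough sketch of what such a corrector could look like, under several assumptions that are not confirmed anywhere in this thread: the JSON is nested as {code: {group: {item: text}}}, the CSV uses the hypothetical columns group, code, item, and text, and the trailing [1] selects an item by its number. The names flatten_json, rows_to_csv, and Dataset are made up for illustration and are not part of the repository.

```python
import csv
import io
import json


def flatten_json(json_text):
    """Flatten the assumed nesting {code: {group: {item: text}}} into flat rows."""
    nested = json.loads(json_text)
    rows = []
    for code, groups in nested.items():
        for group, items in groups.items():
            for item, text in items.items():
                rows.append(
                    {"group": int(group), "code": code, "item": int(item), "text": text}
                )
    return rows


def rows_to_csv(rows):
    """Serialize the rows using the hypothetical CSV column layout."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["group", "code", "item", "text"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


class Dataset:
    """Index rows so that data[1, "MAT"][1] returns the matching text."""

    def __init__(self, rows):
        self._index = {}
        for row in rows:
            # CSV readers yield strings, so the numeric fields are cast back to int.
            key = (int(row["group"]), row["code"])
            self._index.setdefault(key, {})[int(row["item"])] = row["text"]

    def __getitem__(self, key):
        # data[1, "MAT"] arrives here as the tuple (1, "MAT").
        return self._index[key]


# Made-up sample data, only to exercise the round trip JSON -> CSV -> query.
sample_json = '{"MAT": {"1": {"1": "first text", "2": "second text"}}}'
rows = flatten_json(sample_json)
csv_text = rows_to_csv(rows)
data = Dataset(csv.DictReader(io.StringIO(csv_text)))
print(data[1, "MAT"][1])
```

If the real files use a different nesting or different column names, only flatten_json and the fieldnames list would need to change; the data[1,"MAT"][1] lookup itself relies on plain tuple indexing through __getitem__.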