Closed SamuelCahyawijaya closed 6 months ago
I need to contact the dataset provider. The dataset requires Git LFS to download (all the zip files in the speech folders are Git LFS pointers), but this error occured.
Hello all, I have tried contacting the dataset provider for struct_amb_ind, but there is no response. I think I will unassign myself from this task if the author does not respond within early next week.
Hello all, I have tried contacting the dataset provider for struct_amb_ind, but there is no response. I think I will unassign myself from this task if the author does not respond within early next week.
The author does not respond to the Git LFS bandwidth problem. I am unassigning myself from this task, and might retake the task once I have the update for the problem.
Hi @jen-santoso, sorry for the late reply. @ruhiyahfw, the dataset owner, is looking into the problem causing this right now. Let's wait for an update from her for the time being. 🙏 Thanks for waiting!
Hi @jen-santoso, sorry for the late reply. @ruhiyahfw, the dataset owner, is looking into the problem causing this right now. Let's wait for an update from her for the time being. 🙏 Thanks for waiting!
Hi @ruhiyahfw, is there any update on this?
Due to some technical issues, the dataset owner can't push the data to the repo. However, she gave me access to the data via other means. Maybe we can treat it as a _LOCAL = True
dataloader going forward for now, @jen-santoso? I'll send you the data URL via Discord.
Thank you @holylovenia ! I will retake the ticket again!
So, how to get the data for the dataset? I think we can add this information to the dataloader as well (like for example: please contact xxx to get the access to the dataset). What do you guys think? @jen-santoso @holylovenia
Dataloader name:
struct_amb_ind/struct_amb_ind.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?struct_amb_ind