SamuelCahyawijaya closed this 3 months ago
Hello @holylovenia @SamuelCahyawijaya @sabilmakbar,
After checking the data, I found that this dataset is for sign language, which means the data are all images. I don't think it can support the language modeling task. Could you please check?
Apologies for the late reply. We're on it right now, as we found some other datasets with similar issues to this one.
@Alex-HaochenLi Sorry for the mistake. I've fixed the datasheet to have Sign Language Recognition instead of Language Modeling. We'll probably have to add a new task in constants.py to cater to this dataloader, though.
What do you think?
cc: @sabilmakbar @SamuelCahyawijaya
Hi @Alex-HaochenLi, @sabilmakbar has kindly added the SIGN_LANGUAGE_RECOGNITION task so you can proceed with the dataloader implementation.
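For anyone following along, here is a rough sketch of how a dataloader might declare the new task. It is only an illustration: the `Tasks` enum below is a hypothetical stand-in for whatever constants.py actually defines, the string values "LM" and "SLR" are assumptions, and `SUPPORTED_TASKS` and `supports` are made-up names, not the repository's real API.

```python
from enum import Enum


class Tasks(Enum):
    """Hypothetical stand-in for the task enum in constants.py."""

    LANGUAGE_MODELING = "LM"  # assumed value, for illustration only
    SIGN_LANGUAGE_RECOGNITION = "SLR"  # assumed value, for illustration only


# Hypothetical dataloader metadata: the dataset holds sign language images,
# so it declares the recognition task rather than language modeling.
SUPPORTED_TASKS = [Tasks.SIGN_LANGUAGE_RECOGNITION]


def supports(task: Tasks) -> bool:
    """Return True if the dataloader declares support for the given task."""
    return task in SUPPORTED_TASKS
```

With this setup, `supports(Tasks.SIGN_LANGUAGE_RECOGNITION)` is true and `supports(Tasks.LANGUAGE_MODELING)` is false, which matches the datasheet fix discussed above.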
Hi @, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.
yes
Dataloader name: mywsl2023/mywsl2023.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?mywsl2023