aayush97 / semeval2023-afrisenti

A low-resource sentiment analysis project for African Languages
MIT License
0 stars 0 forks source link

Vk data #7

Closed karthik-44 closed 2 years ago

karthik-44 commented 2 years ago

dz, ha, ma files of train have somehow the column names with extra space

dz, ha, ig, ma, pt of dev also has column name ' tweet' instead of 'tweet', Lets fix this as well.

This isn't any show stopper for any other analysis as the languages should be considered individually/seperately while doing analysis.

But lets fix it. Fixed

matrain.ID value has naming convention like 'my'. Need to change to 'ma_'.

Again this doesn't cause serious issue. Fixed

ma_dev.csv doesn't have this issue

Another problem is in class labels, there is an extra space. ' positive', ' negative', ' neutral'.

This is fixed as well.