Open frankdarkluo opened 11 months ago
@frankdarkluo: For the multilingual data, we sampled from several datasets, for the sentiment analysis and LID, we use the data from NusaX, while for the MT datasets you can check the detail on the following machine translation script.
Hope it helps!
I wonder if the multilingual data used in this paper could be open-sourced or linked?