microsoft / GLUECoS

A benchmark for code-switched NLP, ACL 2020
https://microsoft.github.io/GLUECoS
MIT License
73 stars 58 forks source link

Getting "Tweet doesn't exist", "Contact author for complete dataset" using download.sh #75

Closed goru001 closed 2 years ago

goru001 commented 2 years ago

Hi @ssitaram,

I was trying to get GLUECoS datasets but got multiple logs saying "Tweet doesn't exist", "Contact author for complete dataset", possibly because few tweets which were there in the dataset do not exist anymore. It'll be great if you can share what should we do in such scenarios as train/test datasets might not be same as required for evaluation?

Thanks, Gaurav

Genius1237 commented 2 years ago

All the tweets weren't available when we ran it too. That's a generic warming from the script we used. Don't think you have to worry

30 Mar 2022 06:51:02 Gaurav Arora @.***>:

Hi @ssitaram[https://github.com/ssitaram],

I was trying to get GLUECoS datasets but got multiple logs saying "Tweet doesn't exist", "Contact author for complete dataset", possibly because few tweets which were there in the dataset do not exist anymore. It'll be great if you can share what should we do in such scenarios as train/test datasets might not be same as required for evaluation?

Thanks, Gaurav

— Reply to this email directly, view it on GitHub[https://github.com/microsoft/GLUECoS/issues/75], or unsubscribe[https://github.com/notifications/unsubscribe-auth/ADZB3YY4PS2C2YSEZLA2BMTVCQ52HANCNFSM5SBVGJSQ]. You are receiving this because you are subscribed to this thread.[Tracking image][https://github.com/notifications/beacon/ADZB3Y7GQR7EM5E4ET4BMGDVCQ52HA5CNFSM5SBVGJS2YY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4RVWE74A.gif]