kanishk-adapt / semeval-task10

Repo for SemEval Task #10 EDOS 2023. created and maintained for DCU - ADAPT submissions
Other
0 stars 0 forks source link

cache tags in a format that loads faster #28

Closed jowagner closed 1 year ago

jowagner commented 1 year ago

The current caching of tags only yields a small speed improvement. Pandas is just too slow. A simple hand-coded the format, e.g. tab-separated data read with line.rstrip().split('\t'), is expected to be much faster here.