NVIDIA / sentiment-discovery

Unsupervised Language Modeling at scale for robust sentiment classification

Reproducing SemEval results #61

Open trivedigaurav opened 4 years ago

trivedigaurav commented 4 years ago

It appears that run_clf_multihead.py expects the test dataset to have gold-standard labels. However, the CSV at data/semeval/test.csv doesn't contain any labels. As a result, the script fails to generate the metrics needed to reproduce the results mentioned here: https://github.com/NVIDIA/sentiment-discovery/blob/master/analysis/reproduction.md#finetuning-classifiers
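
For anyone who wants to confirm the missing labels before running the script, a check along these lines may help. This is only a minimal sketch using the standard library: `missing_label_columns` is a hypothetical helper, not part of the repository, and the column names `anger` and `joy` are placeholders that may not match the actual headers in the SemEval CSV.

```python
import csv
import io


def missing_label_columns(csv_file, label_columns):
    """Return label columns that are absent from the header or have no value in any row.

    Hypothetical helper for illustration; not part of sentiment-discovery.
    """
    reader = csv.DictReader(csv_file)
    header = reader.fieldnames or []
    rows = list(reader)
    missing = []
    for col in label_columns:
        if col not in header:
            # The column does not exist at all.
            missing.append(col)
        elif all(not (row.get(col) or "").strip() for row in rows):
            # The column exists but every cell is blank.
            missing.append(col)
    return missing


# In-memory stand-in for a test split whose label cells are blank.
# The column names here are assumed, not taken from the real file.
sample = io.StringIO("text,anger,joy\nI love this,,\n")
print(missing_label_columns(sample, ["anger", "joy"]))
```

To run it against the real file, pass an open file handle (for example `open("data/semeval/test.csv", newline="")`) along with whichever label column names the training split uses.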