Closed mhueppe closed 2 weeks ago
Download and analyse the existing dataset, which includes ~10k abstract/title pairs from the Association of Computational Linguistics. Reference to dataset: https://github.com/gcunhase/ArXivAbsTitleDataset
Commit 733c548 already implements some data analysis.

Data Analysis: distributionTitles.png, distributionTitles_leng.png, proportionOfOverlap.png
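For anyone picking this up, here is a minimal sketch of the kind of statistics the plot names above suggest: a title-length distribution and the proportion of title words that also appear in the abstract. This is an assumption about what the figures measure, not the code from the commit. The function names (`tokenize`, `title_length_distribution`, `overlap_proportion`) and the in-memory `pairs` list are hypothetical, and loading the actual dataset files is left out because their layout is not described here.

```python
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenization (an assumption; the real
    # analysis may use a proper tokenizer).
    return text.lower().split()


def title_length_distribution(pairs):
    # Map title length in tokens -> number of titles with that length.
    return Counter(len(tokenize(title)) for _, title in pairs)


def overlap_proportion(abstract, title):
    # Fraction of title tokens that also occur in the abstract.
    title_tokens = tokenize(title)
    if not title_tokens:
        return 0.0
    abstract_vocab = set(tokenize(abstract))
    hits = sum(1 for tok in title_tokens if tok in abstract_vocab)
    return hits / len(title_tokens)


# Hypothetical (abstract, title) pairs standing in for the real dataset.
pairs = [
    ("we propose a neural model for title generation", "neural title generation"),
    ("this paper studies parsing", "a survey of parsing"),
]

length_counts = title_length_distribution(pairs)
overlaps = [overlap_proportion(abstract, title) for abstract, title in pairs]
```

`length_counts` can be fed to a bar plot, and `overlaps` to a histogram, to reproduce figures of this kind.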