HamedBabaei / author-obfuscation

4 stars 0 forks source link

corpora #1

Open HamedBabaei opened 5 years ago

HamedBabaei commented 5 years ago

I will add the previous year's corpora for author masking task. these corpora contain 205 problems in English for author obfuscation task from 2016.

besides that, I will add the author verification training corpus which used for evaluation of author obfuscation systems in the previous years too.

HamedBabaei commented 5 years ago

I added the corpora dir to the repository and the first analysis of it corpora ( the size of each corpus ) are shown in the below table:

corpus size
author masking 205
author verification pan2013 10
author verification pan2014-essay 200
author verification pan2014-novel 100
author verification pan2015 100
overall 615