ufal / perl-pmltq-web

Simple web built on top of the PML Tree Query server
https://lindat.mff.cuni.cz/services/pmltq/

Add GRUG Parallel Treebank #10

Closed: stranak closed this issue 6 years ago

stranak commented 9 years ago

http://fedora.clarin-d.uni-saarland.de/grug/. It is CC-BY, in Tiger XML format.

dan-zeman commented 9 years ago

Unfortunately the download contains only 40-50 sentences per language, while documentation mentions 2000+ sentences :-( I downloaded the data recently and tried to contact the author about it but got no reply.

stranak commented 9 years ago

Hm, that is very unfortunate indeed. It is barely worth the work for 50 sentences, even if it may be rather little work, given the Tiger XML format. For 2000 it would definitely be worth it.

Will you write to the author and ask him about it?

dan-zeman commented 9 years ago

Go to /net/data/treebanks/GRUG and try

grep '<s ' GEO/*.tig | wc -l

GEO: 45
GER: 45
RUS: 46
UKR: 50

I wrote to Oleg Kapanadze on March 30 and asked whether bigger data was available but he has not replied yet.
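
For anyone who wants to repeat the count for all four languages at once, here is a minimal Python sketch of the same check as the grep command above. It counts lines containing `<s ` in every `.tig` file of a language directory, so like grep it counts matching lines, not parsed XML elements. The function names are hypothetical, and the assumption that GER, RUS and UKR are laid out as subdirectories in the same way as GEO is inferred from the counts above, not confirmed.

```python
import sys
from pathlib import Path


def count_sentence_lines(lang_dir, pattern="<s ", suffix=".tig"):
    """Count lines containing `pattern` in all files ending in `suffix`.

    Mirrors `grep '<s ' DIR/*.tig | wc -l`: one hit per matching line.
    """
    total = 0
    for path in sorted(Path(lang_dir).glob(f"*{suffix}")):
        with path.open(encoding="utf-8", errors="replace") as handle:
            total += sum(1 for line in handle if pattern in line)
    return total


def count_all(root, languages=("GEO", "GER", "RUS", "UKR")):
    """Return a {language: count} mapping.

    Assumes one subdirectory per language under `root` (an assumption;
    only GEO/*.tig appears in the command above).
    """
    return {lang: count_sentence_lines(Path(root) / lang) for lang in languages}


if __name__ == "__main__":
    # Usage: python count_sentences.py /net/data/treebanks/GRUG
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    for lang, count in count_all(root).items():
        print(f"{lang}: {count}")
```

A missing language directory simply yields a count of 0, so the script can be run even if the layout differs from the one assumed here.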