jimregan / mlode

Automatically exported from code.google.com/p/mlode
0 stars 0 forks source link

CKAN entry for SentiWS-RDF #71

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
CKAN: http://thedatahub.org/dataset/sentiws

-Replaced POStags with olia.owl resource URIs
-Replaced German Umlauts with HTML-Entities
-Defined a minimal vocabulary
-Converted SentiWS to RDF using a bash script
-Parsing with rapper returned 15736 triple

Please try to verify my conversion:

bash makeSentiRDF.sh SentiWS_v1.8c_Negative.txt Negative
bash makeSentiRDF.sh SentiWS_v1.8c_Positive.txt Positive
cat sentiWS.nt senti-vocab > senti.nt

The number of triples seems to be too low.

Original issue reported on code.google.com by der.brue...@googlemail.com on 13 Aug 2012 at 4:28

Attachments:

GoogleCodeExporter commented 9 years ago
I've checked the script and found the error. The new RDF Dump in thedatahub 
counts 66040 triple.

Maybe we could add wiktionary links? How to link words automatically to 
wiktionary-entries?

Original comment by der.brue...@googlemail.com on 14 Aug 2012 at 9:09

Attachments:

GoogleCodeExporter commented 9 years ago

Original comment by der.brue...@googlemail.com on 19 Sep 2012 at 11:46