jimregan / mlode

Automatically exported from code.google.com/p/mlode
0 stars 0 forks source link

WordNet dataset to be verified #98

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Word Net 3.0.0 has been converted from tsv to turtle, so it is needed to be 
verified before it is hosted on mlod

Original issue reported on code.google.com by mofeed.m...@gmail.com on 19 Sep 2012 at 5:08

GoogleCodeExporter commented 9 years ago
I have no idea, which wordnet you mean. Where is the datahub link?

Original comment by kur...@googlemail.com on 19 Sep 2012 at 5:28

GoogleCodeExporter commented 9 years ago
**SentiWordnet**  was required by a sent email to be converted which is 
available in:   http://sentiwordnet.isti.cnr.it/. I searched for CKAN entry but 
i did not find any. I converted it usinf google-refine  and i need verfication 
for it in order to be hosted.

Original comment by mofeed.m...@gmail.com on 20 Sep 2012 at 2:17

GoogleCodeExporter commented 9 years ago
Yes, but then, where is the data?

Original comment by kur...@googlemail.com on 20 Sep 2012 at 3:37

GoogleCodeExporter commented 9 years ago
It was in the process of some modifications 

Original comment by mofeed.m...@gmail.com on 20 Sep 2012 at 4:26

Attachments:

GoogleCodeExporter commented 9 years ago
You really should parse out the #number ends of the labels and synseTerms. You 
should also link the words to http://wiktionary.dbpedia.org/resource/$word and 
add 

<http://purl.org/linguistics/gold/inLanguage> 
<http://www.lexvo.org/data/iso639-3/eng>
<http://purl.org/linguistics/gold/inLanguage> 
<http://www.glottolog.org/resource/languoid/id/stan1293>
<http://purl.org/linguistics/gold/inLanguage> <http://wals.info/language/eng>

to every triple (if they are english)

Original comment by der.brue...@googlemail.com on 21 Sep 2012 at 1:56

GoogleCodeExporter commented 9 years ago
You mean by parse out removing this tile from the word lists???   also for the 
http://wiktionary.dbpedia.org/resource/$word it shows in the browser an empty 
page. is that ok?

Original comment by mofeed.m...@gmail.com on 22 Sep 2012 at 6:40

GoogleCodeExporter commented 9 years ago
$word is a variable

Original comment by hellm...@informatik.uni-leipzig.de on 22 Sep 2012 at 7:06

GoogleCodeExporter commented 9 years ago
Like Sebastian said, $word is a variable for the word in question. You should 
also remove the #numbers on the ends of the words, so we have the pure word 
there.

Original comment by der.brue...@googlemail.com on 22 Sep 2012 at 7:18

GoogleCodeExporter commented 9 years ago
This is the dataset after revision and ready for verification.

Original comment by mofeed.m...@gmail.com on 27 Sep 2012 at 2:39

Attachments:

GoogleCodeExporter commented 9 years ago
removing duplicate dataset label

Original comment by joregan on 12 Apr 2015 at 11:35

GoogleCodeExporter commented 9 years ago
still not removed; trying both

Original comment by joregan on 12 Apr 2015 at 11:36