biscicol / triplifier

The triplifier converts Spreadsheets, databases, and Darwin Core Archives into RDF/N3 files suitable for use on the Semantic Web.
1 stars 0 forks source link

Drop redundant triples #58

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

triplify a typical DwCa

What is the expected output? What do you see instead?

Want to see each unique triple represented in the data set.

Actually see one triple per field redundant or not.

Please use labels and text to provide additional information.

If we remove the 9M triples with empty object from the VertNet data set 
we are left with 11.7M triples (11,698,611)

If we remove the redundant triples we are left with 2.4M triples (2,369,196)

* with empty and redundant triples present, 
  88.8% of the triplifiers output caries no information 

 (on the VertNet data set).

Original issue reported on code.google.com by tom.con...@gmail.com on 23 Apr 2013 at 5:59

GoogleCodeExporter commented 9 years ago
Should be fixed with revision 229.

Original comment by stucky.b...@gmail.com on 3 May 2013 at 9:05