diachron / quality

Dataset Quality Assessment (part of WP5 of the Diachron EU FP7 project)
MIT License
8 stars 4 forks source link

Completeness metrics against gold standard #57

Open clange opened 9 years ago

clange commented 9 years ago

Simple approach: “gold standard” is a subset of a well-known dataset (obtained with a SPARQL CONSTRUCT query against the dataset's endpoint). Compute the set difference between, say, the labels of the resources in the gold standard and the labels of the resources in the dataset under assessment. For now, we'll do exact matches, not approximate matches.