Closed (nichtich closed 2 years ago)
The database uniqueness constraint on (ppn,voc,notation) should be removed when cleanup (#2) is extended to make sure the same PPN is not indexed multiple times with the same voc & notation.
This requires extending this line: we can assume $voc.tsv is grouped by PPN but not sorted by notation, e.g.
12345 XX DA3 12345 YY 12345 XX
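A minimal sketch of how the cleanup could drop repeated notations per PPN, relying only on the grouping by PPN and not on any ordering of notations. It assumes each row has the PPN in the first column and the notation in the second; the function name `dedupe_grouped` and the sample rows are hypothetical and not taken from the repository.

```python
from itertools import groupby


def dedupe_grouped(rows):
    """Yield rows, skipping repeats of the same notation within a PPN group.

    Assumes rows are grouped by PPN (column 0) but not sorted by
    notation (column 1). Hypothetical helper, not the project's code.
    """
    for _ppn, group in groupby(rows, key=lambda row: row[0]):
        seen = set()  # notations already emitted for this PPN
        for row in group:
            notation = row[1]
            if notation not in seen:
                seen.add(notation)
                yield row


# Illustrative input: PPN 12345 carries notation XX twice, not adjacent.
rows = [
    ("12345", "XX"),
    ("12345", "YY"),
    ("12345", "XX"),
    ("67890", "XX"),
]
unique_rows = list(dedupe_grouped(rows))
```

Because the set of seen notations is reset for every PPN group, memory use is bounded by the largest group instead of the whole file. With duplicates filtered this way before indexing, the uniqueness constraint would no longer be needed to catch them.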