Open pvgenuchten opened 3 weeks ago
in link extraction phase, append each link with a reference to its source
For each url extracted what makes sense to track as reference?
FYI @pvgenuchten
Relevant here is the record (Id or url) on which a certain link was identified, for example https://soilwise-he.containers.wur.nl/cat/collections/metadata:main/items/10.1007/698_2022_928
currently we seem to not store which record has a bad link, only if the url is incorrect, which makes it hard to understand where an improvement is needed
suggestion: