Closed adamreichold closed 2 years ago
Related to but not the same as #23. Most likely, is more an issue with the harvested source changing as we harvest it if it is very large, so we might just want to log this as a warning in a generic dataset writing (and sanitizing) function.
It appears that some sources allocate duplicate dataset identifiers which we currently implicitly handle via last-write-wins. This should be replaced a explicit handling that decides which version should be used.