LTER-LIFE / VeluweProtoDT

Veluwe proto-DT: a digital mini twin of tree phenology and climate scenarios
0 stars 0 forks source link

Validate XML, EML, DwC-A #13

Open StefanVriend opened 10 months ago

StefanVriend commented 10 months ago

There are packages and tools to validate Darwin Core-Archives, as well as their components (XML and EML). We should implement this process as part of our bud burst data workflow.

StefanVriend commented 10 months ago

DwC-A

There is an R package to validate DwC-A files: {finch}. The downside is that it requires the file to be available through URL.

GBIF also offers a web-based data validator: https://www.gbif.org/tools/data-validator.

The best option seems to write our own functions, based on the {finch} package.

CherineJ commented 9 months ago

I also found the OBIS tools package (obistools) which is not primarily suited to validate the whole DwC-A but seems to have some nice features to check e.g., whether eventID and parent eventID correspond and for checking some criteria of the separate tables (e.g., whether certain DwC-A terms are included or contain missing values etc.). maybe we can implement that in our own validation functions.