inbo / camtrapdp

R package to read and manipulate Camera Trap Data Packages (Camtrap DP)
https://inbo.github.io/camtrapdp/
Other
3 stars 0 forks source link

Create `validate()` and helper functions to validate integrity of a Camtrap DP #58

Open peterdesmet opened 1 year ago

peterdesmet commented 1 year ago

Suggested in camtraptor July 2023 coding sprint

An important aspect before analysing or publishing data is to check whether the dataset does not contain any major integrity errors, such as missing dates, coordinates, values not meeting controlled vocabularies or relationships between tables not being correct. Although validation is possible with the Python software Frictionless Framework, for most users the returned error messages are hard to parse.

Suggestions for functions:

While it would be useful if these were functions of the frictionless R package, it might not be what we expect for camtraptor. Frictionless would have its validation run on resources (i.e. csv files + schemas), since returned data frames lose the connection with their schema, so it is not possible to validate for relationships or unique, as that information is lost. Camtraptor on the other hand, wants to validate the (already read) data frames.

damianooldoni commented 2 months ago

@peterdesmet: wondering if we should move this issue to camtrapdp repo.

peterdesmet commented 2 months ago

Ah yes, will do.