legumeinfo / datastore-specifications

Specifications for directory naming, file naming, file contents in the LIS datastore

Need QC specification for each collection for file-based QC #5

Closed sammyjava closed 1 year ago

sammyjava commented 2 years ago

We need file-based QC, because it's too large a burden to rebuild a mine way down the line after a bad file gets loaded. So let's put a QC.md with the QC requirements under each section of this repo. I've started that under annotations.

sammyjava commented 2 years ago

As far as the README files go, this issue has been solved with the JSON schema and validate script. But we should still validate the data files, GFFs especially, and make sure that FASTAs use the same identifiers as the corresponding GFFs.
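
For illustration, the FASTA/GFF identifier check could look something like the sketch below. This is not the repo's validate script; the function names (`fasta_ids`, `gff_ids`, `compare_ids`) and the choice of feature type are hypothetical. It assumes GFF3 with nine tab-separated columns and an `ID=` attribute in column 9, and that the FASTA identifier is the first whitespace-delimited token after `>`.

```python
from urllib.parse import unquote


def fasta_ids(lines):
    """Collect sequence identifiers (first token after '>') from FASTA lines."""
    ids = set()
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                ids.add(fields[0])
    return ids


def gff_ids(lines, feature_type="gene"):
    """Collect ID attributes from GFF3 records of the given feature type."""
    ids = set()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # embedded sequence section, no more feature records
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments, and blank lines
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != feature_type:
            continue
        for attr in cols[8].split(";"):
            key, sep, value = attr.partition("=")
            if sep and key.strip() == "ID":
                # GFF3 percent-encodes reserved characters in column 9
                ids.add(unquote(value.strip()))
    return ids


def compare_ids(fasta_lines, gff_lines, feature_type="gene"):
    """Return (ids only in the FASTA, ids only in the GFF), each sorted."""
    in_fasta = fasta_ids(fasta_lines)
    in_gff = gff_ids(gff_lines, feature_type)
    return sorted(in_fasta - in_gff), sorted(in_gff - in_fasta)


# Made-up example records, not taken from the datastore.
gff = [
    "##gff-version 3\n",
    "chr1\tsrc\tgene\t1\t100\t.\t+\t.\tID=gene1;Name=g1\n",
    "chr1\tsrc\tmRNA\t1\t100\t.\t+\t.\tID=gene1.1;Parent=gene1\n",
    "chr1\tsrc\tmRNA\t200\t300\t.\t+\t.\tID=gene2.1;Parent=gene2\n",
]
fasta = [">gene1.1 some description\n", "ATGC\n", ">gene3.1\n", "ATGC\n"]

only_fasta, only_gff = compare_ids(fasta, gff, feature_type="mRNA")
print("in FASTA but not GFF:", only_fasta)
print("in GFF but not FASTA:", only_gff)
```

A file pair would pass when both lists come back empty. Which feature type to compare against (gene, mRNA, or something else) depends on the FASTA in question, so it is left as a parameter here.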

sammyjava commented 1 year ago

This has been further implemented, on an ongoing basis, with the org.ncgr.datastore.validation package, which will never be "finished", so I'll close this issue.