The pyQuARC tool reads and evaluates metadata records with a focus on the consistency and robustness of the metadata. pyQuARC flags opportunities to improve or add to contextual metadata information in order to help the user connect to relevant data products. pyQuARC also ensures that information common to both the data product and the file-level metadata are consistent and compatible. pyQuARC frees up human evaluators to make more sophisticated assessments such as whether an abstract accurately describes the data and provides the correct contextual information. The base pyQuARC package assesses descriptive metadata used to catalog Earth observation data products and files. As open source software, pyQuARC can be adapted and customized by data providers to allow for quality checks that evolve with their needs, including checking metadata not covered in base package.
Motivation
To make pyQuARC easier for the users, it is set up in a way that a new rule can be added just by adding entries to a set of JSON files. A new rule basically points to a checks that is already implemented.
We need documentation of all the available checks so the user can just go through it and find the check that they need to create the new rule. It should include the description of what the check does and the check id that can be used in the JSON files.
Changes made
Restructured the schema files rule_mapping.json and checks.json, such that the checks can be reused and users have to only make changes in the rule_mapping.json file.
Changes include:
Moved data from checks.json to rule_mapping.json
Added check_id to rule_mapping.json to point to already existing check (helps remove duplicates from checks.json)
Removed the duplicate checks from checks.json
Deliverable
The documentation for all the available checks is provided in the file CHECKS.md
Motivation To make pyQuARC easier for the users, it is set up in a way that a new rule can be added just by adding entries to a set of JSON files. A new rule basically points to a checks that is already implemented.
We need documentation of all the available checks so the user can just go through it and find the check that they need to create the new rule. It should include the description of what the check does and the check id that can be used in the JSON files.
Changes made Restructured the schema files
rule_mapping.json
andchecks.json
, such that thechecks
can be reused and users have to only make changes in therule_mapping.json
file. Changes include:data
fromchecks.json
torule_mapping.json
check_id
torule_mapping.json
to point to already existing check (helps remove duplicates fromchecks.json
)checks.json
Deliverable The documentation for all the available checks is provided in the file CHECKS.md