We need to agree on a format for storing the data model. It should preferably be lightweight, practical, simple to incorporate into code, and allow validation against the standard. Obvious possibilities are:
Pydantic (my favourite)
JSON Schema
I have no direct experience with either - and anyway they seem to be interconvertible. What is already in use, and what would people propose?
A less obvious possibility would be mmCIF. This might be a very good choice for formalising a standard later, being precisely defined, authoritative, and used for deposition. However it is heavyweight and hard to work with, so probably not a good choice to begin with.
We need to agree on a format for storing the data model. It should preferably be lightweight, practical, simple to incorporate into code, and allow validation against the standard. Obvious possibilities are:
I have no direct experience with either - and anyway they seem to be interconvertible. What is already in use, and what would people propose?
A less obvious possibility would be mmCIF. This might be a very good choice for formalising a standard later, being precisely defined, authoritative, and used for deposition. However it is heavyweight and hard to work with, so probably not a good choice to begin with.