Open brindakv opened 8 months ago
@svoinea to identify all tables that are not in the json config file that are populated in the system generated mmcif file.
Also, identify tables that are not in ermrest but are taken as is from the initial mmCIF (e.g., atom_site): https://github.com/informatics-isi-edu/protein-database/blob/master/scripts/pdb_processing/config/mmcif_tables_input2output.json
input.cif
) --> output.cif
1.2 testSchemaDataPrepValidate-ihm.py(output.cif
) --> *.json
1.3 loadTablesFromJSONinput.cif
) --> output.cif
2.2 write into the <structure_id>.cif
file:
<structure_id>
valueermrest_table_defs.json
file which is generated from the json-full-db-ihm_dev_full-col-ihm_dev_full.json
filechem_comp_atom
tableaudit_conform
write the Supported_Dictionaryoutput.cif
file, write the lines of the tables present in the mmcif_tables_input2output.json
fileEntry_Generated_File
<structure_id>.cif
file
hatrac
the Entry_Generated_File
or the Entry_Error_File
ermrest
the Conform_Dictionary
entriesSUBMISSION COMPLETE
3.1 Get Accession Code
3.2 addReleaseRecords for HOLD:
3.3 generate the <Accession_Code>.cif
file from the <structure_id>.cif
file:
Accession_Code
valuepdbx_database_status
and pdbx_audit_revision
blocks in the system generated mmCIF fileihm_entry_collection
and ihm_entry_collection_mapping
tablessingularity
for the report validationpdbx_database_status
and pdbx_audit_revision
blocks in the system generated mmCIF file
This json file is used to identify tables populated in system generated mmCIF file.
However,
ihm_entry_collection
is populated using the backend: https://github.com/informatics-isi-edu/protein-database/blob/master/scripts/pdb_processing/worker/pdb/clientlib/pdb_workflow_processing_lib/client.py#L2141The process needs to be reviewed so that the configuration can be controlled in one place.
Update documentation accordingly: https://github.com/informatics-isi-edu/protein-database/wiki/Creating-mmCIF-files.