ncbo / ncbo_cron

Jobs that run on a regular basis in the NCBO infrastructure
Other
2 stars 6 forks source link

ontology pull mechanism creates new submissions when content of ontology doesn't change #7

Open alexskr opened 7 years ago

alexskr commented 7 years ago

ncbo_cron ontology pull mechanism creates new entries for ontologies every day while the content of ontology doesn't change. For example, some OBO files have "data-version" or "date" metadata entries automatically generated everyday while the actual content of ontology doesn't change for years. Pull mechanism should probably be adjusted to accommodate this behavior. Take a look at MA and SBO ontologies: [root@bmir-admin1 repository]# diff MA/39/ma.obo MA/125/ma.obo 2c2 < data-version: releases/2016-07-16

data-version: releases/2016-12-22 [root@bmir-admin1 repository]# diff SBO/585/SBO_OBO.obo SBO/565/SBO_OBO.obo 2c2 < date: 24:01:2017 07:00

date: 01:11:2016 07:00