open-reaction-database / ord-schema

Schema for the Open Reaction Database
https://open-reaction-database.org
Apache License 2.0
93 stars 26 forks source link

process_dataset.py is not correctly handling deleted files #569

Closed skearnes closed 3 years ago

skearnes commented 3 years ago

The comment on https://github.com/open-reaction-database/ord-data/pull/73 should show the number of reactions removed. There's a continue in process_dataset.py that is the source of the problem.