Harvest metadata from the Protein Data Bank and map to our dataset schema
Can use the outbreak.info PDB parser as inspiration, which harvests only the COVID-19-related structures. It's highly likely that this script isn't optimized or necessarily the best path though ;)