biopragmatics / bioregistry

📮 An integrative registry of biological databases, ontologies, and nomenclatures.
https://bioregistry.io
MIT License
115 stars 49 forks source link

Add missing descriptions and homepages #73

Closed cthoyt closed 2 years ago

cthoyt commented 3 years ago

Curation sheet generated with:

import pandas as pd
import pystow

import bioregistry

def main():
    rows = []
    for prefix in bioregistry.read_registry():
        homepage = bioregistry.get_homepage(prefix)
        desc = bioregistry.get_description(prefix)
        if not homepage or not desc:
            rows.append((
                prefix,
                bioregistry.get_name(prefix),
                homepage,
                desc,
                '',
            ))

    path = pystow.join('bioregistry', 'curation', name='bioregistry_missing_description.tsv')
    df = pd.DataFrame(rows, columns=['prefix', 'name', 'homepage', 'description', 'reference'])
    df.to_csv(path, index=False, sep='\t')

if __name__ == '__main__':
    main()
cthoyt commented 2 years ago

This is done thanks to help from @ddomingof