FINNGEN / autoreporting

MIT License
0 stars 1 forks source link

Refactor data_access APIs and data structures #130

Closed Lipastomies closed 1 year ago

Lipastomies commented 4 years ago

The APIs for the gwas catalog and such are quite messy right now. Sometimes the input data for functions and methods is pandas dataframes, sometimes it is lists of dicts. Preferably no dicts would be used as ad-hoc classes, as those require one to keep in mind the whole data structure.

Possibilities include:

1) replacing everything with pandas dataframes.

In my opinion, it makes sense to define some specific classes/named tuples, for example for gwas catalog variants, and ensembl api output, and then flesh out the structure with those in mind.