Open Sulstice opened 4 months ago
@Nickspizza001 So let's start slow with this one.
This first thing to do would be to take global-chem tsv and convert it into a parquet file.
Open a pull request and show me.
https://github.com/Global-Chem/global-chem-brick/tree/dami
Parquet is used because it compresses large files better than others
We are going to be taking Global-Chem and turning into a biobrick:
I opened up the issue here and here is the code for it: https://github.com/Global-Chem/global-chem-brick. The first thing to do is create a python file that converts the CSV from global-chem into a parquet file.
Should be a one-liner. Make the directory similar to the one labeled here: https://github.com/biobricks-ai/drugbank-open.
And place your script in the global-chem brick repository: https://github.com/Global-Chem/global-chem-brick