Updated: 2022.11.25
We are now moving towards the second CMI-PB contest for which we have generated new data. @Joaquin is working on re-harmonizing the data such that we have 3 datasets:
Updated: 2022.03.31
At this point this repository stores the codes to standardize (mostly) the CMI-PB datasets between 2020 and 2021. To access these datasets please check out: https://github.com/joreynajr/cmi-pb-multiomics/tree/main/results/main/cmi_pb_datasets/processed/harmonized
Download the data by using:
bash workflow/scripts/download_from_webpage_links.sh
Note: Don't use workflow/scripts/download_from_ftp_server.sh
, not working due to certificate issues.
Aftewards run the standarizing notebook with Jupyter:
workflow/notebooks/development/Standardize_data.ipynb