cancerDHC / example-data

This repository is intended to act as a store of example data files from across the NCI Cancer Research Data Commons (CRDC) nodes in a number of formats.
MIT License

Convert "head and neck" Jupyter Notebooks into an automated Python test suite #8

Closed gaurav closed 3 years ago

gaurav commented 3 years ago

The head-and-mouth directory contains two datasets (one downloaded from GDC, and one downloaded from PDC) from cancers of the head and neck (not including the brain). This is the dataset we currently use in our first GDC-to-CRDC-H transformation demonstration Jupyter Notebook. Some or all of this should be converted into a test suite, which can be used to test:

I'm not sure if we need to convert the Head and Mouth Cancer Datasets download notebook as well -- it might be useful to always use the most up-to-date data, or it might just be confusing if we run into problems with the PDC/GDC APIs. I'm inclined to leave it out for now.

turbomam commented 3 years ago

I may not be able to do all of this on my own.

gaurav commented 3 years ago

@turbomam No worries! Which parts would you like to focus on?

turbomam commented 3 years ago

Is that OK?

gaurav commented 3 years ago

Sounds like a plan! 👍🏾

gaurav commented 3 years ago

@turbomam Do you have any changes to the Jupyter Notebook you'd like to commit? If not, I'll come up with a PR early next week that applies all the fixes we've discussed on Slack, and then you can keep working on that.

turbomam commented 3 years ago

@gaurav No, I don't think I have anything to commit outside of what we've already discussed. Thanks.