Closed MxMstrmn closed 2 years ago
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
Looks good, a true data printing machine. I may schedule a few runs just to check.
gene_id
stored in both LINCS & Trapnell? We'll need this for the surgery..var.gene_id
adata_cpi
, it is just that I perform the matching via a ictionary (drug_name, smiles)
which imo is the preferable method.There was a problem with the new dataset, resulting in JQ1
getting assigned a NaN Smiles (since it was renamed after the dict mapping had been applied). I fixed it, but don't have the permission to save the dataset. @MxMstrmn can you look through it and just run the notebook again? That'll save the updated version to storage
Closes #61
The first notebook simply does the gene matching with the lincs data, ignoring the subsetting from before. The second notebook is an updated version of the addition of SMILES strings to the
.obs
dataframe. As a result all.h5ad
files in thePROJECT_DIR/'datasets'
folder are updated and ready to be used in our model sweeps.