Closed AdeboyeML closed 3 years ago
this looks great @AdeboyeML - a couple of immediate things to change, before I take a deeper dive.
1.data-exploration
.moa_sizes_consensus_datasets/consensus_median_moa_sizes
should be moa_sizes_consensus_datasets/consensus_median_moa_sizes.csv
.The reason for 1 is that we want to perform several analyses in this repo, and this is just one component. Each new analysis will belong in an analysis submodule.
@gwaygenomics Thanks for the observations.
I have created the 1.Data-exploration
folder (putting all files and sub-folders into the folder), and also updated the notebook with the null distribution analysis and I have also saved all the dataframe outputs as .csv files
.
The .csv files
has both the median_scores and p_values dataframes for each consensus dataset per dose (for each of the 231 MOAs with more than one compound).
Looks great @AdeboyeML - sorry this has taken me so long to review. I am just about ready to approve.
consensus_moa_sizes_per_dose_analysis.ipynb
becomes 0.consensus_moa_sizes_per_dose_analysis.ipynb
and one other script to visualize results, or something).ipynb
files. These nbconverted files can be generated with the following command: jupyter nbconvert --to=script --FilesWriter.build_directory=scripts/nbconverted *.ipynb
. The nbconverted files are so much easier to review than .ipynb files.Dropping in with a minor comment: Please add the column name for the first column (moa
) to the CSV files
e.g. https://raw.githubusercontent.com/broadinstitute/lincs-profiling-comparison/9bc5db8167674e2c8bec5cee3fcc043117acfbf6/1.Data-exploration/moa_sizes_consensus_datasets/median_dmso_null_p_values.csv
@AdeboyeML - we should prioritize merging in this contribution early next week. Here is a todo list:
I see people forgetting to add column names for indices all the time, and it bugs me! It is also a very easy fix:
data_moa_values = data_moa_values.reset_index().rename({"index": "moa"}, axis="columns")
save_to_csv(data_moa_values, 'moa_sizes_consensus_datasets', 'modz_moa_median_scores.csv')
@gwaygenomics - thanks for the observations and major points highlighted.
once you address these three things:
.ipynb_checkpoints
file and add to .gitignore
file (example here).py
files for corresponding .ipynb
filesGo ahead and merge.
Nice work! 🎉
@gwaygenomics all highlighted points have been addressed and corrected.
LGTM 👍 please merge at will
@gwaygenomics I do not think I have the permission to merge the pull request.
Only those with write access to this repository can merge pull requests.
ah, right. Just made you admin. You need to accept in an email, then you should be able to merge. Thanks for pointing this out!
@gway I have added the jupyter notebook that contains the code for the analysis on the consensus datasets to get the moa sizes/values per doses(1-7), I have also included the results of the analysis i.e. the heatmap plots and the csv files (folders).