Closed jjacobson95 closed 5 months ago
I'll note other issues as I find them in this thread.
Schema alignments - all of the following must be converted from float to integer:
Never mind on these - I think this is just how pandas is reading them into the dataframes.
I'm guessing these were all a result of my working off of main instead of the builder branch that was way ahead, I'll get to these shortly, still fixing another docker build issue.
Okay taking a look at the new build (2024_03_20 build) and here are a couple updates needed to get these aligned to the schema and working with the package:
Another issue with the 2024_03_20 build that I just found: Depmap proteomics file is missing most of its data. There is only proteomics info for a single sample.
please create separate issues so i can tag them. this was an easy fix, and is going in the latest PR.
This may not be up to date as I'm using some of the synapse data generated last week but I'm just trying to get ahead of a few fixes.
BeatAML mutations file has a column titled "mutations" instead of "mutation". This should already be fixed in main, but I noticed it was wrong in those files.
DepMap proteomics has the index column in it. just add 'index=False' argument to pd.to_csv.
DepMap Experiments file dose_reponse_metrics have these values: fit auc', 'fit ic50', 'fit ec50', 'fit r2', 'fit ec50se', 'fit einf'.
DepMap experiments file has these extra columns: