meyer-lab / mechanismEncoder

Developing patient-specific phosphoproteomic models using mechanistic autoencoders
4 stars 1 forks source link

Issues with cptac data #24

Closed FFroehlich closed 3 years ago

FFroehlich commented 3 years ago

While processing the cptac data, I ran into a couple of issues with missing/incomplete metadata:

sgosline commented 3 years ago

First off, I think you're referring to the cell line data, not the CPTAC data, so I'm assuming that's what you meant? Quick answers:

sgosline commented 3 years ago

As far as the data update goes - it hasn't fully be normalized/batch corrected. I hope to have it in my hands by the middle of next week (depending on how many batch effects there are).

sgosline commented 3 years ago

Get this - we "found" a dataset from 2018 that has never been used! It might be published with an existing paper, but either way we can use it right away. We have MOLM14 cells treated with gilteritinib and DMSO after 30 minutes and 3hrs. It looks like there are two sets of cells. Phospho data is here: https://www.synapse.org/#!Synapse:syn24189487/tables/ "site-corrected" data is here: https://www.synapse.org/#!Synapse:syn24189489/tables/ This second table matches the site-correction in the patient data.

I did this late last night, and there are some typos in the table, I will fix them soon.

sgosline commented 3 years ago

Typos are fixed, and zero values are removed.