cmap / cmapPy

Assorted tools for interacting with .gct, .gctx files and other Connectivity Map (Broad Institute) data/tools
https://clue.io/cmapPy/index.html
BSD 3-Clause "New" or "Revised" License
126 stars 76 forks source link

Math #29

Closed EvanLemire closed 6 years ago

EvanLemire commented 6 years ago

modz - Takes a pandas df, makes a weighted average of all columns

robust_zscore - Takes one or two pandas dataframes. If a second is provided, it uses this second one for calculating the medians and mads for zscoring (eg. vehicle control)

differential - Use GCToo object metadata to pass either a single dataframe or two dataframes to robust_zscore function based on a given field/value pair

levlitichev commented 6 years ago

Looking good! Are you working on tests? Let me know if I can offer a hand.

EvanLemire commented 6 years ago

@levlitichev Yea I actually already have some tests for these from merino, need to think about how I might change them now that the structure is different / what parts I want to transfer over

levlitichev commented 6 years ago

Made some changes!

@EvanLemire @oena let me know what you think