EngreitzLab / gene_network_evaluation

Evaluation framework for computationally inferred gene networks from single-cell data.
GNU General Public License v2.0

Celloracle updates #12

adamklie closed 6 months ago

adamklie commented 6 months ago

I refactored the CellOracle workflow and am currently testing it for robustness. The major changes are:

  1. Cicero parameters can now be specified in the config, including a precomputed dimensionality reduction for building the kNN graph used for aggregation.
  2. The cluster key in mdata.obs now needs to be specified in the config.
  3. The r2g and tf2r scripts now only consider genes and TFs present in the starting MuData. This means you can subset the MuData ahead of time, and only those genes and TFs will appear in any of the outputs.
  4. The grn script no longer performs any of the steps required for CellOracle simulations. You now just specify a layer, and a bagging ridge regression is fit for each target gene using the TFs constrained by the r2g and tf2r steps.
  5. The main outputs are r2g.csv, tf2r.csv and grn.csv, which are structured much as before, with some minor edits to column names. These are stored in the uns as dictionaries with keys "r2g", "tf2r" and "grn" respectively.
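
To make the constraint in items 3 and 4 concrete, here is a minimal sketch of how the candidate TFs for each target gene could be derived from the r2g and tf2r links. It is not the code in this PR: the function name `candidate_tfs`, the tuple layout of the links, and the example regions and genes are all assumptions made for illustration, and the real scripts may handle details such as self-regulation differently.

```python
from collections import defaultdict


def candidate_tfs(r2g_links, tf2r_links, genes_in_data):
    """Map each target gene to the TFs linked to it through a shared region.

    Hypothetical helper, not part of the workflow:
      r2g_links     -- iterable of (region, gene) pairs
      tf2r_links    -- iterable of (tf, region) pairs
      genes_in_data -- gene names present in the starting MuData
    """
    genes_in_data = set(genes_in_data)

    # Index TFs by region, dropping TFs that are absent from the data.
    tfs_by_region = defaultdict(set)
    for tf, region in tf2r_links:
        if tf in genes_in_data:
            tfs_by_region[region].add(tf)

    # For each region-gene link whose gene is in the data,
    # collect the TFs that bind that region.
    candidates = defaultdict(set)
    for region, gene in r2g_links:
        if gene in genes_in_data:
            candidates[gene] |= tfs_by_region.get(region, set())

    # Keep only genes with at least one candidate TF; sort for stable output.
    return {gene: sorted(tfs) for gene, tfs in candidates.items() if tfs}


# Made-up example links (assumed for illustration only).
r2g = [
    ("chr1:100-200", "GATA1"),
    ("chr1:100-200", "KLF1"),
    ("chr2:50-90", "SPI1"),
    ("chr3:10-20", "MYC"),
]
tf2r = [
    ("GATA1", "chr1:100-200"),
    ("TAL1", "chr1:100-200"),
    ("SPI1", "chr2:50-90"),
    ("CEBPA", "chr2:50-90"),
]
# MYC and CEBPA were subset out of the data ahead of time.
genes = {"GATA1", "KLF1", "SPI1", "TAL1"}

result = candidate_tfs(r2g, tf2r, genes)
```

In this sketch, `result` contains entries only for GATA1, KLF1 and SPI1, and CEBPA never appears as a candidate TF, which mirrors the behaviour described in item 3. The per-gene TF lists would then be the predictors passed to the regression described in item 4.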