shendurelab / MPRAflow

A portable, flexible, parallelized tool for complete processing of massively parallel reporter assay data
Apache License 2.0

Significant speed-up of insert count normalization #60

Closed (lotard closed this 2 years ago)

lotard commented 2 years ago

Uses the pandas 'groupby' and 'agg' functions to obtain normalized counts in a single pass instead of a loop.
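For readers who want to see the general pattern, here is a minimal sketch of replacing a per-group loop with a single 'groupby' plus 'agg'. It is not the code from this change. The column names (`condition`, `replicate`, `insert`, `dna_count`, `rna_count`), the grouping keys, the counts-per-million scaling, and both function names are assumptions made for illustration only.

```python
import pandas as pd

# Hypothetical long-format table: one row per (condition, replicate, insert).
counts = pd.DataFrame({
    "condition": ["A", "A", "A", "B", "B"],
    "replicate": [1, 1, 1, 1, 1],
    "insert": ["i1", "i2", "i3", "i1", "i2"],
    "dna_count": [10, 30, 60, 5, 15],
    "rna_count": [20, 20, 60, 10, 10],
})

KEYS = ["condition", "replicate"]  # assumed grouping keys


def normalize_loop(df, scale=1e6):
    """Loop version: normalize each group separately, then concatenate."""
    parts = []
    for _, group in df.groupby(KEYS):
        group = group.copy()
        for col in ("dna_count", "rna_count"):
            group[col + "_norm"] = group[col] / group[col].sum() * scale
        parts.append(group)
    # Restore the original row order after concatenating the groups.
    return pd.concat(parts).sort_index()


def normalize_grouped(df, scale=1e6):
    """Single-pass version: aggregate the totals once, merge, then divide."""
    totals = (
        df.groupby(KEYS)
        .agg(dna_total=("dna_count", "sum"), rna_total=("rna_count", "sum"))
        .reset_index()
    )
    out = df.merge(totals, on=KEYS, how="left")
    out["dna_count_norm"] = out["dna_count"] / out["dna_total"] * scale
    out["rna_count_norm"] = out["rna_count"] / out["rna_total"] * scale
    return out.drop(columns=["dna_total", "rna_total"])


loop_result = normalize_loop(counts)
grouped_result = normalize_grouped(counts)
print(grouped_result)
```

Both functions should return the same table. The grouped version computes the per-group totals once and performs the division as a vectorized column operation, which is typically where the speed-up comes from when there are many groups.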

visze commented 2 years ago

thank you!