carmonalab / STACAS

R package for semi-supervised single-cell data integration
GNU General Public License v3.0
75 stars 9 forks source link

option to only use intersection of features between datasets #16

Closed sjcarmona closed 1 year ago

sjcarmona commented 2 years ago

It is common that users integrate datasets that have inconsistent gene symbols (e.g. different versions of genome annotation were used). Include check on gene symbol overlaps across input datasets. Default conservative behaviour might be to consider only the intersection (and stop if number is below a min. threshold).