ScalableCytometryImageProcessing / SCIP

Scalable Cytometry Image Processing (SCIP) is an open-source tool that implements an image processing pipeline on top of Dask, a distributed computing framework written in Python. SCIP performs projection, illumination correction, image segmentation and masking, and feature extraction.
https://scalable-cytometry-image-processing.readthedocs.io/en/latest/
GNU General Public License v3.0

For large datasets, SCIP hangs when exporting features #46

Closed by MaximLippeveld 2 years ago

MaximLippeveld commented 2 years ago

Most likely caused by the features being collected onto a single node in the delayed call to final. Fix: index the features, join them with meta, and call to_parquet directly on the Dask DataFrame, rather than on a pandas DataFrame in final.