cytomining / cytominer-transport

[Deprecated]
Other
1 stars 2 forks source link

`to_parquet` initial testing #5

Closed bethac07 closed 3 years ago

bethac07 commented 3 years ago

1) If my decoder is decoding correctly (which I'm not certain it is), the number of "objects" is the same as the number of rows in the image CSV- ie my Image.csv has two rows, my Cells.csv has 137, but I only see two lines in the file. 2) Some of the headers are wrong- The value for Area_Shape_Center_X from the Cells.csv is 291.457696827262 (and that value is NOT present at all in Cytoplasm.csv, but when I search that value in my parquet file it is under AreaShape_Center_X_Cytoplasm_Image.

CSV_Input.zip

output.zip

Code run:

from cytominer_transport import to_parquet

example_source = "/Users/bcimini/Desktop/test/transport/per_well/20585_A02/"
example_objects = ["Cells.csv", "Cytoplasm.csv", "Nuclei.csv"]
example_destination = "test_dir"

to_parquet(source=example_source, destination=example_destination, objects=example_objects)
bethac07 commented 3 years ago

These issues have all been fixed :)