SCIInstitute / ShapeWorks

ShapeWorks
http://sciinstitute.github.io/ShapeWorks/
Other
100 stars 32 forks source link

Test systems data preparation #970

Open sheryjoe opened 3 years ago

sheryjoe commented 3 years ago

Data are needed to be prepared for all below test systems to evaluate and validate ShapeWorks tools.

In house:

Public:

Each test system include the following. Data should be organized based on the portal datasets requirements.

jadie1 commented 2 years ago

We are keeping track of this via this spreadsheet: https://docs.google.com/spreadsheets/d/1MpYoPxEbv0IsqWzHvmDUB6QE3L_n__Dlegsk-NLrLYY/edit?ts=60c7815a#gid=0

iyerkrithika21 commented 2 years ago

In the last Cardiology meeting Jake had a question regarding the size of the datasets. Do we have any specifications for that ? And do we need to include pathological and controls in all datasets or just controls would be enough?

sheryjoe commented 2 years ago

Moving this to 6.3 but we have to finish this by December max regardless of the release date.

cchriste commented 2 years ago

Moving this to 6.3 but we have to finish this by December max regardless of the release date.

Looks really close. Might it have been completed?

jadie1 commented 2 years ago

I am just finishing up organizing the prostate dataset

iyerkrithika21 commented 2 years ago

I need to finish organizing the biventricle data.

jadie1 commented 2 years ago

Datasets that need a license file:

I will ask Amy about the ankle license. @akenmorris and @sheryjoe who should I ask about the cardiac licenses?

jadie1 commented 2 years ago

Potential public test system: https://www2.childmind.org/webmail/908232/621532556/ab06ec718e8753cf11f86c9c7a8da595abbd6e12bf69f0e2a59ab80f3243a0be

jadie1 commented 11 months ago

Note the data folder has changed from /usr/sci/data/SSM-Data/ to CHPC storage automounted at /usr/sci/datanew/SSM-Data/

jadie1 commented 10 months ago

I have added the TotalSegmentator dataset to /usr/sci/datanew/SSM-Data/Public_Data/Totalsegmentator_dataset/. This dataset contains 1204 subjects with 104 shape classes (i.e., bones, organs, muscles, etc). There are 76,888 non-empty segmentations, and of those 49,216 do not lie on image boundary/are not cutoff, these have been marked as "complete".

The test systems spreadsheet has been updated and I created a spreadsheet for the entire dataset and complete data: /usr/sci/datanew/SSM-Data/Public_Data/Totalsegmentator_dataset/all_shapes.xlsx and /usr/sci/datanew/SSM-Data/Public_Data/Totalsegmentator_dataset/complete_shapes.xlsx

I've also added json files with the shape counts for the available and complete segmentations: /usr/sci/datanew/SSM-Data/Public_Data/Totalsegmentator_dataset/all_shape_counts.json and /usr/sci/datanew/SSM-Data/Public_Data/Totalsegmentator_dataset/complete_shape_counts.json