juliandewit / kaggle_ndsb2017

Kaggle datascience bowl 2017
MIT License
624 stars 290 forks source link

Where does the data in resource.rar come from? #26

Open caojiehui opened 7 years ago

caojiehui commented 7 years ago

Hi, julian,

Your work is great. Thanks for sharing.

I download the resource.rar and there are several folders including different data. As far as I know, the data of the folder 'luna16_annotations' is from LUNA16 and LIDC-IDRI ,and the data of the folders 'luna16_manual_labels' and 'ndsb3_manual_labels' are generated manually. How about other folders? Such as annotations_excluded.csv of the folder 'luna16_annotations', candidates_V2.csv of the folder 'luna16_annotations' , the folder 'luna16_falsepos_labels' and the folder 'segmenter_traindata'.

Thanks Cao jiehui

juliandewit commented 6 years ago

It's in the blog under Preprocessing and creating a trainset