dhammack / DSB2017

Code for 2nd place solution to the 2017 National Data Science Bowl
346 stars 156 forks source link

about the data #10

Closed HarryPotter2017 closed 7 years ago

HarryPotter2017 commented 7 years ago

Hi Daniel,

When I run the file DSB2017-master/training_code/FLung_nodule_models/create_nodules_from_modelv29.py, I find the line 230:

train_files = [f for f in os.listdir(r'E:\lung\data_raw\data')]

I want to know if the data in the directory 'E:\lung\data_raw\data' is the data provided by kaggle ? If this is the case, kaggle provides the stage1 data is not like LUNA. Do I need to convert the .dcm file provided by kaggle into a .raw file and a .mhd file? How can I do it?

HarryPotter2017 commented 7 years ago

I found that the file preprocess.py can solve it.