Preprocessing of files - Githubissues

KCL-BMEIS / VS_Seg

Automatic Segmentation of Vestibular Schwannoma with MONAI (PyTorch)

Apache License 2.0

43 stars 8 forks source link

Preprocessing of files #2

Closed mabbasi6 closed 2 years ago

mabbasi6 commented 2 years ago

I have been trying to use the preprocessing code and I have been getting these errors: When I enter <data/Vestibular-Schwannoma-SEG>:

Traceback (most recent call last):
  File "TCIA_data_convert_into_convenient_folder_structure.py", line 125, in <module>
    assert(all(found)), f"Not all required files found"
AssertionError: Not all required files found

When I point to the folder :

Traceback (most recent call last):
  File "TCIA_data_convert_into_convenient_folder_structure.py", line 42, in <module>
    dd = pydicom.read_file(first_file)
  File "/Users/mabbasi6/opt/anaconda3/envs/momo_seg/lib/python3.6/site-packages/pydicom/filereader.py", line 993, in dcmread
    fp = open(fp, 'rb')
IsADirectoryError: [Errno 21] Is a directory: '/Users/mabbasi6/Downloads/VS_Seg/data/new/manifest-1614264588831/Vestibular-Schwannoma-SEG/VS-SEG-061/03-17-1996-NA-Avanto RoutineImage Guidance-11244'

Could you please let me know how I could solve it? and/or the data has changed causing some errors?

aaronkujawa commented 2 years ago

Hi, the data has not changed. Did you download the complete dataset from TCIA in "Descriptive Directory Name" format? The --input path should be Vestibular-Schwannoma-SEG (in your case it should be /Users/mabbasi6/Downloads/VS_Seg/data/new/manifest-1614264588831/Vestibular-Schwannoma-SEG )

What does your full command look like? Does the error appear straight away or are some files copied to the output folder?

mabbasi6 commented 2 years ago

Hi, I downloaded the complete dataset from TCIA in "Descriptive Directory Name" format. Using the directory you recommended, the command is: python3 TCIA_data_convert_into_convenient_folder_structure.py --input /Users/mabbasi6/Downloads/VS_Seg/data/new/manifest-1614264588831/Vestibular-Schwannoma-SEG --output /Users/mabbasi6/Downloads/VS_Seg/prepped/

and the error is:

Traceback (most recent call last):
  File "TCIA_data_convert_into_convenient_folder_structure.py", line 125, in <module>
    assert(all(found)), f"Not all required files found"
AssertionError: Not all required files found

No data is written into the output folder.

aaronkujawa commented 2 years ago

Looks like you are doing everything right. Sorry, I'll have to download the data again from TCIA to run this on my side. Will get back to you when I can confirm that it is still running as expected.

aaronkujawa commented 2 years ago

Sorry for the delay, I couldn't immediately download the dataset because TCIA has added a new license to the dataset that required me to ask for access first. The newly added LICENSE file in the Vestibular-Schwannoma-SEG folder is also the reason for the script failing: The script assumes it's another case. For now you can just delete the LICENSE file, then the script will work. I'll also update the script to ignore the file. Thanks for raising the issue.

aaronkujawa commented 2 years ago

I updated the script to ignore the file. Thanks again.