microsoft / BiomedParse

BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
https://microsoft.github.io/BiomedParse/
Apache License 2.0
240 stars 19 forks source link

Preprocessing for Fine-tuning #19

Closed evanrubel closed 1 week ago

evanrubel commented 1 week ago

Thank you all for this excellent work and for open-sourcing it as well!

I am interested in fine-tuning the Biomedparse model for a CT dataset. In the inference code, there are some preprocessing steps for CT scans (e.g., windowing, normalization, etc.). When preparing the dataset for finetuning, should the input images in the train and test directories already be preprocessed, or will the training and evaluation code automatically preprocess the data?

theodore-zhao commented 1 week ago

They should be preprocessed before putting in the train and test folders. If there are image types that are out of the current preprocessing categories, feel free to preprocess in the way that makes most sense for your application. The model should be able to adapt to that after finetuning.