thomashopkins32 / HuBMAP

Hacking the Human Vasculature (Kaggle Competition)
Apache License 2.0
0 stars 0 forks source link

Figure out what to do with unlabeled data #7

Closed thomashopkins32 closed 1 year ago

thomashopkins32 commented 1 year ago

There are ~6,000 images in the training set that have no labels. Examine a few a try to determine the best method for utilizing this data.

Look into the following and report back:

Also look into how we might use this information in other ways.

thomashopkins32 commented 1 year ago

I want to try self-supervised learning where we randomly mask out a portion of the image and then predict the missing region. I need to do more research on this topic but I think this might be the easiest way to incorporate the 7,000 unlabeled images.