CoVital-Project / Spo2_evaluation

Python script to evaluate the correctness of SpO2 estimation algorithms

Dataset structure #21

Open MalcolmMielle opened 4 years ago

MalcolmMielle commented 4 years ago

Hi all,

Since we should soon get the dataset up and running, I'd like to talk about how we plan to provide it to users, especially since students are going to be working on it.

building the dataset

I'm talking with Dave Hagman about keeping the data we will get from the MD separate from a dataset of community samples. That way we have the original dataset from the doctor, which would be the medical dataset, and we can then distribute the collection app to other users and build a larger (but less accurate) community dataset. I think the method we provide to the MD has to score high on the medical dataset, but the community dataset could be used for training.

Thoughts?

providing the dataset

What do you guys think about making only one half of the dataset public? The non-public part of the dataset could be used as the test subset. This way users would have access only to the training/validation set, not the final test set. It's only an idea I wanted to pitch, but we could work with the back-end people to create an architecture where students can only upload their results (or method) and never see the test dataset (I know some universities have set up datasets this way).

It's definitely low priority but I thought it would be interesting to raise this point.
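To make the idea concrete, here is a minimal sketch of how a hidden test half could work. Everything in it is an assumption for illustration: the names `is_private`, `split_dataset`, and `score_submission` are hypothetical, samples are represented as a plain mapping from sample ID to reference SpO2 value, and mean absolute error stands in for whatever metric we end up choosing. The back end would keep the private labels and return only the aggregate score.

```python
import hashlib
from statistics import mean


def is_private(sample_id: str, private_fraction: float = 0.5) -> bool:
    """Assign a sample to the hidden test part by hashing its ID.

    Hashing keeps the assignment stable when new samples are added later.
    """
    digest = hashlib.sha256(sample_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64  # value in [0, 1)
    return bucket < private_fraction


def split_dataset(samples: dict[str, float], private_fraction: float = 0.5):
    """Return (public, private) mappings of sample ID -> reference SpO2."""
    public, private = {}, {}
    for sample_id, spo2 in samples.items():
        target = private if is_private(sample_id, private_fraction) else public
        target[sample_id] = spo2
    return public, private


def score_submission(predictions: dict[str, float], private: dict[str, float]) -> float:
    """Server-side scoring: return only the mean absolute error.

    The submitter sends one prediction per hidden sample ID and never
    sees the reference values themselves.
    """
    missing = set(private) - set(predictions)
    if missing:
        raise ValueError(f"{len(missing)} test samples have no prediction")
    return mean(abs(predictions[k] - private[k]) for k in private)
```

In this sketch students would download only the `public` part (plus the unlabeled inputs of the private part) and upload a mapping of predictions; the same scoring function could be pointed at the medical dataset while the community dataset stays fully public for training.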

MohammedSoliman11 commented 2 years ago

OK, as a student, how can I get this dataset so I can run the full model and test it?

MalcolmMielle commented 2 years ago

I don't think you can (or that it is easy to get this data) atm. Sadly, the data wasn't collected as extensively as we wished in the end. @YoniSchirris do you still have access to it?

YoniSchirris commented 2 years ago

I'm afraid our collected dataset was never finished in this sense. You can play around with the public datasets that were previously collected, however.