carpentries-incubator / ml4bio-workshop

Materials for a workshop introducing machine learning to biologists
https://carpentries-incubator.github.io/ml4bio-workshop/

Edit content on data pre-processing and limitations #123

Open agitter opened 2 years ago

agitter commented 2 years ago

We currently discuss data pre-processing in the T-cells lesson and give examples there. We do not teach specific data cleaning strategies because they are domain-specific. However, we can do more to stress the importance of data cleaning so that participants are not misled. The choice of classifier is not relevant if the data are of poor quality.

In addition, we can close the workshop with some discussion of limitations. These include limitations of ML in general and limitations of participants' knowledge after the workshop. That discussion can transition into the presentation of next steps and resources that participants can use to continue their ML education.