kvarada / DSI-ML-workshop-2024

Introductory machine learning workshop for non-stem audience.
https://kvarada.github.io/DSI-ML-workshop-2024
Creative Commons Zero v1.0 Universal
0 stars 0 forks source link

Activity when ML is not applicable #7

Open yuliaUU opened 3 weeks ago

yuliaUU commented 3 weeks ago

this can be done as activity: giving each team dataset, its description and research question. and ask whether it is ok to use ML:

yuliaUU commented 3 weeks ago

Sonar Dataset • Description: This dataset involves predicting whether an object is a mine or a rock based on sonar returns. It includes 208 observations with 60 input variables. • Link: Connectionist Bench (Sonar, Mines vs. Rocks) - UCI Machine Learning Repository • Question: Can we accurately classify sonar returns as either mines or rocks? • Issues with Dataset: The small size (208 instances and 60 features) and imbalanced class distribution make it difficult for machine learning models to generalize well, often resulting in overfitting. The limited number of observations may not be sufficient to train a robust model effectively.

yuliaUU commented 3 weeks ago

Iris Dataset • Description: The Iris dataset contains 150 instances of iris flowers, each described by four features (sepal length, sepal width, petal length, and petal width). It's one of the most famous datasets in the machine learning community. • Link: UCI Machine Learning Repository - Iris Dataset • Question: Can we classify the species of an iris flower based on its features? • Issues with Dataset: The dataset is too simplistic and small to be representative of more complex real-world problems. It's often used for educational purposes but doesn't provide sufficient complexity for practical machine learning challenges.

yuliaUU commented 3 weeks ago

This on is tricky: cause you can use ML- but tehre is a "but" Weather Forecasting Dataset