MDAIceland / WaterSecurity

1 stars 1 forks source link

Model Exploration #10

Closed VasLem closed 3 years ago

VasLem commented 3 years ago
adriana-madi commented 3 years ago

Research Stage Update (with @OlympiaG): Discussion points for 3/05/21

Action for next stand-up:

OlympiaG commented 3 years ago

Research Findings Document - Sprint 1: Research Summary -Sprint 1 (2).pdf

It needs further investigation.

bajo1207 commented 3 years ago

Ok, so, from my understanding this is how the prediction is gonna work:

  1. We select x amount of cities from all countries. The cities that we select are based on some metric e.g. biggest city.
  2. With the help of the labeled dataset and unlabeled datasets (Aquastat, education, economics, development) we predict the water security for those cities.
    • This predictor/classifier also considers geographical data
  3. We create a geographical perimeter around these cities for which we can make predictions
  4. For a given lat lng within these perimeters we predict the risk based on clustering to nearby cities
  5. The prediction can optionally be given with a confidence interval
VasLem commented 3 years ago

Regarding my findings, it seems that the proposed way from @adriana-madi and @OlympiaG is fairly straightforward and clean. Another direction to take would be to consider the problem as a graph, with edges weights produced by the distance matrix of the cities and with the features being nodes attributes. To this end, if we have time or we see that the model is not efficient enough, we could try to get into Graph Convolutional Networks in a simliar way as described here and more specifically to our task there

bajo1207 commented 3 years ago

We have agreed on the general structure - lets continue the more detailed model discussion in #29 and #31.