naobservatory / mgs-pipeline

MIT License
3 stars 1 forks source link

Spurbeck doesn't name the counties #9

Closed jeffkaufman closed 1 year ago

jeffkaufman commented 1 year ago

Spurbeck 2023 doesn't name its counties, but it does (a) say their populations and (b) say they're sites in the Ohio Wastewater Monitoring Network. Maybe this is enough to let us connect county-level prevalence to relative abundance?

https://storymaps.arcgis.com/stories/0fc051c015dc4e8b9a80b76cdf406b24 has a list of sites and the sizes of the populations they serve.

jeffkaufman commented 1 year ago

Wrote a script:

A:
        1103 Summit
       42554 Lucas
       51842 Cuyahoga
       95150 Montgomery
      123000 Hamilton
      159154 Cuyahoga
B:
          37 Trumbull
         500 Lorain
        1000 Erie
        2000 Richland
        6000 Clark
        6500 Muskingum
C:
         554 Lucas
        9842 Cuyahoga
       40897 Summit
       53150 Montgomery
      165000 Hamilton
      180000 Hamilton
D:
         555 Lawrence
        1000 Portage
        1362 Crawford
        1500 Clinton
        2000 Darke
        2169 Belmont
E:
           0 Sandusky
         323 Union
         464 Athens
         697 Fairfield
        2000 Wayne
        3000 Butler
F:
        4060 Franklin
        4817 Franklin
      125846 Cuyahoga
      162000 Hamilton
      286103 Summit
      327554 Lucas
G:
           0 Licking
        1160 Greene
        2500 Muskingum
        3000 Hancock
        7000 Richland
        8850 Montgomery
H:
         183 Franklin
        9060 Franklin
      130846 Cuyahoga
      167000 Hamilton
      291103 Summit
      332554 Lucas
I:
         160 Greene
        1000 Licking
        1500 Muskingum
        4000 Hancock
        6000 Richland
        8037 Trumbull
J:
           0 Montogmerry
         469 Mahoning
        4000 Allen
        5000 Clark
        7000 Lorain
       10000 Erie

All of these are clear except F, but both top possibilities for F are in Franklin County.

This gives us:

site_counties = {
    "A": "Summit",
    "B": "Trumbull",
    "C": "Lucas", 
    "D": "Lawrence",
    "E": "Sandusky",
    "F": "Franklin",
    "G": "Licking",
    "H": "Franklin",
    "I": "Greene",
    "J": "Montogmery",
}
simonleandergrimm commented 1 year ago

Great work! Will thus get state-level numbers for Ohio.

jeffkaufman commented 1 year ago

@simonleandergrimm this means we can use county-level info for Ohio