ebmdatalab / global-trial-landscape


Get lat/long from country/city #18

Open ccunningham101 opened 11 months ago

ccunningham101 commented 11 months ago

Industry sites appear under a generic site name, such as:

169,NCT05943990,GSK Investigational Site,New Haven,United States,ctgov
169,NCT05943990,GSK Investigational Site,Jacksonville,United States,ctgov
169,NCT05943990,GSK Investigational Site,Tampa,United States,ctgov
169,NCT05943990,GSK Investigational Site,Atlanta,United States,ctgov
169,NCT05943990,GSK Investigational Site,Westwood,United States,ctgov
169,NCT05943990,GSK Investigational Site,Lexington,United States,ctgov
169,NCT05943990,GSK Investigational Site,Baltimore,United States,ctgov
169,NCT05943990,GSK Investigational Site,Saint Louis,United States,ctgov
169,NCT05943990,GSK Investigational Site,New York,United States,ctgov
169,NCT05943990,GSK Investigational Site,New York,United States,ctgov
169,NCT05943990,GSK Investigational Site,Philadelphia,United States,ctgov
169,NCT05943990,GSK Investigational Site,Houston,United States,ctgov

In ROR, GSK is registered at one place per country (https://ror.org/search?query=GSK). So if we ignore the city for industry sites, the site will resolve with ROR, but the lat/long from that ROR record will be wrong for the actual site.
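
To make the trade-off concrete, here is a minimal sketch using a few of the rows quoted above. The column names in `FIELDS` are assumptions (the quoted rows have no header), and `org_keys` / `city_keys` are hypothetical names used only for illustration.

```python
import csv
import io

# A few of the rows quoted above. The source rows carry no header,
# so the field names below are assumptions.
ROWS = """169,NCT05943990,GSK Investigational Site,New Haven,United States,ctgov
169,NCT05943990,GSK Investigational Site,Tampa,United States,ctgov
169,NCT05943990,GSK Investigational Site,New York,United States,ctgov
169,NCT05943990,GSK Investigational Site,New York,United States,ctgov
"""
FIELDS = ["row_id", "trial_id", "site_name", "city", "country", "source"]

sites = list(csv.DictReader(io.StringIO(ROWS), fieldnames=FIELDS))

# Key with the city ignored: every row collapses to one entry per country,
# which is what matching against a per-country ROR record amounts to.
org_keys = {(s["site_name"], s["country"]) for s in sites}

# Key that keeps the city: distinct locations stay distinct.
city_keys = {(s["site_name"], s["city"], s["country"]) for s in sites}

print(len(org_keys), len(city_keys))
```

Dropping the city makes the match easy but leaves a single location for all of these sites, which is why the lat/long would have to come from somewhere other than ROR.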

We might need lat/long for plotting (e.g. with Mapbox).

Options:

  1. Use an external dataset for lat/long, e.g. https://simplemaps.com/data/world-cities, https://public.opendatasoft.com/explore/dataset/geonames-all-cities-with-a-population-1000/table/?disjunctive.cou_name_en&sort=name, https://www.kaggle.com/datasets/juanmah/world-cities
  2. Check whether ctgov already has site lat/long data.

ccunningham101 commented 11 months ago

We may or may not need lat/long. Geopandas can use the country data (what can geopandas do with city or lat/long data?). So the plan for now is probably to use open source city/country mappings.
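
A rough sketch of what a city/country mapping could look like, using only the standard library. The lookup rows, the column names (`city`, `country`, `lat`, `lng`) and the helper names `load_city_lookup` and `geocode_site` are all assumptions for illustration; the real datasets may use different headers, and the coordinates here are approximate. One thing it highlights: a city/country key is not always unique (there is more than one Lexington in the United States), so the sketch returns `None` rather than guessing.

```python
import csv
import io

# Hypothetical lookup rows in the style of a world-cities dataset.
# Coordinates are approximate and for illustration only.
CITY_LOOKUP_CSV = """city,country,lat,lng
New Haven,United States,41.31,-72.92
Houston,United States,29.76,-95.37
Lexington,United States,38.04,-84.50
Lexington,United States,42.45,-71.23
"""


def load_city_lookup(text):
    """Map a normalised (city, country) key to a list of (lat, lng) candidates."""
    lookup = {}
    for row in csv.DictReader(io.StringIO(text)):
        key = (row["city"].strip().casefold(), row["country"].strip().casefold())
        lookup.setdefault(key, []).append((float(row["lat"]), float(row["lng"])))
    return lookup


def geocode_site(city, country, lookup):
    """Return (lat, lng) only when exactly one candidate matches, else None."""
    key = (city.strip().casefold(), country.strip().casefold())
    candidates = lookup.get(key, [])
    return candidates[0] if len(candidates) == 1 else None


lookup = load_city_lookup(CITY_LOOKUP_CSV)
print(geocode_site("New Haven", "United States", lookup))
print(geocode_site("Lexington", "United States", lookup))  # ambiguous city name
```

Ambiguous or missing cities would need an extra field (state or region) or a fallback to country-level plotting, neither of which the site rows quoted in this issue provide.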