Open tomsail opened 1 year ago
Update on this: the same problem was seen when analyzing results from models and observed data from ioc_cleanup
(for prin
and prin2
stations)
Actions suggested:
create_seaset()
to initiate a first version of the seaset catalog, without dropping any ioc_code. This function would have to be run only once to generate a reference catalog. update_seaset()
to add new stations and attribute them a new unique provider, if not already present in the database.id, dist = get_seaset_id(lat,lon)
that return the N unique seaset ids for the coords entered and vice versa lat, lon = get_coords(id)
that return coordinates for the seaset id returned. Assuming that seaset
is a GeoDataFrame
(or at least a DataFrame with lon
/lat
columns), then we should think carefully which functions are really needed, because some of them are trivial to implement. For example get_coords(id)
is just seaset[seaset.id==id].geometry
or similar,
There is a problem in discarding stations that have same set of lat/lon or too close to each other. Some - not exhaustive - reasons
https://github.com/oceanmodeling/seaset/blob/main/Notebooks/Updated_Catalog.ipynb
I suggest to:
grouped_df
dataframe --> for data retrieval purposes