Open wildintellect opened 11 months ago
@mccabete as discussed, we're going to ingest the HourlyFWIFromHourlyInterpContinuous data from s3://veda-data-store-staging/EIS/other/station-FWI/20000101.20221021.hrlyInterp/FWI/.
The existing csv files don't have any geospatial data relating to the station ID, so I got that data from s3://veda-data-store-staging/EIS/other/station-FWI/20000101.NRT.nStns.1/isd-history.csv. I also had to concatenate some columns to create a datetime column.
The transformed csv files are saved in: s3://veda-data-store-staging/EIS/other/station-FWI/20000101.20221021.hrlyInterp/FWI-transformed/
Please have a look through a sample of those files and let me know if they are ok. Once I get the go-ahead, we can start the ingestion process.
Here is the transformation script for the FWI dataset that will be loaded to the features API: https://github.com/NASA-IMPACT/veda-data/blob/main/transformation-scripts/fwi-stations/FWI_Transformation.ipynb
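For readers who don't want to open the notebook, here is a rough, stdlib-only sketch of the kind of transformation described above: look up each station's coordinates in a station-history table and collapse separate date/time columns into one datetime column. It is not the notebook's code. The column names (station_id, YYYY, MM, DD, HH, LAT, LON, FWI) and the sample values are assumptions for illustration and would need to be matched to the real files.

```python
import csv
import io
from datetime import datetime

# Hypothetical column names; adjust to the real FWI and isd-history headers.
DATE_PARTS = ("YYYY", "MM", "DD", "HH")


def load_station_coords(history_csv, id_field="station_id"):
    """Build a station ID -> (lat, lon) lookup from a station-history CSV."""
    coords = {}
    for row in csv.DictReader(history_csv):
        coords[row[id_field]] = (float(row["LAT"]), float(row["LON"]))
    return coords


def transform_rows(fwi_csv, coords, id_field="station_id"):
    """Yield FWI rows with a single datetime column plus lat/lon.

    Rows whose station has no known coordinates are skipped.
    """
    for row in csv.DictReader(fwi_csv):
        station = row[id_field]
        if station not in coords:
            continue
        lat, lon = coords[station]
        # Concatenate the separate date/time columns into one timestamp.
        stamp = datetime(*(int(row[part]) for part in DATE_PARTS))
        out = {k: v for k, v in row.items() if k not in DATE_PARTS}
        out.update(datetime=stamp.isoformat(), lat=lat, lon=lon)
        yield out


# Tiny made-up sample, only to show the shape of the input and output.
history = io.StringIO("station_id,LAT,LON\n720538,39.1,-105.2\n")
fwi = io.StringIO(
    "station_id,YYYY,MM,DD,HH,FWI\n"
    "720538,2000,1,1,13,4.2\n"
    "999999,2000,1,1,13,1.0\n"
)
rows = list(transform_rows(fwi, load_station_coords(history)))
```

With the sample above, only the first FWI row survives, because the second station ID is missing from the history table. Whether unmatched stations should be dropped or kept without coordinates is a choice to confirm against the notebook.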
Hey it looks good @smohiudd. Sorry, I realized belatedly that I did in fact know that the files didn't have lat and lon and that the datetimes are nonstandard -- and have had to solve those issues in my own work. So sorry about the lack of heads up.
@mccabete we have most of the data in the eis_fire_fwi collection: https://firenrt.delta-backend.com/collections/public.eis_fire_fwi
Please try it out and let me know if you have any feedback.
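One way to try the collection from a script is sketched below. It assumes the service follows the OGC API Features convention of an /items endpoint under the collection URL, with the standard limit, bbox and datetime query parameters; that has not been confirmed in this thread. The helper names items_url and fetch_items are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

COLLECTION_URL = "https://firenrt.delta-backend.com/collections/public.eis_fire_fwi"


def items_url(limit=10, bbox=None, datetime_range=None):
    """Build an /items query URL (assumes OGC API Features conventions)."""
    params = {"limit": limit}
    if bbox is not None:
        # bbox is (min_lon, min_lat, max_lon, max_lat)
        params["bbox"] = ",".join(str(v) for v in bbox)
    if datetime_range is not None:
        # e.g. "2020-08-01T00:00:00Z/2020-08-02T00:00:00Z"
        params["datetime"] = datetime_range
    return f"{COLLECTION_URL}/items?{urlencode(params)}"


def fetch_items(url, timeout=30):
    """Fetch and decode the JSON response (performs a network request)."""
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Build a query for a few features inside a rough Western US bounding box.
url = items_url(limit=5, bbox=(-125, 32, -102, 49))
# features = fetch_items(url)["features"]  # uncomment to hit the API
```

The network call is left commented out so the snippet can be read and run offline. The "features" key is what a GeoJSON FeatureCollection response would contain, if the API returns one.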
Unfortunately the airflow ECS container timed out at 2 hours. I'll need to increase it to make sure all the data is ingested.
@smohiudd This is great! I will let you know when I take it for a spin.
Contact Details
tmccabe@umd.edu
URL/DOI
No response
Data License
No response
Data Location
s3://veda-data-store-staging/EIS/other/station-FWI/20000101.20221021.hrlyInterp/
Size Estimate
No response
Number of Items
No response
Description
Daily and hourly station-based Fire Weather Index calculations over the Western US. The main source of station data is the NOAA NCEI Integrated Surface Database (ISD).
Collection Creation Notebook
No response
Item Creation Notebook
No response
Checklist
rio cogeo validate
Any additional info you think is relevant, possibly including spatial or temporal subset if applicable?
The first of several Vector (Point) datasets in CSV format.
As determined in discussions on https://github.com/MAAP-Project/Community/issues/767
To Do