BigDataWUR / AgML-CY-Bench

CY-Bench (Crop Yield Benchmark) is a comprehensive dataset and benchmark to forecast crop yields at subnational level. CY-Bench standardizes selection, processing and spatio-temporal harmonization of public subnational yield statistics with relevant predictors. Contributors include agronomers, climate scientists and machine learning researchers.
https://cybench.agml.org/
Other
17 stars 8 forks source link

Validate names of columns and indices #100

Open krsnapaudel opened 8 months ago

krsnapaudel commented 8 months ago

Names of columns and indices must follow data format.

ellaampy commented 3 months ago

@mzachow For Argentina and Brazil, please revise the column name from "harvested_area" to "harvest_area" in the data preparation script.

ellaampy commented 2 months ago

for maize database

for wheat database

General comment on yield csvs. Perhaps doesnt need fixing all countries have standard column names in the yield csvs['crop_name','country_code','adm_id','harvest_year','yield']. A large portion of countries includes extra columns such as harvest_area and production.

krsnapaudel commented 1 month ago

@ellampy Looks like these issues are fixed except for the extra columns in yield. We will leave them there. Thanks for checking. You can close this after checking the updated dataset.