artc-dsc / Tasks

For recording purpose.
0 stars 0 forks source link

Create a centralized folder for all data sources #25

Closed ARTC-Doris closed 1 month ago

ARTC-Doris commented 1 month ago

Goal: To have a centralized folder for all open-source dataset (could be used for benchmark model performance)

Format description

  1. dataset folder at DataSet
  2. All datasets to provide master information at Dataset_compiled.xlsx
  3. files includes at least:
    • dataset download links & raw file (if size not big),
    • processed files (after cleaning, mapping to the template and resampling to certain frequency)
      formatted in "processedxxx_.csv"
    • stored in individual folders for each source data_source
ZengyuCao-ARTC commented 1 month ago

Uploaded 2 datasets:

@ARTC-Doris @namtuanle Please follow this guide to add new data source. Thanks~