MetOffice / XBTs_classification

Project for the classification of eXpendable Bathy Thermographs
BSD 3-Clause "New" or "Revised" License

Preprocessing scripts for non-WOD data sources #53

Open stevehadd opened 4 years ago

stevehadd commented 4 years ago

Following discussions with Rachel, we've confirmed that the input netCDF files that form the start of the current pipeline and feed into preprocessing/xbt_extract_year.py are based on data downloaded from WOD. The EN4 dataset uses other data sources, which are formatted differently. The current definition of the input interface to the ML pipeline, yearly CSV files, is a useful one, so rather than change it, we should update the preprocessing functionality to take in different formats and output yearly CSV files. Among the tasks will be