DFO-Ocean-Navigator / Ocean-Data-Map-Project

The Ocean Navigator is an online tool that is used to help visualise scientific research data. a users guide is available at https://dfo-ocean-navigator.github.io/Ocean-Navigator-Manual/ and the tool is live at
http://navigator.oceansdata.ca
GNU General Public License v3.0
49 stars 20 forks source link

Unable to access CLASS4 data after 2019 and only CONCEPTS is viewable #785

Closed dwayne-hart closed 1 year ago

dwayne-hart commented 3 years ago

At this time any of the 2020 CLASS4 data files are not able to be viewed in the Ocean Navigator. Any of the CONCEPTS datasets from January 2016 to December 2019 are accessible but do not show any of the other models.

douglatornell commented 3 years ago

PR #797 fixes the "models other than CONCEPTS not viewable" part of this issue.

I'm looking into getting the indexing of class 4 files to be up to date to address the other part.

douglatornell commented 3 years ago

The the "no class 4 datasets after 2019" part is probably due to issue #757.

jmunroe commented 3 years ago

I am looking at what we really need when generating our list of CLASS4 files. Here's what we are currently doing:

def list_class4_files(class4_path):
    files = {f for f in Path(class4_path).glob("**/*GIOPS*profile.nc")}
    result = [
        {
            "name": datetime.datetime.strptime(class4_id.split("_")[1], "%Y%m%d").strftime("%Y-%m-%d"),
            "id": class4_id
        }
        for class4_id in sorted((f.stem for f in files), reverse=True)
    ]
    return result

Even with all the good changes from #809, it looks like we are still only considering CLASS4 files for the GIOPS model.

For reference, our CLASS4 dataset (/data/class4) is 4.9 TB over about 128175 files. I think that explain why that initial glob is so slow. But are we really just interested in files with a basename of the form _profile.nc -- there are ~13000 of those.

jmunroe commented 3 years ago

CLASS4 files seem to be a recurring issue with the ocean navigator. For example, here are current open issues that relate to CLASS4 files:

It doesn't look like we have ever documented with these issues exactly what are those CLASS4 files and why do we care? The context for the problem is weather/ocean forecast verification between different ocean models and observations. From @FraserO 's presentation at a GODAE OceanView workshop in 2015 we learn

Metrics for Physical Oceanography • Class 1: Common variables on same horizontal grid • Class 2: Results interpolated to common vertical transects or virtual moorings • Class 3 Derived quantities: MeridionalOverturning SF, net heat flux • Class 4 Forecast/Analysis fields interpolated to observations

It looks like the OceanNavigator currently only supports "in situ profile T/S" but there are class4 files also available for other variables such as SST satellite, SST drifters, SSH Altimetry, and Ice Concentration (AMSR).