metno / metsis-drupal

contains METNO METSIS software
GNU General Public License v3.0
0 stars 0 forks source link

Handling of harvested records with multiple data_access elements #19

Open steingod opened 1 year ago

steingod commented 1 year ago

@ferrighi and @magnarem I am struggling with how to handle information that is harvested from a number of sources, including CMEMS and NMDC. These providers expose discovery metadata with multiple data_access elements. I.e. the record is not a strictly dataset element but almost a data collection element. In our system we would have used parent/child relations for this, but these providers do not have the concept and this complicates how to present result in the our portal solution, not to speak about how to work with the basket. Currently we have an understanding that there is one data_access per access type (e.g. direct file, OPeNDAP, WMS etc), but these typically expose a list of files with direct download elements. E.g. IMR may have multiple FTP links for different files to download, but no information on the content in each file. And CMEMS expose multiple WMS and OPeNDAP end points for different variables or products in the same discovery metadata record.

Is it possible for us to handle this somehow and if so how should we address it? Do we need modifications to MMD? Any ideas?

magnarem commented 1 year ago

In solr we support multiple urls in the dataaccess_url fields, so It is possible to add multiple http,opendap,etc links to those fields. They should also be listed in the search results and metadata details. however it will be one download button for each link in the list.

steingod commented 1 year ago

Sounds good. How would that go with visualisation and basket interaction?

magnarem commented 11 months ago

Will investigate and find some solution for this. Do we have some example datasets? Or I will create some test mmd for this.

steingod commented 11 months ago

Not from the top of my mind, but as mentioned several records harvested from NMDC, should be in .../storeB/project/fou/fd/project/adc/mdharvest/imr/mmd

ferrighi commented 11 months ago

I agree with Magnar, that by design having a multivalue for the same access type is making this already possible. If you do not have more info about that specific URL, we could maybe think to have the "Download dataset" as a expandable widget? showing the different urls or file names. For the basket I would assume that in the first round all urls should go into it.

steingod commented 11 months ago

The issue is that it is hard to inform the user on how to behave. An expandable widget could be an option. In the basket I am not user how we can make useful,, except for download of multiple files which would work. Probably next step would be to establish test based on NPI and NMDC records.

magnarem commented 11 months ago

It is possible to implement some pop-up download widget if the multi-fields have more than one entry. This could also provide the possibility to add those multiple datasets to the basket.