CanadianClimateDataPortal / Canadian-Climate-Data-Portal

Design ZIP and/or streaming data servers and clients #57

Open tomLandry opened 5 years ago

tomLandry commented 5 years ago

Following the AWG discussion linked below, and now that large subsetted downloads are expected to produce several files, the group needs to decide how to serve and consume metalinks: https://ccdpwiki.atlassian.net/wiki/spaces/CCDP/pages/64520247/Architecture?focusedCommentId=64618644#comment-64618644

This issue is linked to https://github.com/CanadianClimateDataPortal/Canadian-Climate-Data-Portal/issues/56 as we need the UI/UX perspective along the way.

tomLandry commented 5 years ago

Progress on the metalink feature, including the collaboration with DKRZ, is reported here: https://github.com/geopython/pywps/issues/298#issuecomment-472751682 Note that the priority of this task is still medium, but it will become a blocker on April 1st, so expect a furious return to the drawing board.

tomLandry commented 5 years ago

Hello @huard! Could you take 5 minutes to describe recent progress on Metalink and ZIP support, and how it relates to this issue? Thanks.

huard commented 5 years ago

We have a couple of branches and PRs that aim to improve the user experience for WPS outputs that store multiple files under the same identifier. For example, if you are computing an indicator on an ensemble of climate simulations, the output is going to be a list of files. At the moment, the only ways to send that back to the user are to zip these files and provide a reference to the zip file, or to create a text file listing the references.
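To make the two current workarounds concrete, here is a minimal Python sketch using only the standard library. It is not PyWPS code: the helper names `bundle_as_zip` and `bundle_as_url_list` are hypothetical, and the file names and URLs a real process would use are assumptions.

```python
import zipfile
from pathlib import Path


def bundle_as_zip(paths, zip_path):
    """Hypothetical helper: pack every output file into a single ZIP archive."""
    zip_path = Path(zip_path)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for p in map(Path, paths):
            # Store each file under its base name so the archive stays flat.
            zf.write(p, arcname=p.name)
    return zip_path


def bundle_as_url_list(urls, txt_path):
    """Hypothetical helper: write one output reference (URL) per line."""
    txt_path = Path(txt_path)
    txt_path.write_text("\n".join(urls) + "\n", encoding="utf-8")
    return txt_path
```

In both cases the WPS response carries a single reference, either to the archive or to the text file, and the client has to unpack or parse it to reach the individual files.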

On the PyWPS side:

On the birdy side:

The metalink format opens up a couple of nice possibilities, such as distributing files as torrents and including a checksum for each file. I suggest looking into the torrent option as a way to distribute files to users.
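As an illustration of the checksum possibility, here is a minimal sketch that builds a Metalink 4 (RFC 5854) document for a list of output files with the Python standard library. It is not the PyWPS or birdy implementation: the function name `build_metalink` and the input tuple layout are assumptions made for this example.

```python
import hashlib
import xml.etree.ElementTree as ET

# XML namespace defined by RFC 5854 (Metalink 4).
METALINK_NS = "urn:ietf:params:xml:ns:metalink"


def build_metalink(files):
    """Hypothetical helper.

    `files` is an iterable of (name, url, content_bytes) tuples.
    Returns the metalink document as a string.
    """
    ET.register_namespace("", METALINK_NS)  # serialize with a default namespace
    root = ET.Element(f"{{{METALINK_NS}}}metalink")
    for name, url, data in files:
        entry = ET.SubElement(root, f"{{{METALINK_NS}}}file", {"name": name})
        ET.SubElement(entry, f"{{{METALINK_NS}}}size").text = str(len(data))
        # The checksum lets the client verify each downloaded file.
        digest = ET.SubElement(entry, f"{{{METALINK_NS}}}hash", {"type": "sha-256"})
        digest.text = hashlib.sha256(data).hexdigest()
        ET.SubElement(entry, f"{{{METALINK_NS}}}url").text = url
    return ET.tostring(root, encoding="unicode")
```

A client would parse this document, download each `url`, and compare the SHA-256 digest against the `hash` element. RFC 5854 also defines a `metaurl` element with `mediatype="torrent"`, which is where a torrent reference would go if that option is pursued.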