Open glevava opened 8 years ago
I'm not surprised by this behavior, as the previous use was not to publish multiple versions in bulk but publish each dataset version as they are generated. (so the next one would be a bit later). The obvious solution is to publish each version to postgresql, thredds and solr in succession before publishing the next. If this is too much of a hassle, we'd need to determine if the hessian service can take a specific thredds version as a parameter and publish that, or will it always just stick with the most recent. Then check if the publisher includes that too,
I want to keep this in mind for the new publisher wrt versioning. A requirement is that it should handle the publication of an old version of a dataset if both maps are required, but the new version must be detected so the old can be flagged latest=false
We haven't considered the use case of multiple datasets per mapfile for bulk publishing, but that is likely going to be a workflow issue rather than a component.
It appears that concatenating several mapfile of different versions of the same dataset leads to a "silent failed" publication on indexnode.
For instance, here are two CORDEX mapfiles:
We concatenate both mapfiles to publish in a bulk. The
sort
ensures that the versions are chronologically published as enforced by the new publisher (ESGF 2.x):Publication on datanode goes right for both versions without any warnings or error:
Both XML catalogs are generated and files and aggregations are accessible through all protocols (OpenDAP, HTTP).
Publication on index node goes right without any warnings or error:
Nevertheless, only the latest version (v20150515 in our case) is published on indexnode. There are two publication instance for the same dataset but it seems that the new publisher only takes into account the latest version for each publication instance on the indexnode.
Is this new behavior expected by the new publisher (included in ESGF 2.x) or not?